patternmatching.bsky.social
@patternmatching.bsky.social
Reposted
I actually don't mind that people want to have control of their data. I do mind that they are ignorant reactionary humanists with a knee-jerk hatred of technology who call themselves progressive.

We could both get what we want, if they would simply keep their data to themselves
November 28, 2024 at 3:19 PM
what people really arent getting is that you are being farmed by Bluesky, the api PROVIDER! It doesnt matter that some guy from huggingface collected the data, it was already farmed and there for the taking, and dozens have already done the same privately.
November 28, 2024 at 2:11 PM
Reposted
A librarian that previously worked at the British Library created a relatively small dataset of bsky posts, hundreds of times smaller than previous researchers, to help folks create toxicity filters and stuff.

So people bullied him & posted death threats.

He took it down.

Nice one, folks.
November 28, 2024 at 5:33 AM
the reactions to this are insane, do people seriously not realize this is from a publicly accessible API ??? that anyone else can access at any time? your data was never private
First dataset for the new @huggingface.bsky.social @bsky.app community organisation: one-million-bluesky-posts 🦋

📊 1M public posts from Bluesky's firehose API
🔍 Includes text, metadata, and language predictions
🔬 Perfect to experiment with using ML for Bluesky 🤗

huggingface.co/datasets/blu...
bluesky-community/one-million-bluesky-posts · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
November 27, 2024 at 6:32 PM