theallgolden.bsky.social
@theallgolden.bsky.social
canceled
September 18, 2025 at 7:25 PM
fwiw this is acyclical but I'm sure there is tarpit detection in most major crawlers. the goals I think are (1) waste OpenAI resources, and (2) get them to blackhole your site from their index -- seems like (1) probably won't work but (2) maybe will?
January 26, 2025 at 10:29 PM
thoughts on this thread? bsky.app/profile/cfie...
Hi, so I've spent the past almost-decade studying research uses of public social media data, like e.g. ML researchers using content from Twitter, Reddit, and Mastodon.

Anyway, buckle up this is about to be a VERY long thread with lots of thoughts and links to papers. 🧵
First dataset for the new @huggingface.bsky.social @bsky.app community organisation: one-million-bluesky-posts 🦋

📊 1M public posts from Bluesky's firehose API
🔍 Includes text, metadata, and language predictions
🔬 Perfect to experiment with using ML for Bluesky 🤗

huggingface.co/datasets/blu...
November 27, 2024 at 10:55 PM
took this at the museum of natural history 2 weeks ago
October 3, 2023 at 3:37 PM
cocktail royale
punchdrink.com
May 9, 2023 at 4:56 PM