thedukeliberty.bsky.social
Reposted
Hugging Face put up a similar dataset (only 1 million rows), but they unfortunately removed it :(

This is an attempt to undo that injustice and provide double the amount of data. The data gathering process is fully legal, as per the Bluesky Terms of Service.

huggingface.co/datasets/alp...
alpindale/two-million-bluesky-posts · Datasets at Hugging Face
November 27, 2024 at 7:13 PM
Reposted
Releasing: a dataset of two million Bluesky posts.

This dataset has been collected using Bluesky's API, and I hope it will be useful for all the researchers out there!
November 27, 2024 at 7:13 PM