colinacherry.bsky.social
@colinacherry.bsky.social
Reposted
😼SMOL DATA ALERT! 😼Anouncing SMOL, a professionally-translated dataset for 115 very low-resource languages! Paper: arxiv.org/pdf/2502.12301
Huggingface: huggingface.co/datasets/goo...
February 19, 2025 at 5:36 PM