Lightnews — Scholar-powered news

Prasanna Mayilvahanan

@prasannamayil.bsky.social

26 followers 35 following 8 posts

PhD student in ML at MPI-IS. Prev Apple.

Interested in robustness at scale and reasoning.

Posts Replies Media Videos

Reposted by Prasanna Mayilvahanan

Andreas Hochlehnert

@ahochlehnert.bsky.social

CuratedThoughts: Data Curation for RL Datasets 🚀

Since DeepSeek-R1 introduced reasoning-based RL, datasets like Open-R1 & OpenThoughts emerged for fine-tuning & GRPO. Our deep dive found major flaws — 25% of OpenThoughts needed elimination by data curation.

Here's why 👇🧵

February 17, 2025 at 6:22 PM

Prasanna Mayilvahanan

@prasannamayil.bsky.social

New preprint out! 🎉

How does LLM training loss translate to downstream performance?

We show that pretraining data and tokenizer shape loss-to-loss scaling, while architecture and other factors play a surprisingly minor role!
brendel-group.github.io/llm-line/ 🧵1/8

February 18, 2025 at 2:09 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news