✅Filtering out easy samples—i.e., those solved by a 7B model—yields a +2.15% accuracy gain when training a 32B model.
✅Harder questions push the model to learn deeper reasoning patterns.
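A minimal sketch of this kind of difficulty filter, under the assumption that we can query the smaller model and verify its answer (`toy_small_model` and the field names are hypothetical, not the paper's implementation):

```python
def filter_hard_samples(samples, small_model_solves):
    """Keep only samples the smaller reference model fails to solve.

    small_model_solves(question, answer) -> bool is a hypothetical
    callable that runs the small model and checks its final answer.
    """
    return [s for s in samples if not small_model_solves(s["question"], s["answer"])]

# Toy usage with a stub "7B model" that only solves single-digit addition:
samples = [
    {"question": "1+1", "answer": "2"},
    {"question": "integrate x^2 from 0 to 3", "answer": "9"},
]

def toy_small_model(question, answer):
    # Stub: pretend the small model only handles tiny addition problems.
    return "+" in question and len(question) <= 3

hard = filter_hard_samples(samples, toy_small_model)
# Only the harder integration question survives the filter.
```

The surviving `hard` set is what the larger model then trains on.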
➣ Open-ended questions boost accuracy (+1.21%) by forcing models to reason, not guess!
➣ Short-form answers reduce ambiguity & avoid noisy rewards, boosting accuracy by +1.20%!
👉 Thoughtful templates = clearer supervision, better RL
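A rough sketch of what such templating could look like: rendering one QA pair as either a multiple-choice prompt or an open-ended prompt with a short-form gold answer (function and field names are illustrative assumptions, not the paper's code):

```python
import random

def to_multiple_choice(question, answer, distractors, seed=0):
    """Render a QA pair as a multiple-choice prompt with shuffled options."""
    rng = random.Random(seed)
    options = distractors + [answer]
    rng.shuffle(options)
    letters = "ABCD"
    lines = [question] + [f"{letters[i]}. {opt}" for i, opt in enumerate(options)]
    gold = letters[options.index(answer)]  # letter of the correct option
    return "\n".join(lines), gold

def to_open_ended(question, answer):
    """Render as open-ended; a short-form answer keeps reward checking simple."""
    prompt = f"{question}\nAnswer with a short final answer."
    return prompt, answer

prompt_mc, gold = to_multiple_choice(
    "Which planet is largest?", "Jupiter", ["Mars", "Venus", "Earth"]
)
```

Either rendering gives the RL reward function an unambiguous target to check against.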
The model adapts its response length to the task:
➣ concise on general reasoning (229 tokens on MMLU) and
➣ detailed on math (+62% token increase)
Math-only models barely adapt (12–14% token increase).
Nemotron-CrossThink achieves:
📈 +30.1% on MATH-500, +15.1% on AGIEVAL, +12.8% on MMLU-Pro compared to base LLM
📉 28% fewer tokens per correct answer
🏆 Outperforms math-only blends by training on broader, more diverse reasoning data
➣Curate QA pairs from Common Crawl + open datasets
➣Apply structured templates: multiple-choice + open-ended
➣Filter out unverifiable / ambiguous samples
➣Train LLM with GRPO—a scalable RL algorithm
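The GRPO step above relies on group-relative advantages: sample several answers per question, score each with a verifiable reward, and normalize rewards within the group. A minimal numeric sketch of that normalization (not NVIDIA's implementation):

```python
def group_relative_advantages(rewards, eps=1e-6):
    """GRPO-style advantages: normalize rewards across a group of
    responses sampled for the same prompt (mean 0, ~unit variance)."""
    mean = sum(rewards) / len(rewards)
    var = sum((r - mean) ** 2 for r in rewards) / len(rewards)
    std = var ** 0.5
    return [(r - mean) / (std + eps) for r in rewards]

# Four sampled answers to one question; reward 1.0 if verifiably correct.
advs = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
# Correct answers get positive advantage, incorrect ones negative.
```

Because advantages are computed relative to the group mean, no separate value network is needed, which is part of what makes GRPO scale well.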
Meet Nemotron-CrossThink—a method to scale RL-based self-learning across law, physics, social science & more.
🔥Resulting in a model that reasons broadly, adapts dynamically, & uses 28% fewer tokens for correct answers!
🧵↓