Lightnews — Scholar-powered news

Anikait Singh

@asap7772.bsky.social

PhD Student @StanfordAILab @stanfordnlp.bsky.social, Previously SR @GoogleDeepMind.bsky.social, Undergraduate @Berkeley_AI

Posts Replies Media Videos

Anikait Singh

@asap7772.bsky.social

🚨🚨New Paper: Training LLMs to Discover Abstractions for Solving Reasoning Problems

Introducing RLAD, a two-player RL framework for LLMs to discover 'reasoning abstractions'—natural language hints that encode procedural knowledge for structured exploration in reasoning.🧵⬇️

October 3, 2025 at 7:33 PM

Reposted by Anikait Singh

Kanishk Gandhi

@gandhikanishk.bsky.social

1/13 New Paper!! We try to understand why some LMs self-improve their reasoning while others hit a wall. The key? Cognitive behaviors! Read our paper on how the right cognitive behaviors can make all the difference in a model's ability to improve with RL! 🧵

March 4, 2025 at 6:15 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news