Anikait Singh
banner
asap7772.bsky.social
Anikait Singh
@asap7772.bsky.social
PhD Student @StanfordAILab @stanfordnlp.bsky.social, Previously SR @GoogleDeepMind.bsky.social, Undergraduate @Berkeley_AI
🚨🚨New Paper: Training LLMs to Discover Abstractions for Solving Reasoning Problems

Introducing RLAD, a two-player RL framework for LLMs to discover 'reasoning abstractions'—natural language hints that encode procedural knowledge for structured exploration in reasoning.🧵⬇️
October 3, 2025 at 7:33 PM
Reposted by Anikait Singh
1/13 New Paper!! We try to understand why some LMs self-improve their reasoning while others hit a wall. The key? Cognitive behaviors! Read our paper on how the right cognitive behaviors can make all the difference in a model's ability to improve with RL! 🧵
March 4, 2025 at 6:15 PM