Check out the top 10 papers for the week👇
- Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Check out the top 10 papers for the week👇
- Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
Check out the top 10 papers for the week👇
- Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers
Check out the top 10 papers for the week👇
- Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers
Check out the top 10 papers for the week👇
- Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures
Check out the top 10 papers for the week👇
- Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures
Check out the top 10 papers for the week👇
- From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review
Check out the top 10 papers for the week👇
- From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review
Check out the top 10 papers for the week👇
- TTRL: Test-Time Reinforcement Learning
Check out the top 10 papers for the week👇
- TTRL: Test-Time Reinforcement Learning
Check out the top 10 papers for the week👇
- Reasoning Models Can Be Effective Without Thinking
Check out the top 10 papers for the week👇
- Reasoning Models Can Be Effective Without Thinking
Check out the top 10 papers for the week👇
- Hogwild! Inference: Parallel LLM Generation via Concurrent Attention
Check out the top 10 papers for the week👇
- Hogwild! Inference: Parallel LLM Generation via Concurrent Attention
Ask questions like 'What are the latest breakthroughs in RL fine-tuning?' and get comprehensive lit reviews with trending papers automatically included
Turn hours of literature searches into seconds with AI-powered research context ⚡
Ask questions like 'What are the latest breakthroughs in RL fine-tuning?' and get comprehensive lit reviews with trending papers automatically included
Turn hours of literature searches into seconds with AI-powered research context ⚡
Highlight any section of a paper to ask questions and “@” other papers for quick context, comparisons, and benchmark references
Highlight any section of a paper to ask questions and “@” other papers for quick context, comparisons, and benchmark references
Check out the top 10 papers for the week👇
- What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models
Check out the top 10 papers for the week👇
- What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models
Baichuan introduces ReSearch, an RL framework that teaches LLMs to reason with search from scratch
Outperforms RAG baselines
No supervised data on reasoning steps
Simple & generalizable
Trending #1 on alphaXiv 📈
Baichuan introduces ReSearch, an RL framework that teaches LLMs to reason with search from scratch
Outperforms RAG baselines
No supervised data on reasoning steps
Simple & generalizable
Trending #1 on alphaXiv 📈
Check out the top 10 papers for the week👇
- Reasoning to Learn from Latent Thoughts
Check out the top 10 papers for the week👇
- Reasoning to Learn from Latent Thoughts
- DAPO: An Open-Source LLM Reinforcement Learning System at Scale
- DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Generate beautiful research blogs with figures, key insights, and clear explanations from the paper with just one click
Understand papers in minutes - not hours
Generate beautiful research blogs with figures, key insights, and clear explanations from the paper with just one click
Understand papers in minutes - not hours
arXivLabs is how researchers launch new features on arXiv. Working together to make research better - all your faves are arXivLabs: @hf.co @alphaxiv.org @litmaps.com & more.
https://givingday.cornell.edu/campaigns/arxiv
- Nature-Inspired Population-Based Evolution of Large Language Models
- Nature-Inspired Population-Based Evolution of Large Language Models
- Towards an AI co-scientist
- Towards an AI co-scientist
🚀From teaching themselves to predict the future to solving strategic social deduction, AI this week is discovering the hidden geometry of prompts, scaling reasoning, and rethinking what’s possible with less.🚨
🚀From teaching themselves to predict the future to solving strategic social deduction, AI this week is discovering the hidden geometry of prompts, scaling reasoning, and rethinking what’s possible with less.🚨
2016: AlphaGo masters the game of Go
2025: Stanford researchers crack Among Us
Trending on alphaXiv 📈
Remarkable new work trains LLMs to master strategic social deduction through multi-agent RL, doubling win rates over standard RL.
2016: AlphaGo masters the game of Go
2025: Stanford researchers crack Among Us
Trending on alphaXiv 📈
Remarkable new work trains LLMs to master strategic social deduction through multi-agent RL, doubling win rates over standard RL.
Now you can instantly filter to see what's trending in each area 🚨
Now you can instantly filter to see what's trending in each area 🚨
🚀This week, AI is leveling up—from solving Olympiad geometry with AlphaGeometry2 to generating motion with VideoJAM, while also mastering the art of reasoning and adversarial resilience with tools like DeepRAG and LIMO. 🚨
🚀This week, AI is leveling up—from solving Olympiad geometry with AlphaGeometry2 to generating motion with VideoJAM, while also mastering the art of reasoning and adversarial resilience with tools like DeepRAG and LIMO. 🚨
Fancy Research Profile - michelle.alphaxiv.io
Comment-able papers - www.alphaxiv.org/profile/6793...
Thanks to @ml-collective.bsky.social for sharing this platform!
Fancy Research Profile - michelle.alphaxiv.io
Comment-able papers - www.alphaxiv.org/profile/6793...
Thanks to @ml-collective.bsky.social for sharing this platform!