Writes http://interconnects.ai
At Ai2 via HuggingFace, Berkeley, and normal places
rlhfbook.com
natolambert.com/slides
natolambert.com/slides
My answer is helping perform a major polish for the rlhfbook (really nice editor), and some fancy website automations.
My answer is helping perform a major polish for the rlhfbook (really nice editor), and some fancy website automations.
Happy holidays all!
Happy holidays all!
A short reflection on what happens when your two long-term areas of focus — open models and RL — become the centerpiece of the AI boom.
Thx for your support!
www.interconnects.ai/p/2025-inter...
A short reflection on what happens when your two long-term areas of focus — open models and RL — become the centerpiece of the AI boom.
Thx for your support!
www.interconnects.ai/p/2025-inter...
rlhfbook dot com
rlhfbook dot com
Fast, precise image gen and editing is such a joy to use and surely mass market adoption is cranking.
More warranted to call a Code Red for that than Gemini 3.
Fast, precise image gen and editing is such a joy to use and surely mass market adoption is cranking.
More warranted to call a Code Red for that than Gemini 3.
cameronrwolfe.substack.com/p/olmo-3
cameronrwolfe.substack.com/p/olmo-3
What a year! We're back with an updated open model builder tier list, our top models of the year, and our predictions for 2026.
www.interconnects.ai/p/2025-open-...
What a year! We're back with an updated open model builder tier list, our top models of the year, and our predictions for 2026.
www.interconnects.ai/p/2025-open-...
@saurabhshah2.bsky.social and I tweaked some hyperparams and prompts, @hamishivi.bsky.social and @finbarr.bsky.social improved the code and boom!
New Olmo 3.1 RL-Zero 👾 An updated, solid baseline for your RL and reasoning research
My favorite RL run yet over 7+ years of doing RL.
The biggest fully open RL run ever?
Gold stars on downstream evals is our original release, this latest one is the final checkpoint on the plot.
@saurabhshah2.bsky.social and I tweaked some hyperparams and prompts, @hamishivi.bsky.social and @finbarr.bsky.social improved the code and boom!
New Olmo 3.1 RL-Zero 👾 An updated, solid baseline for your RL and reasoning research
at $2/H100 hour, Olmo 3 start to end would cost $2.75M
allenai.org/papers/olmo3
at $2/H100 hour, Olmo 3 start to end would cost $2.75M
allenai.org/papers/olmo3
My favorite RL run yet over 7+ years of doing RL.
The biggest fully open RL run ever?
Gold stars on downstream evals is our original release, this latest one is the final checkpoint on the plot.
My favorite RL run yet over 7+ years of doing RL.
The biggest fully open RL run ever?
Gold stars on downstream evals is our original release, this latest one is the final checkpoint on the plot.
Foundations of Reasoning in Language Models @ NeurIPS 2025
Today 13:45 - 14:30
Foundations of Reasoning in Language Models @ NeurIPS 2025
Today 13:45 - 14:30
The story of Olmo 3 (post-training), told through evals
NeurIPS Talk tomorrow.
Upper Level Room 2, 10:35AM.
Slides: docs.google.com/presentation...
The story of Olmo 3 (post-training), told through evals
NeurIPS Talk tomorrow.
Upper Level Room 2, 10:35AM.
Slides: docs.google.com/presentation...
Clips ⬇️
youtube.com/shorts/v0k5D...
youtube.com/shorts/4R-BF...
#AI4Science #NeurIPS2025
Clips ⬇️
youtube.com/shorts/v0k5D...
youtube.com/shorts/4R-BF...
#AI4Science #NeurIPS2025
Lots of need for fast, efficient open models. Reasoning model usage is likely closed labs more.
Lots of need for fast, efficient open models. Reasoning model usage is likely closed labs more.
Let us know what you think and what to improve!
(Hosted by Parasail)
This may give it the hug of death... would be my dream.
openrouter.ai/allenai/olmo...
Let us know what you think and what to improve!
(Hosted by Parasail)
This may give it the hug of death... would be my dream.
openrouter.ai/allenai/olmo...
Talks: I'm giving two talks on the last day at Workshops Dec. 7th
1. Good researchers obsess over evals.
10:35am-11:05am: Evaluating the Evolving LLM Lifecycle
2. Building Olmo 3 Think.
1:45pm-2:30pm: Foundations of Reasoning in Language Models
Talks: I'm giving two talks on the last day at Workshops Dec. 7th
1. Good researchers obsess over evals.
10:35am-11:05am: Evaluating the Evolving LLM Lifecycle
2. Building Olmo 3 Think.
1:45pm-2:30pm: Foundations of Reasoning in Language Models
We're hosting a dedicated booth to record researchers talking about their work, share that audio&video content on our socials, and start great conversations.
calendly.com/contact-read...
We're hosting a dedicated booth to record researchers talking about their work, share that audio&video content on our socials, and start great conversations.
calendly.com/contact-read...
This plot feels like a solid representation of ability v openness. My favorite part is Olmo 3 mogging on Llama 4 Maverick in every plot 🤭. It's pretty sad that the two values are pretty much perfectly anti-correlated.
This plot feels like a solid representation of ability v openness. My favorite part is Olmo 3 mogging on Llama 4 Maverick in every plot 🤭. It's pretty sad that the two values are pretty much perfectly anti-correlated.
I've just been so GPT Pro pilled. Why use DR?
I've just been so GPT Pro pilled. Why use DR?
We love releasing things that serve as a comprehensive snapshot of public knowledge on training leading language models.
There's an award for whoever finds all the secrets first in the new arxiv version
allenai.org/papers/olmo3
We love releasing things that serve as a comprehensive snapshot of public knowledge on training leading language models.
There's an award for whoever finds all the secrets first in the new arxiv version
allenai.org/papers/olmo3