Ai2
banner
ai2.bsky.social
Ai2
@ai2.bsky.social
Breakthrough AI to solve the world's biggest problems.

› Join us: http://allenai.org/careers
› Get our newsletter: https://share.hsforms.com/1uJkWs5aDRHWhiky3aHooIg3ioxm
Pinned
Ai2 @ai2.bsky.social · Dec 16
Last year Molmo set SOTA on image benchmarks + pioneered image pointing. Millions of downloads later, Molmo 2 brings Molmo’s grounded multimodal capabilities to video 🎥—and leads many open models on challenging industry video benchmarks. 🧵
"We wanted to provide subject matter experts and communities that have the expertise on the ground with the tools to engage with AI without having to learn AI deeply.” - Ted Schmitt. Thanks @mongabay.com for diving into our new OlmoEarth platform. 📷
January 16, 2026 at 8:55 PM
SciArena update: our Olmo 3.1 32B Instruct scores 963.6 Elo overall at just $0.17/100 calls—ahead of OpenAI’s GPT-OSS-20B. In Engineering, it hits 1039.2 Elo, only 2.5 behind GPT-OSS-120B—a model ~4× its size. 🧵
January 16, 2026 at 5:57 PM
Molmo 2 is now available via API on @openrouter.bsky.social, courtesy of Parasail—free until 1/29.
State-of-the-art video understanding with pointing, counting, and multi-frame reasoning—track objects through scenes & identify where + when events occur.
Open. Apache 2.0. 👇
January 13, 2026 at 5:59 PM
Olmo 3.1 32B Instruct is now on @openrouter.bsky.social, hosted by DeepInfra. Built for real-world use: reliable instruction following & function calling for agentic workflows + research. Fully open & leading benchmark performance, ready to plug into your stack. 👇
January 8, 2026 at 8:00 PM
🆕 New in Asta: multi-turn report generation.
You can now have back-and-forth conversations with Asta, our agentic platform for scientific research, to refine long-form, fully cited reports instead of relying on single-shot prompts.
December 18, 2025 at 4:09 PM
Now you can use our most powerful models via API.
Olmo 3.1 32B Think, our reasoning model for complex problems, is on @openrouter.bsky.social—free through 12/22. And Olmo 3.1 32B Instruct, our flagship chat model with tool use, is available through @hf.co Inference Providers. 👇
December 17, 2025 at 9:02 PM
🎥 Introducing SAGE, an agentic system for long video reasoning on entertainment videos—sports, vlogs, & more. It learns when to skim, zoom in, & answer questions directly. On our SAGE-Bench eval, SAGE with a Molmo 2 (8B)-based orchestrator lifts accuracy from 61.8% → 66.1%. 🧵
December 17, 2025 at 5:57 PM
🎗️ Reminder, our Molmo 2 and Olmo 3 Reddit AMA begins soon at 1pm PST / 4pm EST. www.reddit.com/r/LocalLLaMA...
From the LocalLLaMA community on Reddit
Explore this post and more from the LocalLLaMA community
www.reddit.com
December 16, 2025 at 8:41 PM
Last year Molmo set SOTA on image benchmarks + pioneered image pointing. Millions of downloads later, Molmo 2 brings Molmo’s grounded multimodal capabilities to video 🎥—and leads many open models on challenging industry video benchmarks. 🧵
December 16, 2025 at 4:52 PM
🗓️ Tue Dec 16, 1–2pm PT: AMA with researchers + engineers from our Olmo & Molmo teams, hosted by r/LocalLLaMA.
💬 Ask your questions now—we’ll start answering when the AMA begins!
December 15, 2025 at 10:25 PM
Introducing Bolmo, a new family of byte-level language models built by "byteifying" our open Olmo 3—and to our knowledge, the first fully open byte-level LM to match or surpass SOTA subword models across a wide range of tasks. 🧵
December 15, 2025 at 5:19 PM
🧠 Introducing NeuroDiscoveryBench. Built with @alleninstitute.org, it’s the first benchmark for evaluating AI systems like our Asta DataVoyager agent on neuroscience data. The benchmark tests whether AI can truly extract insights from complex brain datasets.
December 12, 2025 at 8:41 PM
Olmo 3.1 is here. We extended our strongest RL run and scaled our instruct recipe to 32B—releasing Olmo 3.1 Think 32B & Olmo 3.1 Instruct 32B, our most capable models yet. 🧵
December 12, 2025 at 5:14 PM
Update: DataVoyager, which we launched in Preview early this fall, is now available in Asta. 🎉
You can upload real datasets, ask complex research questions in natural language, & get back reproducible answers + visualizations. 🔍📊
December 8, 2025 at 8:47 PM
We're at #NeurIPS2025 with papers, posters, workshops, fireside chats, & talks across the conference. Come learn about our latest research + see live demos!
December 2, 2025 at 6:05 PM
SciArena leaderboard update! 🔬
We've added new frontier models – including GPT-5.1 and Gemini 3 Pro Preview – to our arena for scientific literature tasks. The new rankings: o3 holds #1, Gemini 3 Pro Preview lands at #2, Claude Opus 4.1 sits at #3, GPT-5 at #4, & GPT-5.1 debuts at #5. 🧵
December 1, 2025 at 8:24 PM
Olmo 3 is now available through @hf.co Inference Providers, thanks to Public AI! 🎉
This means you can run our fully open 7B and 32B models — including Think and Instruct variants — via serverless API with no infrastructure to manage.
November 28, 2025 at 4:50 PM
We recently released OlmoEarth—an open spatio-temporal foundation model for planetary intelligence, trained on multimodal Earth observation, OpenStreetMap, & other open data. It sets a new accuracy/efficiency Pareto frontier and powers our OlmoEarth Platform. Technical report now on arXiv. 👇
November 26, 2025 at 7:23 PM
⚠️ Update on Deep Research Tulu (DR Tulu), our post-training recipe for deep research agents: we’re releasing an upgraded version of our example agent, DR Tulu-8B (RL), that matches or beats systems like Gemini 3 Pro & Tongyi DeepResearch-30B-A3B on core benchmarks. 🧵
November 25, 2025 at 7:37 PM
Our Olmo 3 models are now available via API on
@openrouter.bsky.social. Try Olmo 3-Instruct (7B) for chat & tool use, and our reasoning models Olmo-3 Think (7B & 32B) for more complex problems.
November 22, 2025 at 1:58 AM
Missed our Olmo 3 livestream? Check it out on YouTube! The Olmo team recaps what makes Olmo 3 our most exciting LM release yet, perhaps most importantly our fully open model flow of checkpoints, training data & recipes, and more. Watch 📷 www.youtube.com/watch?v=QUFK...
Olmo 3 | Livestream with Hugging Face
YouTube video by Ai2
www.youtube.com
November 20, 2025 at 10:03 PM
Announcing Olmo 3, a leading fully open LM suite built for reasoning, chat, & tool use, and an open model flow—not just the final weights, but the entire training journey.
Best fully open 32B reasoning model & best 32B base model. 🧵
November 20, 2025 at 2:37 PM
Today we’re releasing Deep Research Tulu (DR Tulu)—the first fully open, end-to-end recipe for long-form deep research, plus an 8B agent you can use right away. Train agents that plan, search, synthesize, & cite across sources, making expert research more accessible. 🧭📚
November 18, 2025 at 3:31 PM
Introducing OlmoEarth 🌍, state-of-the-art AI foundation models paired with ready-to-use open infrastructure to turn Earth data into clear, up-to-date insights within hours—not years.
November 4, 2025 at 2:52 PM
Our Olmo Discord AMA just wrapped! Researchers answered community questions about their work with Olmo, our family of fully open LLMs. Here’s some highlights. 🧵
October 28, 2025 at 6:56 PM