Oumi
banner
oumi-pbc.bsky.social
Oumi
@oumi-pbc.bsky.social
Let’s build better AI - open is the path forward
🚀 Custom Evaluations Made Easy with Oumi 🚀

Build custom evaluations for any model in <50 lines of code! 🙌

✅ Simple config change to evaluate models like GPT-4o, Claude 3.7, LLaMA 405B
✅ Learn how we evaluated SOTA LLMs as hallucination classifiers🧠✨

👉 Check it out: oumi.ai/blog/posts/c...
Oumi - Build Custom Evaluations for any Open or Closed Model in just 50 Lines of Code
Use Oumi to build custom evaluations for any open or closed model, in just 50 lines of code
oumi.ai
March 25, 2025 at 9:56 PM
Looking for simple, robust model evaluations? Look no further!

📊Standardize benchmarks
🗣️Generative evals
🔧Customizable evals

Oumi has them all!

Today in our Weekly Walkthrough, @taenin.bsky.social will give you an overview of Oumi’s evaluation framework.

🎥 Watch now: youtu.be/GhHmtjMw-l4
Evaluate Models with Oumi [walkthrough]
YouTube video by Oumi
youtu.be
March 19, 2025 at 5:23 PM
Need to label a dataset? Looking to run inference at scale? You’re in luck! 🍀

@taenin.bsky.social tackles Batch Prediction in this edition of our Weekly Walkthrough, showing how you can quickly run inference over your data within the Oumi platform!

🎥 Watch now: youtu.be/3Yg3ycxCEYQ
Let's run batch prediction with Oumi [walkthrough]
YouTube video by Oumi
youtu.be
March 12, 2025 at 4:14 PM
This week we’re headed to the train station, Platform 135M+! 🚂

In today’s Weekly Walkthrough, @taenin.bsky.social guides us through training with Oumi, showing us how to customize training for your needs (including functionality like PEFT and FSDP). 🙌

📹 Link to watch in thread!
March 5, 2025 at 9:06 PM
Apparently a lot of you were dreaming about DeepSeek's R1 model 👀

@jgreer-oumi.bsky.social's MiniMath-R1-1.5B has over 750 downloads since we released it last week!

Have you used it to train your own DeepSeek R1 model yet?
Dreaming about DeepSeek R1 this month? ☁️

We collected 650,000+ R1 prompts and responses for you to train your own DeepSeek-R1 model.

With this data we made MiniMath-R1-1.5B, the top MMLU-Pro-Math model at <=1.5B parameters, achieving an accuracy of 44.4%!

Data, model, and notebook in thread! 🙌
March 4, 2025 at 5:39 PM
On Fridays, we meme.

Credit: @jgreer-oumi.bsky.social
February 28, 2025 at 11:14 PM
Today we’re kicking off the first of our Weekly Walkthroughs with @taenin.bsky.social 🙌

Each week, Matthew will walk you through how you can use Oumi to make the most of your machine learning workflows ⚙️

In today’s video, Matthew covers model inference in Oumi 🚀

🎥 Link to watch in thread!
February 26, 2025 at 5:29 PM
Dreaming about DeepSeek R1 this month? ☁️

We collected 650,000+ R1 prompts and responses for you to train your own DeepSeek-R1 model.

With this data we made MiniMath-R1-1.5B, the top MMLU-Pro-Math model at <=1.5B parameters, achieving an accuracy of 44.4%!

Data, model, and notebook in thread! 🙌
February 25, 2025 at 5:01 PM
What is Oumi? 🤔

We’ve talked a lot about why we built Oumi. Now, let’s dive into what it actually does.

In this video, @taenin.bsky.social walks through the core functionality of Oumi, how it works, and why it matters for AI researchers and developers.

🎥 Link to watch in thread!
February 21, 2025 at 6:40 PM
Last week, Emre Can Acikgoz (PhD at UIUC) released CoALM 🚀 fully open-source Conversational Agentic Language Models trained using Oumi 🙌

Check out their work!👇

📄 Arxiv: arxiv.org/abs/2502.08820
🌐 Project: emrecanacikgoz.github.io/CoALM/
Can a Single Model Master Both Multi-turn Conversations and Tool Use? CoALM: A Unified Conversational Agentic Language Model
Large Language Models (LLMs) with API-calling capabilities enabled building effective Language Agents (LA), while also revolutionizing the conventional task-oriented dialogue (TOD) paradigm. However, ...
arxiv.org
February 20, 2025 at 10:38 PM
Do you have questions for the Oumi team on #OpenSource, #DeepSeek, & #FrontierAI?💡

Join us tomorrow, Feb 14 9am-12pm PT for a
Reddit AMA on r/MachineLearning & bring your best questions for our founders on all things open source!

www.reddit.com/r/MachineLea...
From the MachineLearning community on Reddit: [D] We built GenAI at Google and Apple, then left to build an open source AI lab, to enable the open community to collaborate and build the next DeepSeek....
Posted by koukoumidis - 5 votes and 0 comments
www.reddit.com
February 14, 2025 at 12:11 AM
Oumi is trending on GitHub 🤯

We’re blown away by the response we’ve received since launching last week.

"The community is ready for 100% truly open AI. Together, we can build the open-source AI the world needs." @koukoumidis.bsky.social

Join us: lnkd.in/gAw3BXGW

#OpenAI #Collaboration #Oumi
February 3, 2025 at 6:27 PM
"Global AI collaboration" 👏

Check out Oumi in today's Daily Rede tech & AI newsletter: Oumi debuts 'open' AI 🌐, DeepSeek under scope 🕵️‍♂️ Read more: rede.io/newsletters/...
Oumi debuts 'open' AI 🌐, DeepSeek under scope 🕵️‍♂️
Oumi strives to be AI's 'Linux moment' with its open-source ambition. Oumi is an open-source AI platform backed by top universities, aiming to support the collaborative development of AI models. Oumi'...
rede.io
January 31, 2025 at 9:06 PM
🚀
January 30, 2025 at 6:34 PM
Introducing Oumi 🚀 a community of researchers and developers united in their mission to make frontier AI more open, collaborative, and accessible. Join us in building the platform, models, and tools: let’s shape the future of AI. #oumi #opensource #collaboration

github.com/oumi-ai/oumi
January 29, 2025 at 4:22 PM