Build custom evaluations for any model in <50 lines of code! 🙌
✅ Simple config change to evaluate models like GPT-4o, Claude 3.7, LLaMA 405B
✅ Learn how we evaluated SOTA LLMs as hallucination classifiers🧠✨
👉 Check it out: oumi.ai/blog/posts/c...
Build custom evaluations for any model in <50 lines of code! 🙌
✅ Simple config change to evaluate models like GPT-4o, Claude 3.7, LLaMA 405B
✅ Learn how we evaluated SOTA LLMs as hallucination classifiers🧠✨
👉 Check it out: oumi.ai/blog/posts/c...
📊Standardize benchmarks
🗣️Generative evals
🔧Customizable evals
Oumi has them all!
Today in our Weekly Walkthrough, @taenin.bsky.social will give you an overview of Oumi’s evaluation framework.
🎥 Watch now: youtu.be/GhHmtjMw-l4
📊Standardize benchmarks
🗣️Generative evals
🔧Customizable evals
Oumi has them all!
Today in our Weekly Walkthrough, @taenin.bsky.social will give you an overview of Oumi’s evaluation framework.
🎥 Watch now: youtu.be/GhHmtjMw-l4
@taenin.bsky.social tackles Batch Prediction in this edition of our Weekly Walkthrough, showing how you can quickly run inference over your data within the Oumi platform!
🎥 Watch now: youtu.be/3Yg3ycxCEYQ
@taenin.bsky.social tackles Batch Prediction in this edition of our Weekly Walkthrough, showing how you can quickly run inference over your data within the Oumi platform!
🎥 Watch now: youtu.be/3Yg3ycxCEYQ
In today’s Weekly Walkthrough, @taenin.bsky.social guides us through training with Oumi, showing us how to customize training for your needs (including functionality like PEFT and FSDP). 🙌
📹 Link to watch in thread!
In today’s Weekly Walkthrough, @taenin.bsky.social guides us through training with Oumi, showing us how to customize training for your needs (including functionality like PEFT and FSDP). 🙌
📹 Link to watch in thread!
@jgreer-oumi.bsky.social's MiniMath-R1-1.5B has over 750 downloads since we released it last week!
Have you used it to train your own DeepSeek R1 model yet?
We collected 650,000+ R1 prompts and responses for you to train your own DeepSeek-R1 model.
With this data we made MiniMath-R1-1.5B, the top MMLU-Pro-Math model at <=1.5B parameters, achieving an accuracy of 44.4%!
Data, model, and notebook in thread! 🙌
@jgreer-oumi.bsky.social's MiniMath-R1-1.5B has over 750 downloads since we released it last week!
Have you used it to train your own DeepSeek R1 model yet?
Each week, Matthew will walk you through how you can use Oumi to make the most of your machine learning workflows ⚙️
In today’s video, Matthew covers model inference in Oumi 🚀
🎥 Link to watch in thread!
Each week, Matthew will walk you through how you can use Oumi to make the most of your machine learning workflows ⚙️
In today’s video, Matthew covers model inference in Oumi 🚀
🎥 Link to watch in thread!
We collected 650,000+ R1 prompts and responses for you to train your own DeepSeek-R1 model.
With this data we made MiniMath-R1-1.5B, the top MMLU-Pro-Math model at <=1.5B parameters, achieving an accuracy of 44.4%!
Data, model, and notebook in thread! 🙌
We collected 650,000+ R1 prompts and responses for you to train your own DeepSeek-R1 model.
With this data we made MiniMath-R1-1.5B, the top MMLU-Pro-Math model at <=1.5B parameters, achieving an accuracy of 44.4%!
Data, model, and notebook in thread! 🙌
We’ve talked a lot about why we built Oumi. Now, let’s dive into what it actually does.
In this video, @taenin.bsky.social walks through the core functionality of Oumi, how it works, and why it matters for AI researchers and developers.
🎥 Link to watch in thread!
We’ve talked a lot about why we built Oumi. Now, let’s dive into what it actually does.
In this video, @taenin.bsky.social walks through the core functionality of Oumi, how it works, and why it matters for AI researchers and developers.
🎥 Link to watch in thread!
Check out their work!👇
📄 Arxiv: arxiv.org/abs/2502.08820
🌐 Project: emrecanacikgoz.github.io/CoALM/
Check out their work!👇
📄 Arxiv: arxiv.org/abs/2502.08820
🌐 Project: emrecanacikgoz.github.io/CoALM/
Join us tomorrow, Feb 14 9am-12pm PT for a
Reddit AMA on r/MachineLearning & bring your best questions for our founders on all things open source!
www.reddit.com/r/MachineLea...
Join us tomorrow, Feb 14 9am-12pm PT for a
Reddit AMA on r/MachineLearning & bring your best questions for our founders on all things open source!
www.reddit.com/r/MachineLea...
We’re blown away by the response we’ve received since launching last week.
"The community is ready for 100% truly open AI. Together, we can build the open-source AI the world needs." @koukoumidis.bsky.social
Join us: lnkd.in/gAw3BXGW
#OpenAI #Collaboration #Oumi
We’re blown away by the response we’ve received since launching last week.
"The community is ready for 100% truly open AI. Together, we can build the open-source AI the world needs." @koukoumidis.bsky.social
Join us: lnkd.in/gAw3BXGW
#OpenAI #Collaboration #Oumi
Check out Oumi in today's Daily Rede tech & AI newsletter: Oumi debuts 'open' AI 🌐, DeepSeek under scope 🕵️♂️ Read more: rede.io/newsletters/...
Check out Oumi in today's Daily Rede tech & AI newsletter: Oumi debuts 'open' AI 🌐, DeepSeek under scope 🕵️♂️ Read more: rede.io/newsletters/...
github.com/oumi-ai/oumi
github.com/oumi-ai/oumi