Lightnews — Scholar-powered news

Oumi

@oumi-pbc.bsky.social

🚀 Custom Evaluations Made Easy with Oumi 🚀

Build custom evaluations for any model in <50 lines of code! 🙌

✅ Simple config change to evaluate models like GPT-4o, Claude 3.7, LLaMA 405B
✅ Learn how we evaluated SOTA LLMs as hallucination classifiers🧠✨

👉 Check it out: oumi.ai/blog/posts/c...

Oumi - Build Custom Evaluations for any Open or Closed Model in just 50 Lines of Code

Use Oumi to build custom evaluations for any open or closed model, in just 50 lines of code

oumi.ai

March 25, 2025 at 9:56 PM

Oumi

@oumi-pbc.bsky.social

Looking for simple, robust model evaluations? Look no further!

📊Standardize benchmarks
🗣️Generative evals
🔧Customizable evals

Oumi has them all!

Today in our Weekly Walkthrough, @taenin.bsky.social will give you an overview of Oumi’s evaluation framework.

🎥 Watch now: youtu.be/GhHmtjMw-l4

Evaluate Models with Oumi [walkthrough]

YouTube video by Oumi

youtu.be

March 19, 2025 at 5:23 PM

Oumi

@oumi-pbc.bsky.social

Need to label a dataset? Looking to run inference at scale? You’re in luck! 🍀

@taenin.bsky.social tackles Batch Prediction in this edition of our Weekly Walkthrough, showing how you can quickly run inference over your data within the Oumi platform!

🎥 Watch now: youtu.be/3Yg3ycxCEYQ

Let's run batch prediction with Oumi [walkthrough]

YouTube video by Oumi

youtu.be

March 12, 2025 at 4:14 PM

Oumi

@oumi-pbc.bsky.social

This week we’re headed to the train station, Platform 135M+! 🚂

In today’s Weekly Walkthrough, @taenin.bsky.social guides us through training with Oumi, showing us how to customize training for your needs (including functionality like PEFT and FSDP). 🙌

📹 Link to watch in thread!

March 5, 2025 at 9:06 PM

Oumi

@oumi-pbc.bsky.social

Apparently a lot of you were dreaming about DeepSeek's R1 model 👀

@jgreer-oumi.bsky.social's MiniMath-R1-1.5B has over 750 downloads since we released it last week!

Have you used it to train your own DeepSeek R1 model yet?

Oumi @oumi-pbc.bsky.social · Feb 25

Dreaming about DeepSeek R1 this month? ☁️

We collected 650,000+ R1 prompts and responses for you to train your own DeepSeek-R1 model.

With this data we made MiniMath-R1-1.5B, the top MMLU-Pro-Math model at <=1.5B parameters, achieving an accuracy of 44.4%!

Data, model, and notebook in thread! 🙌

March 4, 2025 at 5:39 PM

Oumi

@oumi-pbc.bsky.social

On Fridays, we meme.

Credit: @jgreer-oumi.bsky.social

February 28, 2025 at 11:14 PM

Oumi

@oumi-pbc.bsky.social

Today we’re kicking off the first of our Weekly Walkthroughs with @taenin.bsky.social 🙌

Each week, Matthew will walk you through how you can use Oumi to make the most of your machine learning workflows ⚙️

In today’s video, Matthew covers model inference in Oumi 🚀

🎥 Link to watch in thread!

February 26, 2025 at 5:29 PM

Oumi

@oumi-pbc.bsky.social

Dreaming about DeepSeek R1 this month? ☁️

We collected 650,000+ R1 prompts and responses for you to train your own DeepSeek-R1 model.

With this data we made MiniMath-R1-1.5B, the top MMLU-Pro-Math model at <=1.5B parameters, achieving an accuracy of 44.4%!

Data, model, and notebook in thread! 🙌

February 25, 2025 at 5:01 PM

Oumi

@oumi-pbc.bsky.social

What is Oumi? 🤔

We’ve talked a lot about why we built Oumi. Now, let’s dive into what it actually does.

In this video, @taenin.bsky.social walks through the core functionality of Oumi, how it works, and why it matters for AI researchers and developers.

🎥 Link to watch in thread!

February 21, 2025 at 6:40 PM

Oumi

@oumi-pbc.bsky.social

Last week, Emre Can Acikgoz (PhD at UIUC) released CoALM 🚀 fully open-source Conversational Agentic Language Models trained using Oumi 🙌

Check out their work!👇

📄 Arxiv: arxiv.org/abs/2502.08820
🌐 Project: emrecanacikgoz.github.io/CoALM/

Can a Single Model Master Both Multi-turn Conversations and Tool Use? CoALM: A Unified Conversational Agentic Language Model

Large Language Models (LLMs) with API-calling capabilities enabled building effective Language Agents (LA), while also revolutionizing the conventional task-oriented dialogue (TOD) paradigm. However, ...

arxiv.org

February 20, 2025 at 10:38 PM

Oumi

@oumi-pbc.bsky.social

Do you have questions for the Oumi team on #OpenSource, #DeepSeek, & #FrontierAI?💡

Join us tomorrow, Feb 14 9am-12pm PT for a
Reddit AMA on r/MachineLearning & bring your best questions for our founders on all things open source!

www.reddit.com/r/MachineLea...

From the MachineLearning community on Reddit: [D] We built GenAI at Google and Apple, then left to build an open source AI lab, to enable the open community to collaborate and build the next DeepSeek....

Posted by koukoumidis - 5 votes and 0 comments

www.reddit.com

February 14, 2025 at 12:11 AM

Oumi

@oumi-pbc.bsky.social

Oumi is trending on GitHub 🤯

We’re blown away by the response we’ve received since launching last week.

"The community is ready for 100% truly open AI. Together, we can build the open-source AI the world needs." @koukoumidis.bsky.social

Join us: lnkd.in/gAw3BXGW

#OpenAI #Collaboration #Oumi

February 3, 2025 at 6:27 PM

Oumi

@oumi-pbc.bsky.social

"Global AI collaboration" 👏

Check out Oumi in today's Daily Rede tech & AI newsletter: Oumi debuts 'open' AI 🌐, DeepSeek under scope 🕵️‍♂️ Read more: rede.io/newsletters/...

Oumi debuts 'open' AI 🌐, DeepSeek under scope 🕵️‍♂️

Oumi strives to be AI's 'Linux moment' with its open-source ambition. Oumi is an open-source AI platform backed by top universities, aiming to support the collaborative development of AI models. Oumi'...

rede.io

January 31, 2025 at 9:06 PM

Oumi

@oumi-pbc.bsky.social

🚀

January 30, 2025 at 6:34 PM

Oumi

@oumi-pbc.bsky.social

Introducing Oumi 🚀 a community of researchers and developers united in their mission to make frontier AI more open, collaborative, and accessible. Join us in building the platform, models, and tools: let’s shape the future of AI. #oumi #opensource #collaboration

github.com/oumi-ai/oumi

January 29, 2025 at 4:22 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news