🌟 open-data 🌟
🤖 open-models 🤖
💻 open-source 💻
💪 anyone can compete for free 💪
dcvlr-neurips.github.io
🧵 1 / n
Build custom evaluations for any model in <50 lines of code! 🙌
✅ Simple config change to evaluate models like GPT-4o, Claude 3.7, LLaMA 405B
✅ Learn how we evaluated SOTA LLMs as hallucination classifiers🧠✨
👉 Check it out: oumi.ai/blog/posts/c...
📊Standardized benchmarks
🗣️Generative evals
🔧Customizable evals
Oumi has them all!
Today in our Weekly Walkthrough, @taenin.bsky.social will give you an overview of Oumi’s evaluation framework.
🎥 Watch now: youtu.be/GhHmtjMw-l4
@taenin.bsky.social tackles Batch Prediction in this edition of our Weekly Walkthrough, showing how you can quickly run inference over your data within the Oumi platform!
🎥 Watch now: youtu.be/3Yg3ycxCEYQ
Also whoever decided to make Phi have 10 KV heads instead of a multiple of 4 or 8, why do you hate multiple GPUs 😭
@jgreer-oumi.bsky.social's MiniMath-R1-1.5B has over 750 downloads since we released it last week!
Have you used it to train your own DeepSeek R1 model yet?
We collected 650,000+ R1 prompts and responses for you to train your own DeepSeek-R1 model.
With this data we made MiniMath-R1-1.5B, the top MMLU-Pro-Math model at <=1.5B parameters, achieving an accuracy of 44.4%!
Data, model, and notebook in thread! 🙌
The community is ready for 100% truly open AI. Let’s keep the momentum going. Together, we can build the open-source AI the world truly needs. 🌍 🚀
#oumi #opensource #collaboration