MLflow
banner
mlflow.org
MLflow
@mlflow.org
An open source machine learning platform for managing the complete ML lifecycle
Missed last week’s #MLflow Community Meetup? Check out Ben Wilson on agentic judges: “The judge no longer works as an LLM as a judge—it actually works as an agent as a judge.”

🎥 Full video: www.youtube.com/live/bkMabn8...

#opensource #oss #agenticjudges
October 15, 2025 at 2:14 PM
⚡ In this lightning talk at MLOps World, Danny Chiao tackled a top agent challenge: ensuring high quality output.

Rather than labeling and analyzing traces by hand, MLflow makes it easy to log, evaluate, and iterate faster—using techniques leading companies rely on to deploy agents in production. ✅
October 9, 2025 at 7:31 PM
🚨 Reminder: MLflow Community Meetup is tomorrow, Oct 8 at 4:00 PM PT!

We'll explore trace‑aware, feedback‑aligned judges and versioned eval datasets in MLflow. You don't wait to miss it!

🎥 LIVE on LinkedIn, YouTube & X
🔗 RSVP: luma.com/mlflow-1001

#opensource #oss #mlflow
October 7, 2025 at 4:42 PM
Building better LLM evals? Ben Wilson highlights how frameworks like #DSPy boost judge prompts—and reliability—as models evolve.

Tips for judge reproducibility/reliability: use reproducible pipelines, re-tune logic as endpoints change, & standardize. ✅

🎥 Watch more: www.youtube.com/live/HTxpmnO...
October 6, 2025 at 5:59 PM
🚀 Headed to MLOps World | GenAI Summit 2025 next week? Don’t miss an exciting lightning talk from Danny Chiao, Engineering Lead at Databricks!

🎤 𝗧𝗲𝗰𝗵𝗻𝗶𝗾𝘂𝗲𝘀 𝘁𝗼 𝗯𝘂𝗶𝗹𝗱 𝗵𝗶𝗴𝗵 𝗾𝘂𝗮𝗹𝗶𝘁𝘆 𝗮𝗴𝗲𝗻𝘁𝘀 𝗳𝗮𝘀𝘁𝗲𝗿 𝘄𝗶𝘁𝗵 𝗠𝗟𝗳𝗹𝗼𝘄

🗓️ October 9
📍 Austin, TX
🔗 Learn more: mlopsworld.com#agenda

#MLflow #GenAI #MLOps #LLM
October 1, 2025 at 9:17 PM
🚀 The fifth “Invoice Extraction with OpenAI + MLflow” session is now available! #MLflow Ambassador Shrinath Suresh dives into designing a custom scorer to evaluate invoice extraction models beyond just labels or LLM-as-a-judge.

🎥 youtu.be/SmuhOmOYXSg?...
📖 medium.com/@shrinath.su...

#opensource
September 30, 2025 at 8:05 PM
This blog highlights how MLflow’s #GenAI capabilities streamline development of an LLM-based Optical Character Recognition (OCR) tool. These capabilities reduce friction, accelerate workflows, and deliver value to both technical and non-technical contributors.

🚀 Dive in: mlflow.org/blog/mlflow-...
September 15, 2025 at 8:05 PM
This blog looks at the “Coffee Machine” approach: global teams set up standardized #ML pipelines, & local teams adapt them using their own #data. ☕

#MLflow supports every step, making it possible to track changes, register model variants, & maintain reproducibility.

🔗 medium.com/dscier/brewi...
September 11, 2025 at 3:35 PM
📣 Happening Tomorrow — MLflow Office Hours!

Join #MLflow maintainers for a live Q&A session! Whether you’re running MLflow in production or experimenting with LLMs & GenAI, this is your chance to bring real challenges and get direct feedback.

🕒 Sept 10 @ 3PM SGT
🎟 RSVP: lu.ma/mlflow-910

#oss
September 9, 2025 at 8:34 PM
Evaluating AI agents is tricky—they make multi-step decisions, use tools, and follow reasoning chains. To measure them, you need to assess the workflow, not just the final answer. That’s where MLflow’s new agent evaluation framework steps in.

🔗 Docs: mlflow.org/docs/latest/...

#MLflow #AI #MLOps
September 5, 2025 at 3:05 PM
📣 MLflow 3.3.0 is now available!

This release introduces several major features and improvements:
🔹 𝗠𝗼𝗱𝗲𝗹 𝗥𝗲𝗴𝗶𝘀𝘁𝗿𝘆 𝗪𝗲𝗯𝗵𝗼𝗼𝗸𝘀
🔹 𝗔𝗴𝗻𝗼 𝗧𝗿𝗮𝗰𝗶𝗻𝗴 𝗜𝗻𝘁𝗲𝗴𝗿𝗮𝘁𝗶𝗼𝗻
🔹 𝗚𝗲𝗻𝗔𝗜 𝗘𝘃𝗮𝗹𝘂𝗮𝘁𝗶𝗼𝗻 𝗶𝗻 𝗢𝗦𝗦
🔹 𝗥𝗲𝘃𝗮𝗺𝗽𝗲𝗱 𝗧𝗿𝗮𝗰𝗲 𝗧𝗮𝗯𝗹𝗲 𝗩𝗶𝗲𝘄
🔹 𝗙𝗮𝘀𝘁𝗔𝗣𝗜 + 𝗨𝘃𝗶𝗰𝗼𝗿𝗻 𝗦𝗲𝗿𝘃𝗲𝗿

🔗 Check out the release notes: github.com/mlflow/mlflo...

#oss
August 22, 2025 at 1:17 PM
Join us for the next MLflow Community Meetup! 🥳

This month, we’re excited to bring you the latest updates and insights from the MLflow ecosystem, featuring:
​🔔 Webhooks in MLflow
🧪 Managed MLflow Features Now in OSS

📅 August 20 @ 4:00 PM PT
📍 Live on X, YouTube, & LinkedIn

🔗 RSVP: lu.ma/mlflow820
August 7, 2025 at 6:59 PM
MLflow 3.2 is here! 🚀

Key updates include:
🔹 𝗧𝘆𝗽𝗲𝗦𝗰𝗿𝗶𝗽𝘁 𝗦𝗗𝗞 𝗳𝗼𝗿 𝗧𝗿𝗮𝗰𝗶𝗻𝗴
🔹 𝗦𝗲𝗺𝗮𝗻𝘁𝗶𝗰 𝗞𝗲𝗿𝗻𝗲𝗹 𝗧𝗿𝗮𝗰𝗶𝗻𝗴
🔹 𝗜𝗻𝘁𝗲𝗴𝗿𝗮𝘁𝗲𝗱 𝗙𝗲𝗲𝗱𝗯𝗮𝗰𝗸 𝗟𝗼𝗴𝗴𝗶𝗻𝗴
🔹 𝗘𝘅𝗽𝗲𝗿𝗶𝗺𝗲𝗻𝘁 𝗨𝗜 𝗥𝗲𝗳𝗿𝗲𝘀𝗵
🔹 𝗧𝗿𝗮𝗰𝗲 𝗨𝗜 𝗘𝗻𝗵𝗮𝗻𝗰𝗲𝗺𝗲𝗻𝘁𝘀
🔹 𝗣𝗜𝗜 𝗠𝗮𝘀𝗸𝗶𝗻𝗴 𝗶𝗻 𝗧𝗿𝗮𝗰𝗶𝗻𝗴
🔹 𝗣𝗼𝗹𝗮𝗿𝘀 𝗗𝗮𝘁𝗮𝗙𝗿𝗮𝗺𝗲 𝗟𝗼𝗴𝗴𝗶𝗻𝗴

🔗 Check out the full release notes → github.com/mlflow/mlflo...

#oss
August 5, 2025 at 5:15 PM
Have #MLflow questions? Join us this Wed, August 6 for MLflow Office Hours! 🚀

What to Expect:
✅ Live troubleshooting of your MLflow implementation questions
✅ Expert insights on model lifecycle management & GenAI integrations
✅ Sneak peeks at new & upcoming features

🔗 Register: lu.ma/mlflow8625
August 4, 2025 at 2:30 PM
📣 Join us next Wed, August 6 for #MLflow Office Hours — your live Q&A with core MLflow maintainers & contributors!

This is your opportunity to get expert, hands-on technical feedback directly from the team! 🤝 Open to all levels — bring your questions & join the discussion.

🔗 RSVP: lu.ma/mlflow8625
July 29, 2025 at 2:45 PM
"When we update any part of a system, we need to make sure we’re not introducing regressions — and ideally, that we’re improving quality. One way to address this is through evaluation-driven development." - Yuki Watanabe, MLflow Maintainer 💬

🔗 Watch the full webinar: www.youtube.com/watch?v=-jzm...
July 18, 2025 at 2:58 PM
Model evaluation is the foundation of reliable machine learning. With #MLflow’s evaluation framework, you get more than just accuracy—you get deep insights into model behavior, automated testing, rich visualizations, and validation pipelines.

Get started ➡️ mlflow.org/docs/latest/...

#opensource
July 17, 2025 at 2:27 PM
Happening TODAY! 🚨 Join Agents in Production 2025 for a can’t-miss session on building evaluation-driven #Agentic systems with #MLflow 3.0.

🔎 Yuki Watanabe will break down:
✅ One-line observability
✅ Automatic evaluation
✅ Human-in-the-loop feedback

🔗 Register: home.mlops.community/home/events/...
July 17, 2025 at 12:36 PM
🎥 “People ask for it. It’s open-source. It works for both classical ML and GenAI. And it’s easy to start.”

Mikhail Rozhkov from Nebius shares why their team chose #MLflow as their experiment tracking and observability platform.

👀 Watch the full video: www.youtube.com/live/4Ok-03P...

#opensource
July 16, 2025 at 2:39 PM
🚨 Happening TOMORROW, July 17 — Agents in Production 2025 is almost here!

Get an inside look at MLflow 3.0’s powerful new features:
✅ One-line observability
✅ Automatic evaluation
✅ Human-in-the-loop feedback

Free + virtual! 🎟️ Register here: home.mlops.community/home/events/...

#opensource #oss
July 16, 2025 at 1:33 PM
MLflow will soon support TypeScript natively! 🙌

You'll be able to:
✅ Track, monitor, and evaluate projects in Python, TypeScript, or JavaScript—all in one place
✅ Unify workflows across data science & full-stack development

👀 Get an exclusive preview: www.youtube.com/watch?v=-jzm...

#oss #mlflow
July 15, 2025 at 8:16 PM
At what stage or scale of an ML project would you recommend moving from self-hosted MLflow to SageMaker managed MLflow?

Watch the full clip for the answer! ⤵️

#sagemaker #mlflow #opensource #oss #ML
July 15, 2025 at 3:51 PM
Are your AI responses off-topic?

MLflow answer relevance gives instant 1-5 scores showing which responses miss the user's actual question. No more manual review.

📕 View the full article: buff.ly/mT4mlmZ

Credit: Khuyen Tran, Founder of CodeCut
July 15, 2025 at 2:13 PM
📣 Agents in Production 2025 is this Thursday, July 17!
Join Yuki Watanabe (Databricks) for “Driving Evaluation-Driven Development with MLflow 3.0.”

✅ Explore new #MLflow features & agentic app strategies.

Free & virtual! Register 👉 home.mlops.community/home/events/...

#agenticai #opensource
July 14, 2025 at 1:45 PM
Don’t miss Yuki Watanabe’s talk at Agents in Production 2025! See how #MLflow 3.0 enables evaluation-driven Agentic app development and helps build more reliable AI systems.

🗓️ Thursday, July 17
💻 Free & virtual

🔗 Register: home.mlops.community/home/events/agentsinproduction2025

#opensource #oss
July 9, 2025 at 2:37 PM