Lightnews — Scholar-powered news

Reposted by Omar Khattab

@bexelbie.com

Drew’s post is well worth reading as DSPy seems to be a missing link in thinking about LLM usage. Very readable and interesting. www.dbreunig.com/2025/06/10/l...

Thank you @simonwillison.net

October 6, 2025 at 11:42 AM

Reposted by Omar Khattab

Simon Willison

@simonwillison.net

If you've been trying to figure out DSPy - the automatic prompt optimization system - this talk by @dbreunig.bsky.social is the clearest explanation I've seen yet, with a very useful real-world case study www.youtube.com/watch?v=I9Zt...

My notes here: simonwillison.net/2025/Oct/4/d...

Let the LLM Write the Prompts: An Intro to DSPy in Compound AI Pipelines

YouTube video by Databricks

www.youtube.com

October 4, 2025 at 11:05 PM

Reposted by Omar Khattab

PyData Boston

@pydatabos.bsky.social

#pydatabos interesting! How the Arbor library works under the hood hand in hand with DSPy

October 15, 2025 at 11:42 PM

Omar Khattab

@lateinteraction.bsky.social

premature optimization is the sqrt of all evil

PyData Boston @pydatabos.bsky.social · Oct 15

#pydatabos one line motivation for using DSPy!

October 29, 2025 at 4:26 PM

Reposted by Omar Khattab

PyData Boston

@pydatabos.bsky.social

#pydatabos one line motivation for using DSPy!

October 15, 2025 at 11:56 PM

Reposted by Omar Khattab

joelniklaus.bsky.social

@joelniklaus.bsky.social

Stop what you are doing and try out GEPA now!

"GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning" presents such elegant ideas by a collection of amazing researchers!

Here is a tldr of how it works:

October 21, 2025 at 3:03 PM

Reposted by Omar Khattab

Tim Kellogg

@timkellogg.me

colbert-muvera-micro a 4M(!!) late interaction model

late interaction models do embedding vector index queries and reranking at the same time leading to far higher accuracy

huggingface.co/NeuML/colber...

A diagram illustrating a dual-encoder retrieval model using MaxSim scoring.
• On the left (green box): labeled “Query Encoder, f_Q”. It takes a Query as input and produces multiple vector embeddings (rectangles).
• On the right (blue box): labeled “Document Encoder, f_D”. It takes a Document as input and produces multiple vector embeddings (rectangles). This block is marked with “Offline Indexing” along the side, showing that documents are pre-encoded.
• Between the two encoders: dotted and solid arrows connect query embeddings to document embeddings, representing similarity comparisons.
• Each comparison goes through a “MaxSim” operation (highlighted boxes), which selects the maximum similarity for each query token across document tokens.
• At the top: outputs of MaxSim flow into a summation node (Σ) to produce a single score for ranking.

This shows the ColBERT (Contextualized Late Interaction) retrieval framework: query and document are encoded separately, interactions are computed via maximum similarity per query token, and results are aggregated into a score.

September 19, 2025 at 11:15 AM

Reposted by Omar Khattab

Latitude77

@latitude77.bsky.social

Let the Model Write the Prompt | Drew Breunig #dspy #promptengineering #llms #generativeai

June 19, 2025 at 3:54 PM

Reposted by Omar Khattab

Drew Breunig

@dbreunig.bsky.social

Here's the write up of my Data+AI Summit talk on the perils of prompts in code and how to mitigate them with DSPy. www.dbreunig.com/2025/06/10/l...

Let the Model Write the Prompt

Notes from a talk I delivered at the 2025 Data + AI Summit, detailing the problem with prompts in your code and how DSPy can make everything better.

www.dbreunig.com

June 15, 2025 at 4:57 PM

Reposted by Omar Khattab

MLflow

@mlflow.org

Have you heard the news? #MLflow now supports tracking for DSPy optimization workflows—just like it does for #PyTorch training!

Keep reading to see what this means for your #LLM projects… 👇

#opensource #dspy #oss

May 30, 2025 at 3:08 PM

Reposted by Omar Khattab

MLflow

@mlflow.org

📣 TODAY at 4PM PT - MLflow Community Meetup!

🔗 Register today 👉 lu.ma/mlflow423

Join the global MLflow community for two exciting tech deep dives:
🔹 MLflow + #DSPy Integration
🔹 Cleanlab + #MLflow

🎥 Streaming live on YouTube, LinkedIn, and X
💬 Live Q&A with the presenters

#opensource #oss

MLflow Community Meetup | April 23 · Luma

Join us for the next MLflow Community Meetup — Wednesday, April 23 at 4PM PT! We’re bringing two exciting presentations to the community: 🔹 MLflow + DSPy…

lu.ma

April 23, 2025 at 7:21 PM

Reposted by Omar Khattab

MLflow

@mlflow.org

MLflow now supports tracking for #DSPy (Community) optimization — just like it does for @pytorch.org training! 🙌

#MLflow is the first to bring full visibility into DSPy’s prompt optimization process. More observability, less guesswork.

Get started today! ➡️ medium.com/@AI-on-Datab...

#opensource

April 21, 2025 at 7:20 PM

Reposted by Omar Khattab

MLflow

@mlflow.org

Join us for the next MLflow Community Meetup — Wednesday, April 23 at 4PM PT! 🗓️

🔹 Explore the new MLflow + #DSPy integration
🔹 Learn how Cleanlab adds trust to AI workflows with MLflow

💬 Live Q&A + demos
📺 Streamed on YouTube, LinkedIn, and X
👉 RSVP: lu.ma/mlflow423

#opensource #mlflow #oss

MLflow Monthly Meetup · Luma

Join us for the next MLflow Community Meetup — Wednesday, April 23 at 4PM PT! We’re bringing two exciting presentations to the community: 🔹 MLflow + DSPy…

lu.ma

April 15, 2025 at 7:51 PM

Omar Khattab

@lateinteraction.bsky.social

Yes there's an evals crisis, but evaluating *models* is not even the right question most of the time

LangProBe from Shangyin Tan, @lakshyaaagrawal.bsky.social, Arnav Singhvi, Liheng Lai, @michaelryan207.bsky.social et al begins to ask what complete *AI systems* we should build & under what settings

Lakshya A Agrawal @lakshyaaagrawal.bsky.social · Mar 3

🧵Introducing LangProBe: the first benchmark testing where and how composing LLMs into language programs affects cost-quality tradeoffs!

We find that, on avg across diverse tasks, smaller models within optimized programs beat calls to larger models at a fraction of the cost.

March 3, 2025 at 7:42 PM

Reposted by Omar Khattab

Lakshya A Agrawal

@lakshyaaagrawal.bsky.social

🧵Introducing LangProBe: the first benchmark testing where and how composing LLMs into language programs affects cost-quality tradeoffs!

We find that, on avg across diverse tasks, smaller models within optimized programs beat calls to larger models at a fraction of the cost.

March 3, 2025 at 6:59 PM

Omar Khattab

@lateinteraction.bsky.social

Composition & abstraction are the foundations of CS, but are clearly absent in modern ML.

It's not that they're not crucial for intelligent software. But it takes building many half-working systems to abstract successfully, and it takes good abstractions to have primitives worth composing.

🧵1/2

February 26, 2025 at 10:00 PM

Omar Khattab

@lateinteraction.bsky.social

Some quick thoughts: On why we gave the ColBERT paradigm the name "late interaction" instead of "multi-vector", a term that emerged later and that has proven to be more intuitive.

**The mechanism is actually not about having multiple vectors at all.** You can see this in four different ways.

🧵1/7

February 26, 2025 at 7:14 PM

Omar Khattab

@lateinteraction.bsky.social

Someone needs to write the book “Modern Machine Learning — the math, the myth, the legend”

February 1, 2025 at 5:40 AM

Omar Khattab

@lateinteraction.bsky.social

a statistician walks into an error bar

surprisingly, not everyone was mean to him

January 28, 2025 at 8:31 PM

Omar Khattab

@lateinteraction.bsky.social

What do you call LLMs that exhibit reward hacking for translation?

Agents gone ROUGE.

January 5, 2025 at 3:15 AM

Omar Khattab

@lateinteraction.bsky.social

Reminder to self:

Much like a good paper always starts with (i) the status of the field at the time of the proposal, (ii) what’s an annoying gap, and (iii) the novel intuition that motivates a new insightful proposal, a good course lecture must also situate every major concept this way.

[1/3]

December 31, 2024 at 9:21 PM

Omar Khattab

@lateinteraction.bsky.social

When building ColBERT, I sort of assumed it will pave the way for hypernetwork-based, pruning-capable retrieval indexes. Let me explain.

The big insight in ColBERT is that we can encode each document upfront *not* into a vector, but into a rich scoring function, f: query -> float, which ... [1/3]

December 31, 2024 at 5:29 PM

Reposted by Omar Khattab

Ana Marasović

@anamarasovic.bsky.social

Anyone here have experience with working with someone in @aclmeeting.bsky.social to get a confirmation of reviewing/awards/etc for the purpose of US work visa / green card? If so, please can you share their contact with me 🙏

December 29, 2024 at 6:54 PM

Reposted by Omar Khattab

Drew Breunig

@dbreunig.bsky.social

Strongly agree. Especially when you need to be nimble enough to adopt new or different models.

Yoav Goldberg @yoavgo.bsky.social · Dec 29

"prompt engineering" is much more about effective processes for creating good prompts for your data, than it is about specific techniques. which is to a large extent why platforms like dspy work: they kinda force you into a processes, while attempting to automate the techniques.

December 29, 2024 at 5:35 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news