Lightnews — Scholar-powered news

Marco Cuturi

@marcocuturi.bsky.social

760 followers 58 following 25 posts

machine learning researcher @ Apple machine learning research

Posts Replies Media Videos

Marco Cuturi

@marcocuturi.bsky.social

We have been working with Michal Klein on pushing a module to train *flow matching* models using JAX. This is shipped as part of our new release of the OTT-JAX toolbox (github.com/ott-jax/ott)

The tutorial to do so is here: ott-jax.readthedocs.io/tutorials/ne...

November 5, 2025 at 2:04 PM

Marco Cuturi

@marcocuturi.bsky.social

While working on semidiscrete flow matching this summer (➡️ arxiv.org/abs/2509.25519), I kept looking for a video illustrating that the velocity field solving the Benamou-Brenier OT problem is NOT constant w.r.t. time ⏳... so I did it myself, take a look! ott-jax.readthedocs.io/tutorials/th...

October 9, 2025 at 8:09 PM

Marco Cuturi

@marcocuturi.bsky.social

for people that like OT, IMHO the very encouraging insight is that we have evidence that the "better" you solve your OT problem, the more flow matching metrics improve, this is Figure 3

October 4, 2025 at 8:45 AM

Marco Cuturi

@marcocuturi.bsky.social

Thanks @rflamary.bsky.social! yes, exactly. We try to summarize this tradeoff in Table 1, in which we show that for a one-off preprocessing cost, we now get all (noise,data) pairings you might need during flow matching training for "free" (up to the MIPS lookup for each noise).

October 4, 2025 at 8:44 AM

Marco Cuturi

@marcocuturi.bsky.social

This much faster than using Sinkhorn, and generates with higher quality.

As a bonus, you can forget about entropy regularization (set ε=0), apply things like correctors to guidance, and use it on consistency-type models, or even with conditional generation.

October 3, 2025 at 9:00 PM

Marco Cuturi

@marcocuturi.bsky.social

In practice, however, this idea only begins to work when using massive batch sizes (see arxiv.org/abs/2506.05526). The problem is that the costs of running Sinkhorn on millions of points can quickly balloon...

Our solution? rely on semidiscrete OT at scales that were never considered before.

October 3, 2025 at 8:56 PM

Marco Cuturi

@marcocuturi.bsky.social

Our two phenomenal interns, Alireza Mousavi-Hosseini and Stephen Zhang @syz.bsky.social have been cooking some really cool work with Michal Klein and me over the summer.

Relying on optimal transport couplings (to pick noise and data pairs) should, in principle, be helpful to guide flow matching

🧵

October 3, 2025 at 8:50 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news