Xi WANG
@xiwang92.bsky.social
Ecole Polytechnique, IP Paris; Prev. Ph.D.@Univ Rennes, Inria/IRISA
https://triocrossing.github.io/
Our approach fundamentally differs from previous distillation methods, such as DMD. Instead of minimizing the divergence of denoising distributions across the entire latent space, Di[M]O minimizes the divergence of token-level conditional distributions.
March 21, 2025 at 3:36 PM
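A minimal sketch of the token-level objective described above, not the official Di[M]O code: each token's conditional distribution over the vocabulary is compared between teacher and student, rather than computing a single divergence over the whole latent space. The tensor shapes, the KL direction, and the names (`teacher_logits`, `student_logits`, `masked`) are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def token_level_kl(teacher_logits, student_logits, masked):
    """teacher_logits, student_logits: [B, L, V]; masked: [B, L] bool, positions to compare."""
    teacher_logp = F.log_softmax(teacher_logits, dim=-1)  # log p_teacher(token_i | state)
    student_logp = F.log_softmax(student_logits, dim=-1)  # log p_student(token_i | state)
    # KL computed independently for each token's conditional distribution over the vocabulary
    kl = (teacher_logp.exp() * (teacher_logp - student_logp)).sum(dim=-1)  # [B, L]
    return (kl * masked).sum() / masked.sum().clamp(min=1)

# Toy usage with random logits
B, L, V = 2, 16, 1000
loss = token_level_kl(torch.randn(B, L, V), torch.randn(B, L, V), torch.rand(B, L) < 0.5)
```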
To approximate the loss gradient, we introduce an auxiliary model that estimates an otherwise intractable term in the loss function. The auxiliary model is trained using a standard MDM training loss, with one-step generated samples as targets.
March 21, 2025 at 3:36 PM
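A hedged sketch of how the auxiliary model's training signal could look under a standard MDM objective: re-mask the student's one-step samples and predict the hidden tokens with cross-entropy. The names (`aux_model`, `student`, `MASK_ID`) and the single mask ratio are assumptions for illustration, not the paper's API.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

V, MASK_ID = 1000, 0  # hypothetical vocabulary size and [MASK] token id

def aux_mdm_loss(aux_model, student, masked_input, mask_ratio=0.5):
    # One-step sample from the student serves as the target sequence.
    with torch.no_grad():
        target = student(masked_input).argmax(dim=-1)                       # [B, L]
    # Standard MDM training: re-mask the target and predict the hidden tokens.
    remask = torch.rand_like(target, dtype=torch.float) < mask_ratio
    corrupted = torch.where(remask, torch.full_like(target, MASK_ID), target)
    logits = aux_model(corrupted)                                           # [B, L, V]
    ce = F.cross_entropy(logits.transpose(1, 2), target, reduction="none")  # [B, L]
    return (ce * remask).sum() / remask.sum().clamp(min=1)

# Toy usage with a stand-in network (a real setup would use transformer MDMs).
toy = nn.Sequential(nn.Embedding(V, 64), nn.Linear(64, V))
loss = aux_mdm_loss(toy, toy, torch.full((2, 16), MASK_ID))
```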
To sample from the correct joint distribution, we introduce an initialization that maps a randomized input sequence to an almost deterministic target sequence.
Without proper initialization, the model may suffer from divergence or mode collapse, making this step essential.
March 21, 2025 at 3:36 PM
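One plausible, heavily hedged reading of this initialization step, not the paper's exact procedure: warm up the one-step generator so that randomized masked inputs are mapped to a fixed, almost deterministic target sequence before distillation begins. Every name and hyperparameter below (`student`, `target_seq`, `MASK_ID`, the warm-up length) is an illustrative assumption.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

V, MASK_ID = 1000, 0
student = nn.Sequential(nn.Embedding(V, 64), nn.Linear(64, V))  # stand-in for a transformer MDM
opt = torch.optim.AdamW(student.parameters(), lr=1e-4)

target_seq = torch.randint(1, V, (1, 16))         # the (nearly deterministic) target sequence

for _ in range(100):                               # short warm-up loop
    noise = torch.randint(1, V, (8, 16))           # randomized input sequences
    keep = torch.rand(8, 16) < 0.1                 # keep a few random tokens, mask the rest
    x0 = torch.where(keep, noise, torch.full_like(noise, MASK_ID))
    logits = student(x0)                           # [8, 16, V]
    loss = F.cross_entropy(logits.transpose(1, 2), target_seq.expand(8, -1))
    opt.zero_grad(); loss.backward(); opt.step()
```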
The key idea is inspired by on-policy distillation. We align the output distributions of the teacher and student models at student-generated intermediate states, ensuring that the student's generation closely matches the teacher's by covering all possible intermediate states.
March 21, 2025 at 3:36 PM
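A simplified sketch of this on-policy alignment under assumed names and shapes: the intermediate states are built by re-masking the student's own one-step outputs, and the teacher and student token distributions are aligned at those states. The actual Di[M]O objective additionally relies on the auxiliary model from the earlier post to estimate the intractable gradient term, which is omitted here.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

V, MASK_ID = 1000, 0  # hypothetical vocabulary size and [MASK] token id

def onpolicy_align_loss(student, teacher, masked_input):
    # 1) The student produces its own one-step generation (the "on-policy" sample).
    with torch.no_grad():
        x_gen = student(masked_input).argmax(dim=-1)                        # [B, L]
    # 2) Re-mask it at a random ratio to build a student-generated intermediate state x_t.
    ratio = torch.rand(x_gen.size(0), 1)
    remask = torch.rand_like(x_gen, dtype=torch.float) < ratio
    x_t = torch.where(remask, torch.full_like(x_gen, MASK_ID), x_gen)
    # 3) Align teacher and student token-level distributions at x_t
    #    (token-level KL as in the earlier sketch; direction and weighting are illustrative).
    with torch.no_grad():
        teacher_logp = F.log_softmax(teacher(x_t), dim=-1)
    student_logp = F.log_softmax(student(x_t), dim=-1)
    kl = (teacher_logp.exp() * (teacher_logp - student_logp)).sum(-1)       # [B, L]
    return (kl * remask).sum() / remask.sum().clamp(min=1)

# Toy usage with stand-in networks.
student = nn.Sequential(nn.Embedding(V, 64), nn.Linear(64, V))
teacher = nn.Sequential(nn.Embedding(V, 64), nn.Linear(64, V))
loss = onpolicy_align_loss(student, teacher, torch.full((2, 16), MASK_ID))
```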
Masked Diffusion Models (MDMs) are a hot topic in generative AI 🔥 — powerful but slow due to multiple sampling steps.
At @polytechniqueparis.bsky.social and @inria-grenoble.bsky.social, we introduce Di[M]O, a novel approach to distill MDMs into a one-step generator without sacrificing quality.
March 21, 2025 at 3:36 PM