Lightnews — Scholar-powered news

Medhini Narasimhan

@medhini.bsky.social

19 followers 51 following 0 posts

Researcher @Google Deepmind working on Veo.

UC Berkeley/BAIR PhD, UIUC MS/CS

medhini.github.io

Posts Replies Media Videos

Reposted by Medhini Narasimhan

Michael Tschannen

@mtschannen.bsky.social

Have you ever wondered how to train an autoregressive generative transformer on text and raw pixels, without a pretrained visual tokenizer (e.g. VQ-VAE)?

We have been pondering this during summer and developed a new model: JetFormer 🌊🤖

arxiv.org/abs/2411.19722

A thread 👇

1/

December 2, 2024 at 4:41 PM

Reposted by Medhini Narasimhan

Sander Dieleman

@sedielem.bsky.social

The link between diffusion models and optimal transport is still a bit of an enigma to me.

One thing that's clear: different diffusion models trained on similar datasets tend to recover similar mappings. If these are generally not OT, in what sense are they optimal instead?

Gabriel Peyré @gabrielpeyre.bsky.social · Nov 30

I wrote a summary of the main ingredients of the neat proof by Hugo Lavenant that diffusion models do not generally define optimal transport. github.com/mathematical...

November 30, 2024 at 12:56 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news