Lightnews — Scholar-powered news

Reposted by Nicolas Dufour

Lucas Degeorge

@lucasdegeorge.bsky.social

Check out our new work: MIRO

No more post-training alignment!
We integrate human alignment right from the start, during pretraining!

Results:
✨ 19x faster convergence ⚡
✨ 370x less compute 💻

🔗 Explore the project: nicolas-dufour.github.io/miro/

October 31, 2025 at 9:11 PM

Reposted by Nicolas Dufour

Michiel Bontenbal

@mpbontenbal.bsky.social

Image generation becomes much more energy efficient. 👍

Nicolas Dufour @nicolasdufour.bsky.social · 11d

We introduce MIRO: a new paradigm for T2I model alignment integrating reward conditioning into pretraining, eliminating the need for separate fine-tuning/RL stages. This single-stage approach offers unprecedented efficiency and control.

- 19x faster convergence ⚡
- 370x less FLOPS than FLUX-dev 📉

October 31, 2025 at 8:28 PM

Reposted by Nicolas Dufour

David Picard

@davidpicard.bsky.social

I'm super happy about Nicolas' latest work, probably the magnum opus of his PhD.

Read the thread for all the great details.
The main conclusion I draw from this work is that better pretraining, in particular by conditioning on better data, allows us to train SOTA models at a fraction of the cost.

Nicolas Dufour @nicolasdufour.bsky.social · 11d

We introduce MIRO: a new paradigm for T2I model alignment integrating reward conditioning into pretraining, eliminating the need for separate fine-tuning/RL stages. This single-stage approach offers unprecedented efficiency and control.

- 19x faster convergence ⚡
- 370x less FLOPS than FLUX-dev 📉

October 31, 2025 at 11:39 AM

Nicolas Dufour

@nicolasdufour.bsky.social

We introduce MIRO: a new paradigm for T2I model alignment integrating reward conditioning into pretraining, eliminating the need for separate fine-tuning/RL stages. This single-stage approach offers unprecedented efficiency and control.

- 19x faster convergence ⚡
- 370x less FLOPS than FLUX-dev 📉

October 31, 2025 at 11:24 AM

Reposted by Nicolas Dufour

Mathurin Massias

@mathurinmassias.bsky.social

Kickstarting our workshop on Flow matching and Diffusion with a talk by Eric Vanden Eijnden on how to optimize learning and sampling in Stochastic Interpolants!

Broadcast available at gdr-iasis.cnrs.fr/reunions/mod...

October 24, 2025 at 8:30 AM

Reposted by Nicolas Dufour

David Picard

@davidpicard.bsky.social

Final note: I'm (we're) tempted to organize a challenge on that topic as a workshop at a CV conf. ImageNet is the only source of images allowed and then you compete to get the bold numbers.

Do you think there would be people in for that? Do you think it would make for a nice competition?

October 8, 2025 at 8:43 PM

Reposted by Nicolas Dufour

Vicky Kalogeiton

@vickykalogeiton.bsky.social

Very proud of our recent work, kudos to the team! Read @davidpicard.bsky.social’s excellent post for more details or the paper arxiv.org/pdf/2502.21318

October 8, 2025 at 9:19 PM

Reposted by Nicolas Dufour

David Picard

@davidpicard.bsky.social

Today is Antoine Guedon's PhD! Already pretty cool visuals right at the start.

September 25, 2025 at 3:17 PM

Reposted by Nicolas Dufour

David Picard

@davidpicard.bsky.social

Annnnnd it's a reject!

Scale is a religion and if you go against it, you're a heretic and you should burn, "despite [the reviewers] final ratings".

But scale is still not necessary!

Side note: First time swinging reviews up (from 2,2,4,4 to 2,4,4,5) does not get the paper accepted. Strange days.

David Picard @davidpicard.bsky.social · Aug 7

Dear bsky friends, I have a question: Do you really think that the visual quality of these images is so bad that the research that produced them is deeply flawed?
And if I told you that the model was mostly trained on ImageNet with a bit of artistic fine-tuning at 1024 resolution, still really bad?

September 18, 2025 at 5:04 PM

Reposted by Nicolas Dufour

François Rozet

@francois-rozet.bsky.social

Does a smaller latent space lead to worse generation in latent diffusion models? Not necessarily! We show that LDMs are extremely robust to a wide range of compression rates (10-1000x) in the context of physics emulation.

We got lost in latent space. Join us 👇

September 3, 2025 at 1:40 PM

Reposted by Nicolas Dufour

David Picard

@davidpicard.bsky.social

Next week, I'll be in Strasbourg for the GRETSI (@gretsi-info.bsky.social) to present a small discovery on transformers generalization we made with Simon and Jérémie while working on generative recommender systems. I love these "phase transition" plots.

📜: arxiv.org/abs/2508.03934

Short summary 👇

August 23, 2025 at 10:12 AM

Nicolas Dufour

@nicolasdufour.bsky.social

🚀 DinoV3 just became the new go-to backbone for geoloc!
It outperforms CLIP-like models (SigLip2, finetuned StreetCLIP)… and that’s shocking 🤯
Why? CLIP models have an innate advantage — they literally learn place names + images. DinoV3 doesn’t.

August 18, 2025 at 3:14 PM

Reposted by Nicolas Dufour

David Picard

@davidpicard.bsky.social

Dear bsky friends, I have a question: Do you really think that the visual quality of these images is so bad that the research that produced them is deeply flawed?
And if I told you that the model was mostly trained on ImageNet with a bit of artistic fine-tuning at 1024 resolution, still really bad?

August 7, 2025 at 6:43 AM

Nicolas Dufour

@nicolasdufour.bsky.social

I had the privilege to be invited to speak about our work "Around the World in 80 Timesteps" at the French Podcast Underscore! If you speak french, i highly recommend it they did a great job with the montage!

If you want to learn more nicolas-dufour.github.io/plonk

www.youtube.com/watch?v=s5oH...

Il a conçu la première IA d’OSINT (terrifiant… et génial)

YouTube video by Underscore_

www.youtube.com

July 31, 2025 at 4:43 PM

Reposted by Nicolas Dufour

Andrei Bursuc

@abursuc.bsky.social

1/ Can open-data models beat DINOv2? Today we release Franca, a fully open-sourced vision foundation model. Franca with ViT-G backbone matches (and often beats) proprietary models like SigLIPv2, CLIP, DINOv2 on various benchmarks setting a new standard for open-source research.

July 21, 2025 at 2:47 PM

Reposted by Nicolas Dufour

Pierre Marion

@pierremarion.bsky.social

✨Thrilled to see EurIPS launch — the first officially endorsed European NeurIPS presentation venue!

👀 But NeurIPS now requires at least one author to attend in San Diego or Mexico (and not just virtually as before). This is detrimental to many. Why not allow presenting at EurIPS or online?
1/4

EurIPS Conference @euripsconf.bsky.social · Jul 16

EurIPS is coming! 📣 Mark your calendar for Dec. 2-7, 2025 in Copenhagen 📅

EurIPS is a community-organized conference where you can present accepted NeurIPS 2025 papers, endorsed by @neuripsconf.bsky.social and @nordicair.bsky.social and is co-developed by @ellis.eu

eurips.cc

July 17, 2025 at 8:49 AM

Reposted by Nicolas Dufour

Imagine-ENPC

@imagineenpc.bsky.social

Some of our IMAGINE members at #CVPR2025

June 15, 2025 at 7:14 PM

Reposted by Nicolas Dufour

David Picard

@davidpicard.bsky.social

Come on! Who else has a hot air ballon on their poster?

(fun fact: there is no hot air ballon emoji, but @loicland.bsky.social made a tikz macro for it! 😅)

Nicolas Dufour @nicolasdufour.bsky.social · Jun 15

Come see us in poster 186 to see our poster Around the World in 80 timesteps: A generative Approach to Global Visual Geolocation!

Cc @loicland.bsky.social @davidpicard.bsky.social @vickykalogeiton.bsky.social

June 15, 2025 at 3:57 PM

Nicolas Dufour

@nicolasdufour.bsky.social

Come see us in poster 186 to see our poster Around the World in 80 timesteps: A generative Approach to Global Visual Geolocation!

Cc @loicland.bsky.social @davidpicard.bsky.social @vickykalogeiton.bsky.social

June 15, 2025 at 3:30 PM

Reposted by Nicolas Dufour

Elliot Vincent

@elliotvincent.bsky.social

A bit disappointed by the PAMI TC meeting, mostly repetitions of what’s been said at the opening, the "open discussion" slide was really just there to *exist* but no discussion/vote took place, no topic was debated. What space is left to reflect on our community and what we stand for as scientists?

June 14, 2025 at 11:25 PM

Reposted by Nicolas Dufour

Vincent Lepetit

@vincentlepetit.bsky.social

I am heartbroken that I am not at the conference, but seeing what the government is doing to its people and the world, I simply couldn't go there.

June 14, 2025 at 9:51 AM

Reposted by Nicolas Dufour

Elliot Vincent

@elliotvincent.bsky.social

I will also be presenting CoDeX at the same workshop between 1:15PM and 1:45PM.

Abhishek Kuriyal, Mathieu Aubry, @loicland.bsky.social and I improve the performance of deep learning models in challenging domain shift settings by learning how to combine spatial domain experts.

June 11, 2025 at 4:04 AM

Reposted by Nicolas Dufour

Elliot Vincent

@elliotvincent.bsky.social

Discover DAFA-LS, a dataset of SITS centered on Afghan archeological sites and annotated with preservation classification labels.

🎤 1:45PM Oral (room 208 B)
📰 4:30PM Poster (poster boards #419 – #443)

June 11, 2025 at 4:04 AM

Reposted by Nicolas Dufour

Elliot Vincent

@elliotvincent.bsky.social

I will be presenting our work on the detection of archaeological looting with satellite image time series at CVPR 2025 EarthVision workshop tomorrow!

Honored and grateful that this paper received the best student paper award!

June 11, 2025 at 4:04 AM

Nicolas Dufour

@nicolasdufour.bsky.social

I will be at #CVPR2025 this week in Nashville.

I will be presenting our paper "Around the World in 80 Timesteps:
A Generative Approach to Global Visual Geolocation".

We tackle geolocalization as a generative task allowing for SOTA performance and more interpretable predictions.

June 11, 2025 at 12:52 AM

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news