Lightnews — Scholar-powered news

Reposted by Johannes Schusterbauer

Pingchuan Ma

@pima-hyphen.bsky.social

I’m thrilled to share that I’ll present two first-authored papers at #ICCV2025 🌺 in Honolulu together with @mgui7.bsky.social ! 🏝️
(Thread 🧵👇)

October 18, 2025 at 3:01 AM

Johannes Schusterbauer

@joh-schb.bsky.social

🤔 What if you could generate an entire image using just one continuous token?

💡 It works if we leverage a self-supervised representation!

Meet RepTok🦎: A generative model that encodes an image into a single continuous latent while keeping realism and semantics. 🧵 👇

October 17, 2025 at 10:21 AM

Reposted by Johannes Schusterbauer

Stefan Baumann

@stefanabaumann.bsky.social

🤔 What happens when you poke a scene — and your model has to predict how the world moves in response?

We built the Flow Poke Transformer (FPT) to model multi-modal scene dynamics from sparse interactions.

It learns to predict the 𝘥𝘪𝘴𝘵𝘳𝘪𝘣𝘶𝘵𝘪𝘰𝘯 of motion itself 🧵👇

October 15, 2025 at 1:56 AM

Johannes Schusterbauer

@joh-schb.bsky.social

Looking forward to attending #CVPR2025 in Nashville next week 🎸🎶 @mgui7.bsky.social and I will be presenting our latest work:

🌊 Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment

June 6, 2025 at 3:48 PM

Johannes Schusterbauer

@joh-schb.bsky.social

Sunrise in the office after the #ICCV deadline night with @mgui7.bsky.social 🚀

March 8, 2025 at 5:46 AM

Reposted by Johannes Schusterbauer

CompVis - Computer Vision and Learning LMU Munich

@compvis.bsky.social

www.youtube.com/watch?v=bCy6...

Building a New Foundation Model (Björn Ommer) | DLD25

YouTube video by DLD Conference

www.youtube.com

January 20, 2025 at 10:01 AM

Reposted by Johannes Schusterbauer

Jan-Hendrik Müller

@kolibril13.bsky.social

Over 60 German universities and research institutions announced their departure from X today.

Amrei Bahr @amreibahr.bsky.social · Jan 10

Strong signal!!

More than 60 German universities and research institutions announced their departure from X today, see below. #eXit

They say X is no longer compatible with their core values: “openness to the world, scientific integrity, transparency, and democratic discourse.”

List of participants here:

Universities and research institutions leave platform X - Together for diversity, freedom and science

nachrichten.idw-online.de

January 10, 2025 at 10:45 AM

Reposted by Johannes Schusterbauer

Pingchuan Ma

@pima-hyphen.bsky.social

🤔 When combining vision-language models (VLMs) with large language models (LLMs), do VLMs benefit from additional genuine semantics or artificial augmentations of the text for downstream tasks?

🤨Interested? Check out our latest work at #AAAI25:

💻Code and 📝Paper at: github.com/CompVis/DisCLIP

🧵👇

January 8, 2025 at 3:54 PM

Johannes Schusterbauer

@joh-schb.bsky.social

Congrats to @frankfundel.bsky.social for publishing this work at WACV🔥

It has been a pleasure to work jointly on this topic with such a talented master's student🤗

Looking forward to seeing what comes next!🚀

Frank Fundel @frankfundel.bsky.social · Dec 6

Did you know you can distill the capabilities of a large diffusion model into a small ViT? ⚗️
We showed exactly that for a fundamental task:
semantic correspondence📍

A thread 🧵👇

December 6, 2024 at 5:05 PM

Johannes Schusterbauer

@joh-schb.bsky.social

Awesome work from some colleagues cleaning up diffusion features!🚀

Nick Stracke @rmsnorm.bsky.social · Dec 4

🤔 Why do we extract diffusion features from noisy images? Isn’t that destroying information?

Yes, it is - but we found a way to do better. 🚀

Here’s how we unlock better features, no noise, no hassle.

📝 Project Page: compvis.github.io/cleandift
💻 Code: github.com/CompVis/clea...

🧵👇

December 5, 2024 at 7:06 AM

Reposted by Johannes Schusterbauer

Sander Dieleman

@sedielem.bsky.social

IMO VQGAN is why GANs deserve the NeurIPS test of time award. Suddenly our image representations were an order of magnitude more compact. Absolute game changer for generative modelling at scale, and the basis for latent diffusion models.

Taming Transformers for High-Resolution Image Synthesis

Designed to learn long-range interactions on sequential data, transformers continue to show state-of-the-art results on a wide variety of tasks. In contrast to CNNs, they contain no inductive bias tha...

arxiv.org

November 28, 2024 at 12:09 AM

Reposted by Johannes Schusterbauer

Anton Obukhov

@obukhov.ai

Check out my GenAI starter pack! go.bsky.app/BT1bRvZ

November 23, 2024 at 10:45 AM

Reposted by Johannes Schusterbauer

Stefan Baumann

@stefanabaumann.bsky.social

After many years, our lab finally has a social media presence at @compvis.bsky.social ! 🥳
Give it a follow, we have some amazing research on generative computer vision coming soon!

November 20, 2024 at 6:31 PM

Reposted by Johannes Schusterbauer

Nick Stracke

@rmsnorm.bsky.social

me right now..

November 20, 2024 at 2:22 PM

Reposted by Johannes Schusterbauer

Sander Dieleman

@sedielem.bsky.social

In a gratuitous attempt to acquire more followers myself 😁, I've made a start on a "starter pack". Hopefully as more people from 🐦 make it over to 🦋, we can extend this a bit. Suggestions welcome!

I've noticed not all accounts seem to be eligible to be added, anyone know what's up with that? 🤔

November 15, 2024 at 8:04 PM
