Lightnews — Scholar-powered news

Thomas Kipf

@tkipf.bsky.social

6.1K followers 330 following 24 posts

Research at Google DeepMind. Ex-Physicist. Controllable World Simulators (GNNs, Structured World Models, Neural Assets). TLM Veo Capabilities (Ingredients & more).

📍 San Francisco, CA

Posts Replies Media Videos

Thomas Kipf

@tkipf.bsky.social

Two life updates:

1) About a year ago I decided to join the Veo team to work on capabilities. It’s been a fun ride! Excited for what’s still to come.

2) I've been busy caring for a newborn the past couple of days 🥰 Excited for the incredible world he will grow up in. Veo's impression below:

May 27, 2025 at 8:04 PM

Thomas Kipf

@tkipf.bsky.social

I gave a talk on Compositional World Models at NeurIPS last week 🌐

The recording is now online: neurips.cc/virtual/2024... (for registered attendees; starts at 6:06:00)

Workshop: compositional-learning.github.io

December 19, 2024 at 1:57 AM

Thomas Kipf

@tkipf.bsky.social

Check out the paper & website for emergent scene tracking examples:

📜https://arxiv.org/abs/2411.05927
🌐https://moog-paper.github.io

We can visualize token attention to see what part of the scene they take responsibility for – we find that they capture/track the 3D content of the scene.

November 15, 2024 at 6:09 AM

Thomas Kipf

@tkipf.bsky.social

The world doesn’t live on a pixel grid and neither should vision models!

Excited to share Moving off-the-Grid (MooG): a video model w/o grid-based representations. MooG learns detached “off-the-grid tokens” that bind to (and track) scene elements as camera & content move.

🧵

November 15, 2024 at 6:09 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news