Lightnews — Scholar-powered news

Gene Chou

@gene-chou.bsky.social

520 followers 160 following 7 posts

PhD student at Cornell, interested in 3D generation, reconstruction; prev Princeton '22

https://genechou.com

Posts Replies Media Videos

Gene Chou

@gene-chou.bsky.social

We design a self-supervised method that takes advantage of the consistency of videos and variability of multiview internet photos to train a 3D-aware video model without ANY 3D annotations. We name our method KFC-W (KeyFrame-Conditioned video generation in-the-Wild). 4/N

November 21, 2024 at 4:19 PM

Gene Chou

@gene-chou.bsky.social

This task is difficult for video models! They perform visually pleasing morphing but can ignore scene identity and hallucinate new structures. Our main insight is that training for general video synthesis is not enough: we need to introduce scalable, 3D-aware objectives. 3/N

November 21, 2024 at 4:19 PM

Gene Chou

@gene-chou.bsky.social

We've released our paper "Generating 3D-Consistent Videos from Unposed Internet Photos"! Video models like Luma generate pretty videos, but sometimes struggle with 3D consistency. We can do better by scaling them with 3D-aware objectives. 1/N

page: genechou.com/kfcw

November 21, 2024 at 4:19 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news