Lightnews — Scholar-powered news

Gene Chou

@gene-chou.bsky.social

520 followers 160 following 7 posts

PhD student at Cornell, interested in 3D generation, reconstruction; prev Princeton '22

https://genechou.com

Posts Replies Media Videos

Gene Chou

@gene-chou.bsky.social

This work wouldn't have been possible without my internship mentor Kai, advisors @snavely.bsky.social @bharathhariharan.bsky.social, and other coauthors.

Project Page: genechou.com/kfcw
N/N

November 21, 2024 at 4:19 PM

Gene Chou

@gene-chou.bsky.social

Our method outperforms commercial models in terms of geometric and appearance consistency, and we show that video models trained with 3D-aware objectives can be useful as 3D priors for downstream tasks such as SfM and 3DGS. Check out our paper for more details! 5/N

November 21, 2024 at 4:19 PM

Gene Chou

@gene-chou.bsky.social

We design a self-supervised method that takes advantage of the consistency of videos and variability of multiview internet photos to train a 3D-aware video model without ANY 3D annotations. We name our method KFC-W (KeyFrame-Conditioned video generation in-the-Wild). 4/N

November 21, 2024 at 4:19 PM

Gene Chou

@gene-chou.bsky.social

This task is difficult for video models! They perform visually pleasing morphing but can ignore scene identity and hallucinate new structures. Our main insight is that training for general video synthesis is not enough: we need to introduce scalable, 3D-aware objectives. 3/N

November 21, 2024 at 4:19 PM

Gene Chou

@gene-chou.bsky.social

We propose the task of generating videos from sparse (2-5), unposed internet photos. A model’s ability to capture underlying geometry, recognize scene identity, and relate frames in terms of camera position reflects its understanding of 3D structure and scene layout. 2/N

November 21, 2024 at 4:19 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news