linyijin.bsky.social
jinlinyi.github.io
PhD student at University of Michigan; Student Researcher @Google DeepMind; ex-intern @Adobe
This type of data is ideal for learning the structure and dynamics of the real world.

We gave this a shot by extending DUSt3R to model 3D motion and training it on our dataset. Given a pair of frames, our model predicts a 3D point cloud and the corresponding 3D motion trajectories.
December 13, 2024 at 3:13 AM
Introducing 👀Stereo4D👀

A method for mining 4D from internet stereo videos. It enables large-scale, high-quality, dynamic, *metric* 3D reconstructions, with camera poses and long-term 3D motion trajectories.

We used Stereo4D to make a dataset of over 100k real-world 4D scenes.
December 13, 2024 at 3:13 AM