Lightnews — Scholar-powered news

Ruojin Cai

@ruojin.bsky.social

54 followers 61 following 6 posts

PhD Student at Cornell CS
https://www.cs.cornell.edu/~ruojin/

Posts Replies Media Videos

Ruojin Cai

@ruojin.bsky.social

We show that InterPose generalizes across 3 SOTA video models (DynamiCrafter, Runway Gen-3, Luma Dream Machine) and consistently outperforms DUSt3R on 4 diverse datasets (indoor, outdoor, object) using our new benchmark, which selects challenging pairs with little to no overlap.

December 23, 2024 at 5:44 PM

Ruojin Cai

@ruojin.bsky.social

⚠️ Challenge: Generated videos may contain visual artifacts or implausible motion.
🔑 Solution: We generate multiple videos and use a self-consistency metric to select the most visually consistent sample.

December 23, 2024 at 5:44 PM

Ruojin Cai

@ruojin.bsky.social

🤔Can Generative Video Models Help Pose Estimation?
✅Yes!
We find that generative video models can hallucinate plausible intermediate frames that provide useful context for pose estimators (e.g. DUSt3R), especially for images with little to no overlap.
🔗 inter-pose.github.io

December 23, 2024 at 5:44 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news