Sergio Izquierdo
@sizquierdo.bsky.social
PhD candidate at the University of Zaragoza.
Previously intern at Niantic Labs and Skydio.
Working on 3D reconstruction and Deep Learning.
serizba.github.io
We focused on depth from videos and, as you pointed out, we didn't train on datasets with multiple captures per scene.
March 31, 2025 at 3:51 PM
Check the website: nianticlabs.github.io/mvsanywhere/
And the paper: arxiv.org/pdf/2503.22430
Code coming soon!
Great work with @mohamedsayed.bsky.social @mdfirman.bsky.social @guiggh.bsky.social D. Turmukhambetov @jcivera.bsky.social @oisinmacaodha.bsky.social @gbrostow.bsky.social J. Watson
MVSAnywhere: Zero-Shot Multi-View Stereo, CVPR 2025
nianticlabs.github.io
March 31, 2025 at 12:52 PM
💡Use case:
We show how the accurate, robust depths from MVSAnywhere can regularize Gaussian splats, yielding much cleaner scene reconstructions.
As MVSAnywhere is agnostic to the scene scale, this is plug-and-play for your splats!
March 31, 2025 at 12:52 PM
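To picture the plug-and-play claim, here is a minimal sketch of how a scale-aligned MVS depth prior could enter splat optimization as an extra loss term. The function names, `lambda_depth`, and the L1 loss form are my assumptions for illustration, not the pipeline from the paper.

```python
import torch
import torch.nn.functional as F

def depth_regularization_loss(rendered_depth, mvs_depth, valid_mask):
    """L1 penalty between splat-rendered depth and an MVS depth prior.

    Because MVSAnywhere predicts depth at the same scale as the input
    cameras, no per-image scale/shift alignment is needed here.
    """
    diff = torch.abs(rendered_depth - mvs_depth)
    return (diff * valid_mask).sum() / valid_mask.sum().clamp(min=1)

def training_step(rendered_rgb, gt_rgb, rendered_depth, mvs_depth, valid_mask,
                  lambda_depth=0.1):
    # Hypothetical step: photometric loss plus a weighted depth prior.
    # rendered_rgb / rendered_depth would come from the splat rasterizer.
    photo = F.l1_loss(rendered_rgb, gt_rgb)
    depth = depth_regularization_loss(rendered_depth, mvs_depth, valid_mask)
    return photo + lambda_depth * depth
```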
🏆Results:
MVSAnywhere achieves state-of-the-art results on the Robust Multi-View Depth Benchmark, showing its strong generalization performance.
March 31, 2025 at 12:52 PM
🧩Challenge: Varying Depth Scales & Unknown Ranges
🔹Most models require a known depth range to build the cost volume.
✅MVSAnywhere estimates an initial range from the camera scale and setup, then refines it. It predicts at the same scale as the input cameras!
March 31, 2025 at 12:52 PM
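To make the range-estimation idea concrete, here is a toy heuristic in the spirit of the post: derive a provisional near/far from the camera baselines, so the cost volume, and the predicted depth, inherit whatever scale the input poses are expressed in. The formula and constants are my assumptions, not the paper's actual procedure.

```python
import numpy as np

def initial_depth_range(cam_positions, near_scale=0.5, far_scale=50.0):
    """Guess a depth range from the scale of the camera rig.

    Heuristic sketch (an assumption, not the paper's rule): use the
    median baseline between the reference camera and its source views
    as a proxy for scene scale, then span a generous near/far around it.
    """
    ref, sources = cam_positions[0], cam_positions[1:]
    baselines = np.linalg.norm(sources - ref, axis=1)
    scale = np.median(baselines)
    return near_scale * scale, far_scale * scale

# Example: cameras roughly 0.2 units apart -> range of about [0.1, 10.0]
cams = np.array([[0.0, 0.0, 0.0], [0.2, 0.0, 0.0], [0.0, 0.2, 0.0]])
print(initial_depth_range(cams))
```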
🧩Challenge: Domain Generalization
🔹Previous models struggle across domains (indoor🏠 vs outdoor🏞️).
✅MVSAnywhere uses a transformer architecture and is trained on a large array of varied synthetic datasets.
March 31, 2025 at 12:52 PM
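As a rough illustration of training on a mix of synthetic datasets, here is a minimal PyTorch sketch. The toy datasets stand in for the real ones, and the uniform shuffled sampling is an assumption, not the paper's recipe.

```python
import torch
from torch.utils.data import ConcatDataset, DataLoader, TensorDataset

# Hypothetical stand-ins for varied synthetic datasets (indoor, outdoor, ...);
# in practice each would yield samples in a shared (images, poses, depth) format
# so one model trains across all domains at once.
indoor = TensorDataset(torch.randn(100, 3, 64, 64))
outdoor = TensorDataset(torch.randn(100, 3, 64, 64))

mixed = ConcatDataset([indoor, outdoor])
loader = DataLoader(mixed, batch_size=8, shuffle=True)  # batches mix domains

for (batch,) in loader:
    pass  # training step would go here
```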
🧩Challenge: Robustness to casually captured videos
🔹MVS methods rely entirely on matches in the cost volume (which break down with low overlap & dynamic scenes).
✅MVSAnywhere successfully combines strong single-view image priors with multi-view information from our cost volume.
March 31, 2025 at 12:52 PM
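One simple way such a combination could look; the shapes, names, and the concatenate-then-convolve fusion are my assumptions, not the actual MVSAnywhere architecture. The point is that when matching fails (low overlap, moving objects), the monocular branch can still carry the prediction, and the network learns the trade-off.

```python
import torch
import torch.nn as nn

class PriorCostFusion(nn.Module):
    """Sketch of fusing single-view image features with cost-volume features."""

    def __init__(self, mono_ch=64, cost_ch=32, out_ch=64):
        super().__init__()
        self.fuse = nn.Conv2d(mono_ch + cost_ch, out_ch, kernel_size=3, padding=1)

    def forward(self, mono_feats, cost_feats):
        # mono_feats: (B, mono_ch, H, W) from a single-view encoder
        # cost_feats: (B, cost_ch, H, W) reduced from the plane-sweep cost volume
        return self.fuse(torch.cat([mono_feats, cost_feats], dim=1))

# Example shapes
fusion = PriorCostFusion()
out = fusion(torch.randn(2, 64, 48, 64), torch.randn(2, 32, 48, 64))
print(out.shape)  # torch.Size([2, 64, 48, 64])
```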