Anilkumar Swamy, Vincent Leroy, Philippe Weinzaepfel ... Grégory Rogez
arxiv.org/abs/2508.16465
Trending on www.scholar-inbox.com
Maxime Pietrantoni, @gabrielacsurka.bsky.social, @sattlertorsten.bsky.social
arxiv.org/abs/2507.23569
www.arxiv.org/abs/2506.21348
Slides are now available for those interested
1- Catch me if you can: Manoeuvre the Competition with Your Unique Abilities
tinyurl.com/StandOut-DD2...
2- Beyond Long Video Understanding
tinyurl.com/BeyondLong-D...
- What matters in ImageNav: architecture, pre-training, sim settings, pose (poster & highlight at the Embodied AI workshop)
- CondiMen: Conditional Multi-person Human Mesh Recovery (Poster at the Rhobin workshop and at the 3D Humans workshop)
Interactive site, play around with dynamical models:
europe.naverlabs.com/research/pub...
Thanks @weinzaepfelp.bsky.social for the photo.
@steevenj7.bsky.social
go.bsky.app/JdTFu4Q
paper: arxiv.org/abs/2503.14405
code: github.com/naver/dune
"DUNE: Distilling a UNiversal Encoder from Heterogeneous 2D and 3D Teachers"
We propose DUNE: a ViT-based encoder distilled from multiple specialized 2D & 3D foundation models to unify visual tasks across 2D, 3D and human understanding.
Discover MUSt3R & Pow3R, universal encoder DUNE + research in navigation, vizloc, segmentation & human motion understanding!
All our #CVPR2025 papers are here
➡️ tinyurl.com/4z79ujce
Code IS available here github.com/naver/must3r
I hope it works in your scenarios and you have as much fun as we do playing around with it!
Interested in SfM, RGB-SLAM or... both at the same time???
Come see MUSt3R @CVPR25 Friday morning, ExHall D Poster #82.
Jerome and Boris will be there to present how we can adapt DUSt3R to multiple views via a memory mechanism.
If you missed it earlier [...]