"DUNE: Distilling a UNiversal Encoder from Heterogeneous 2D and 3D Teachers"
We propose DUNE: a ViT-based encoder distilled from multiple specialized 2D & 3D foundation models to unify visual tasks across 2D, 3D and human understanding.
"DUNE: Distilling a UNiversal Encoder from Heterogeneous 2D and 3D Teachers"
We propose DUNE: a ViT-based encoder distilled from multiple specialized 2D & 3D foundation models to unify visual tasks across 2D, 3D and human understanding.
Code IS available here github.com/naver/must3r
I hope it works in your scenarios and you have as much fun as we do playing around with it!
Code IS available here github.com/naver/must3r
I hope it works in your scenarios and you have as much fun as we do playing around with it!