Lightnews — Scholar-powered news

Juliette Marrie

@jlt-m.bsky.social

94 followers 76 following 7 posts

Postdoc at Kyutai
https://juliettemarrie.github.io

Posts Replies Media Videos

Juliette Marrie

@jlt-m.bsky.social

My experiments are run on a 48GB GPU. A 24GB GPU may be sufficient depending on the application. Feel free to reach out if you have any questions or run into any issues, and we can find a way to make it work within your memory constraints.

February 2, 2025 at 11:12 AM

Juliette Marrie

@jlt-m.bsky.social

Thanks! So far, I have been evaluating on standard datasets for foreground/background segmentation (SPIn-NeRF, NVOS) and open-vocabulary object localization (LERF). The object removal task you introduce in Semantics-Controlled GS could be another interesting application!

February 2, 2025 at 10:30 AM

Juliette Marrie

@jlt-m.bsky.social

Uplifting is implemented in the forward rendering process, so it is as fast as forward rendering. Experimentally, it takes around 2ms per image per feature dimension. For example, uplifting 100 DINOv2 feature maps of dimension 40 (PCA-reduced) takes about 9s. See Appendix B.1 for more details.

January 31, 2025 at 4:36 PM

Juliette Marrie

@jlt-m.bsky.social

(3/3) LUDVIG uses a graph diffusion mechanism to refine 3D features, such as coarse segmentation masks, by leveraging 3D scene geometry and pairwise similarities induced by DINOv2.

January 31, 2025 at 9:59 AM

Juliette Marrie

@jlt-m.bsky.social

(2/3) We propose a simple, parameter-free aggregation mechanism, based on alpha-weighted multi-view blending of 2D pixel features in the forward rendering process.

Illustration of the inverse and forward rendering of 2D visual features produced by DINOv2.

January 31, 2025 at 9:59 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news