Dominik Schnaus
schnaus.bsky.social
Dominik Schnaus
@schnaus.bsky.social
PhD student @ TUM with Daniel Cremers
Can we match vision and language representations without any supervision or paired data?

Surprisingly, yes! 

Our #CVPR2025 paper with @neekans.bsky.social and @dcremers.bsky.social shows that the pairwise distances in both modalities are often enough to find correspondences.

⬇️ 1/4
June 3, 2025 at 9:27 AM