Lightnews — Scholar-powered news

Eleonora Grassucci

@eleonoragrassucci.bsky.social

18 followers 11 following 9 posts

Assistant Professor @Sapienza, Rome.
Generative AI, Multimodal Learning, Generative Semantic Communication

Posts Replies Media Videos

Eleonora Grassucci

@eleonoragrassucci.bsky.social

🤔How?
1⃣Extract embeddings with modality encoders
2⃣Arrange them in a tensor
3⃣Compute the Gram matrix
4⃣Compute the determinant, and here it is the volume of the parallelotope!

3/n🧵

December 18, 2024 at 3:43 PM

Eleonora Grassucci

@eleonoragrassucci.bsky.social

💡The intuition is: semantically aligned data has a small volume, while semantically misaligned data has a large volume!
We do not need to get the pairwise cosine similarity anymore, which is insufficient for tasks that require cross-modal understanding beyond pairs!

2/n🧵

December 18, 2024 at 3:43 PM

Eleonora Grassucci

@eleonoragrassucci.bsky.social

No more pairwise cosine similarity for multimodal learning we can directly go up to n modalities!🔴

👉GRAM can align *from 2 to n modalities* altogether in a joint fashion by gaining alignment insights from the volume of the parallelotope spanned by the modality vectors.

1/n🧵

December 18, 2024 at 3:43 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news