Eleonora Grassucci
eleonoragrassucci.bsky.social
Assistant Professor @Sapienza, Rome.
Generative AI, Multimodal Learning, Generative Semantic Communication
GRAM is going to #ICLR2025!
More on this soon, and see y'all in Singapore!
#multimodal #contrastive #learning
January 23, 2025 at 9:07 AM
Super interesting work on #GenAI #Video2Audio with impressive results from my friends @riccardofosco.bsky.social and @Christian Marinoni, together with @emilianpos.bsky.social, @mcomunita.bsky.social, Luca Cosmo, Joshua Reiss, and @dacom.bsky.social!

👇 Go check it out!
🌟 Excited to Share Our Latest Work! 🎥🎶

Here we present Stable-V2A: Synthesis of Synchronized Sound Effects with Temporal and Semantic Controls

arXiv: arxiv.org/abs/2412.15023
Video presentation and results: ispamm.github.io/Stable-V2A
Stable-V2A: Synchronized Sound Effects Synthesis
Stable-V2A is a two-stage model for synthesizing synchronized sound effects with support for temporal and semantic controls.
ispamm.github.io
December 20, 2024 at 6:37 PM
No more pairwise cosine similarity for multimodal learning: we can go directly up to n modalities!🔴

👉GRAM can jointly align *from 2 to n modalities*, drawing its alignment signal from the volume of the parallelotope spanned by the modality vectors.

1/n🧵
December 18, 2024 at 3:43 PM
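For readers curious about the geometry in the GRAM post above, here is a minimal sketch of how the volume of a parallelotope spanned by n vectors can be computed from their Gram matrix. This is not the authors' implementation: the function name `parallelotope_volume`, the L2 normalization step, and the toy vectors are assumptions made for illustration. It relies only on the standard identity that the volume equals the square root of the Gram determinant, so vectors pointing in nearly the same direction give a volume near 0, while mutually orthogonal unit vectors give a volume of 1.

```python
import numpy as np

def parallelotope_volume(vectors):
    """Volume of the parallelotope spanned by the rows of `vectors`.

    Hypothetical helper for illustration, not taken from the GRAM code.
    """
    A = np.asarray(vectors, dtype=float)
    # Assumption: each modality embedding is L2-normalized first,
    # so the volume depends only on the angles between the vectors.
    A = A / np.linalg.norm(A, axis=1, keepdims=True)
    # Gram matrix of pairwise inner products, shape (n, n).
    G = A @ A.T
    # Volume = sqrt(det(G)); clamp tiny negative values caused by round-off.
    return float(np.sqrt(max(np.linalg.det(G), 0.0)))

# Three toy "modality" embeddings that nearly coincide: small volume.
aligned = [[1.0, 0.0, 0.0], [0.99, 0.1, 0.0], [0.98, 0.0, 0.1]]
# Three mutually orthogonal embeddings: largest possible volume for unit vectors.
orthogonal = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

print(parallelotope_volume(aligned))
print(parallelotope_volume(orthogonal))
```

With two vectors the volume reduces to the sine of the angle between them, which is why this quantity can be read as an extension of pairwise cosine similarity to any number of modalities.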
Reposted by Eleonora Grassucci
Eleonora Lopez, Eleonora Grassucci, Debora Capriotti, Danilo Comminiello
Towards Explaining Hypercomplex Neural Networks
https://arxiv.org/abs/2403.17929
March 27, 2024 at 1:13 PM
Reposted by Eleonora Grassucci
Jinho Choi, Jihong Park, Eleonora Grassucci, Danilo Comminiello
Semantic Communication Challenges: Understanding Dos and Avoiding Don'ts
https://arxiv.org/abs/2403.15649
March 26, 2024 at 5:01 AM
Reposted by Eleonora Grassucci
Eleonora Lopez, Eleonora Grassucci, Martina Valleriani, Danilo Comminiello
Multi-View Hypercomplex Learning for Breast Cancer Screening
https://arxiv.org/abs/2204.05798
March 5, 2024 at 8:08 PM
Reposted by Eleonora Grassucci
Giordano Cicchetti, Eleonora Grassucci, Jihong Park, Jinho Choi, Sergio Barbarossa, Danilo Comminiello
Language-Oriented Semantic Latent Representation for Image Transmission
https://arxiv.org/abs/2405.09976
May 17, 2024 at 5:03 AM