Generative AI, Multimodal Learning, Generative Semantic Communication
More on this soon, and see y'all in Singapore!
#multimodal #contrastive #learning
👉GRAM can align *from 2 to n modalities* jointly, using the volume of the parallelotope spanned by the modality vectors as the alignment signal.
1/n🧵
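A rough sketch of the geometric idea, not the GRAM implementation itself: the volume of the parallelotope spanned by k vectors equals the square root of the determinant of their Gram matrix, so a smaller volume means the vectors point in closer directions. The function name `parallelotope_volume` and the choice to L2-normalise each embedding first are assumptions made for illustration.

```python
import numpy as np


def parallelotope_volume(vectors):
    """Volume of the parallelotope spanned by the rows of `vectors`.

    `vectors` has shape (k, d): one d-dimensional embedding per modality.
    Rows are L2-normalised first (an assumption of this sketch), so the
    result lies in [0, 1]: 0 for linearly dependent (e.g. identical)
    directions, 1 for mutually orthogonal ones.
    """
    a = np.asarray(vectors, dtype=float)
    a = a / np.linalg.norm(a, axis=1, keepdims=True)
    gram = a @ a.T  # (k, k) matrix of pairwise dot products
    det = np.linalg.det(gram)
    # Clamp tiny negative values caused by floating-point round-off.
    return float(np.sqrt(max(det, 0.0)))


# Hypothetical embeddings for three modalities (e.g. video, audio, text).
aligned = [[1.0, 0.0, 0.0], [1.0, 0.0, 0.0], [1.0, 0.0, 0.0]]
orthogonal = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

print(parallelotope_volume(aligned))     # close to 0: same direction
print(parallelotope_volume(orthogonal))  # close to 1: unrelated directions
```

For two vectors the volume reduces to the sine of the angle between them, which is why the same quantity works for any number of modalities from 2 to n.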
👇 Go check it out!
Here we present Stable-V2A: Synthesis of Synchronized Sound Effects with Temporal and Semantic Controls
arxiv: arxiv.org/abs/2412.15023
Video presentation and results: ispamm.github.io/Stable-V2A
Towards Explaining Hypercomplex Neural Networks
https://arxiv.org/abs/2403.17929
Semantic Communication Challenges: Understanding Dos and Avoiding Don'ts
https://arxiv.org/abs/2403.15649
Multi-View Hypercomplex Learning for Breast Cancer Screening
https://arxiv.org/abs/2204.05798
Language-Oriented Semantic Latent Representation for Image Transmission
https://arxiv.org/abs/2405.09976