Ali Vosoughi
ali-vosoughi.bsky.social
Ali Vosoughi
@ali-vosoughi.bsky.social
Artificial Intelligence (AI) Researcher and a PhD Candidate in Computer Engineering

See -> https://alivosoughi.com/
Pinned
Check out our pioneering paper on Video and Audiovisual Understanding with LLMs! Dive into the future of AI with us: #VideoUnderstanding #LargeLanguageModels #AIResearch

🕸️ github.com/yunlong10/Aw...
GitHub - yunlong10/Awesome-LLMs-for-Video-Understanding: 🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs. Contribute to yunlong10/Awesome-LLMs-for-Video-Understanding development by creating an account on GitHub.
github.com
Check out our pioneering paper on Video and Audiovisual Understanding with LLMs! Dive into the future of AI with us: #VideoUnderstanding #LargeLanguageModels #AIResearch

🕸️ github.com/yunlong10/Aw...
GitHub - yunlong10/Awesome-LLMs-for-Video-Understanding: 🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs. Contribute to yunlong10/Awesome-LLMs-for-Video-Understanding development by creating an account on GitHub.
github.com
January 13, 2025 at 9:23 PM
Reposted by Ali Vosoughi
Jing Bi, Nguyen Manh Nguyen, Ali Vosoughi, Chenliang Xu
MISAR: A Multimodal Instructional System with Augmented Reality. (arXiv:2310.11699v1 [cs.CL])
http://arxiv.org/abs/2310.11699
October 19, 2023 at 2:05 AM
Reposted by Ali Vosoughi
Yiyang Su, Ali Vosoughi, Shijian Deng, Yapeng Tian, Chenliang Xu
Separating Invisible Sounds Toward Universal Audiovisual Scene-Aware Sound Separation. (arXiv:2310.11713v1 [cs.CV])
http://arxiv.org/abs/2310.11713
October 19, 2023 at 3:01 AM
Reposted by Ali Vosoughi
Jing Bi, Yunlong Tang, Luchuan Song, Ali Vosoughi, Nguyen Nguyen, Chenliang Xu
EAGLE: Egocentric AGgregated Language-video Engine
https://arxiv.org/abs/2409.17523
September 27, 2024 at 9:01 AM