Lightnews — Scholar-powered news

Ali Vosoughi

@ali-vosoughi.bsky.social

3 followers 13 following 1 posts

Artificial Intelligence (AI) Researcher and a PhD Candidate in Computer Engineering

See -> https://alivosoughi.com/

Posts Replies Media Videos

Pinned

Ali Vosoughi @ali-vosoughi.bsky.social · Jan 13

Check out our pioneering paper on Video and Audiovisual Understanding with LLMs! Dive into the future of AI with us: #VideoUnderstanding #LargeLanguageModels #AIResearch

🕸️ github.com/yunlong10/Aw...

GitHub - yunlong10/Awesome-LLMs-for-Video-Understanding: 🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs. Contribute to yunlong10/Awesome-LLMs-for-Video-Understanding development by creating an account on GitHub.

github.com

Reposted by Ali Vosoughi

IEEE WASPAA 2025

@waspaa.com

#WASPAA2025 is in the (IEEE SPS) news!
signalprocessingsociety.org/newsletter/2...

Call for Papers: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2025

The IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) is a highly-regarded bi-annual event hosted by the Audio and Acoustic Signal Processing Technical Committee (AASP...

signalprocessingsociety.org

March 5, 2025 at 10:50 PM

Ali Vosoughi

@ali-vosoughi.bsky.social

GitHub - yunlong10/Awesome-LLMs-for-Video-Understanding: 🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs. Contribute to yunlong10/Awesome-LLMs-for-Video-Understanding development by creating an account on GitHub.

github.com

January 13, 2025 at 9:23 PM

Reposted by Ali Vosoughi

arxiv cs.CL

@arxiv-cs-cl.bsky.social

Jing Bi, Nguyen Manh Nguyen, Ali Vosoughi, Chenliang Xu
MISAR: A Multimodal Instructional System with Augmented Reality. (arXiv:2310.11699v1 [cs.CL])
http://arxiv.org/abs/2310.11699

October 19, 2023 at 2:05 AM

Reposted by Ali Vosoughi

arxiv cs.CV

@arxiv-cs-cv.bsky.social

Yiyang Su, Ali Vosoughi, Shijian Deng, Yapeng Tian, Chenliang Xu
Separating Invisible Sounds Toward Universal Audiovisual Scene-Aware Sound Separation. (arXiv:2310.11713v1 [cs.CV])
http://arxiv.org/abs/2310.11713

October 19, 2023 at 3:01 AM

Reposted by Ali Vosoughi

arxiv cs.CV

@arxiv-cs-cv.bsky.social

Jing Bi, Yunlong Tang, Luchuan Song, Ali Vosoughi, Nguyen Nguyen, Chenliang Xu
EAGLE: Egocentric AGgregated Language-video Engine
https://arxiv.org/abs/2409.17523

September 27, 2024 at 9:01 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news