Lightnews — Scholar-powered news

stek_fbk

@speechtekfbk.bsky.social

20 followers 17 following 13 posts

Speech technology lab at Fondazione Bruno Kessler


stek_fbk

@speechtekfbk.bsky.social

Sharing the red carpet with @luisabentivogli.bsky.social @fbk-mt.bsky.social

March 18, 2025 at 2:17 PM

stek_fbk

@speechtekfbk.bsky.social

The two papers are:
- Large Language Models Are Strong Audio-Visual Speech Recognition Learners arxiv.org/abs/2409.12319
- EFL-PEFT: A communication Efficient Federated Learning framework using PEFT sparsification for ASR

Large Language Models Are Strong Audio-Visual Speech Recognition Learners

Multimodal large language models (MLLMs) have recently become a focal point of research due to their formidable multimodal understanding capabilities. For example, in the audio and speech domains, an ...

arxiv.org

January 2, 2025 at 10:37 AM
