Lightnews — Scholar-powered news

MT Group at FBK

@fbk-mt.bsky.social

Heading home tired but very happy after a fantastic #EMNLP2025 (and some well-deserved vacation 😎).
We were super busy presenting 5 papers! It was fantastic catching up with colleagues, exchanging ideas, and seeing all the amazing work in the #NLProc community!

(1/5)

November 15, 2025 at 10:23 AM

MT Group at FBK

@fbk-mt.bsky.social

Our pick of the week by @zhihangxie.bsky.social: "#Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in #SpeechLLMs" by Dingdong Wang, Junan Li, Mingyu Cui, et al. (#EMNLP2025)

aclanthology.org/2025.emnlp-m...

#SLU #SpeechTech

November 12, 2025 at 2:43 PM

MT Group at FBK

@fbk-mt.bsky.social

🚀 Exciting news from the @fbk-mt.bsky.social group!
@bsavoldi.bsky.social , @linaconti.bsky.social, @matteo-negri.bsky.social & @luisabentivogli.bsky.social are attending #EMNLP2025 in Suzhou 🇨🇳!

Come to our sessions & let's connect:
🔗 mt.fbk.eu/fbk-mt-at-em...

We’re also hiring postdocs!⚡

November 4, 2025 at 9:02 AM

MT Group at FBK

@fbk-mt.bsky.social

🎉🎓Congratulations to our PhD student @dennisfucci.bsky.social on a very successful thesis defense! 👏

Many thanks to the evaluation committee members @deboranozza.bsky.social, Mirco Ravanelli, and Leonardo Badino for their insightful feedback and appreciation of his work!

#nlproc

November 1, 2025 at 7:58 AM

Reposted by MT Group at FBK

DH Group at FBK

@dh-fbk.bsky.social

@bsavoldi.bsky.social from @fbk-mt.bsky.social will present Translation in the Hands of Many: Centering Lay Users in Machine Translation Interactions at the poster session on Wed Nov. 5th, 11:00-12:30 in Hall C

October 31, 2025 at 2:56 PM

MT Group at FBK

@fbk-mt.bsky.social

Our #PickOfTheWeek by @beomseok-lee.bsky.social: "Can Speech LLMs Think while Listening?" by Yi-Jen Shih, @rdesh26.bsky.social, Chunyang Wu, Wei Zhou, SK Bong, Yashesh Gaur, Jay Mahadeokar, Ozlem Kalinli, Mike Seltzer (2025).

#Speech #SpeechLLM #LLM #SpeechTech #AI

Beomseok Lee @beomseok-lee.bsky.social · 21d

Can we make Speech LLMs actually think as they listen? 👂💭
This fascinating work applies CoT inspired by human “thinking while listening”, training models to find the inflection point when reasoning starts.
📄 arxiv.org/abs/2510.07497

Can Speech LLMs Think while Listening?

Recent advances in speech large language models (speech LLMs) have enabled seamless spoken interactions, but these systems still struggle with complex reasoning tasks. Previously, chain-of-thought (Co...

arxiv.org

October 29, 2025 at 1:30 PM

MT Group at FBK

@fbk-mt.bsky.social

Our next presentation is by @sarapapi.bsky.social: "How real is your real-time simultaneous speech-to-text translation system?"

Look for the answer in her TACL paper: direct.mit.edu/tacl/article...

#lt2025fbk

October 28, 2025 at 1:08 PM

MT Group at FBK

@fbk-mt.bsky.social

Our Marco Gaido presenting FAMA, the first family of large-scale open-science speech foundation models for English and Italian.

Joint work with the @speechtekfbk.bsky.social group.

Data, code, models publicly available, check all info in the paper:
clic2025.unica.it/wp-content/u...

#lt2025fbk

October 28, 2025 at 12:03 PM

MT Group at FBK

@fbk-mt.bsky.social

@bsavoldi.bsky.social presenting our new multilingual benchmark for evaluating LLMs on gender-neutral translation.

Catch our paper at #EMNLP2025
ℹ️ arxiv.org/pdf/2501.09409

#lt2025fbk

October 28, 2025 at 10:44 AM

MT Group at FBK

@fbk-mt.bsky.social

Now it's the turn of our @dennisfucci.bsky.social presenting the #ACL2025NLP paper on explaining gender bias in speech translation

📖 aclanthology.org/2025.acl-sho...
#lt2025fbk

October 28, 2025 at 10:43 AM

MT Group at FBK

@fbk-mt.bsky.social

The Language Technology at FBK workshop has just started with a truly insightful talk by @deboranozza.bsky.social: "A Roadmap for the Everyday Use of LLMs: Emerging Risks and Research Directions" #LT2025FBK

October 28, 2025 at 10:41 AM

MT Group at FBK

@fbk-mt.bsky.social

Our pick of the week by @linaconti.bsky.social: "Lost in Transcription, Found in Distribution Shift: Demystifying
Hallucination in Speech Foundation Models" by Hanin Atwany, @abdulwaheed.bsky.social, Rita Singh, Monojit Choudhury, and Bhiksha Raj (ACL Findings 2025)

aclanthology.org/2025.finding...

October 23, 2025 at 2:46 PM

MT Group at FBK

@fbk-mt.bsky.social

🚀 Join us for the LT@FBK day 2025! Discover cutting-edge research and highlights in speech and language technologies from Fondazione Bruno Kessler (FBK)

📅 October 28, 2025
📍FBK, Trento
ℹ️ lt-highlights.fbk.eu

LT Highlights @ FBK 2025

lt-highlights.fbk.eu

October 21, 2025 at 10:15 AM

MT Group at FBK

@fbk-mt.bsky.social

Our pick of the week by @bsavoldi.bsky.social: "Acoustic-based Gender Differentiation in Speech-aware Language Models" by Junhyuk Choi, Jihwan Seol, Nayeon Kim, Chanhee Cho, EunBin Cho, Bugeun Kim.

arxiv.org/abs/2509.21125

#Gender #SpeechLLM #Speech

Beatrice Savoldi @bsavoldi.bsky.social · Oct 16

#PickOfTheWeek 📚 @fbk-mt.bsky.social
For a read on gender bias in the speech domain ➡️"Acoustic-based Gender Differentiation in Speech-Aware Language Models" arxiv.org/pdf/2509.21125

arxiv.org

October 16, 2025 at 1:54 PM

Reposted by MT Group at FBK

land-fbk.bsky.social

@land-fbk.bsky.social

🚀 Our annual, full-day Language Technologies showcase is back! Dive into the latest research highlights from FBK groups. Want in? We'd love to see you, but don't forget to register!

www.fbk.eu/en/event/346...

Language Technology Research Highlights 2025

The Language Technology Research Highlights 2025 (LT@FBK2025) event aims to bring together scientists, students, practitioners, and enthusiasts who are interested in language technologies and want to ...

www.fbk.eu

October 16, 2025 at 8:57 AM

MT Group at FBK

@fbk-mt.bsky.social

Marco Gaido introducing SimulStream, an #OpenSource Tool for Simultaneous #Speech #Translation 🗣️🖥️📝 at the DI Center Demo Day at FBK!

The tool is going to be released soon. Stay tuned! 👀

October 10, 2025 at 8:42 AM

MT Group at FBK

@fbk-mt.bsky.social

Marco Gaido and Roldano Cattoni presenting our SimulStream Demo at the DI Center Demo Day at FBK!

The open-source tool, which is going to be released soon, natively supports any speech-to-text #HuggingFace models! 🤖

#SpeechTech #Translation

October 10, 2025 at 8:39 AM

Reposted by MT Group at FBK

Lina Conti

@linaconti.bsky.social

🎉 Excited to share that my paper "The Unheard Alternative" was accepted to @blackboxnlp.bsky.social 2025!
We introduce contrastive explanations for speech-to-text, identifying which audio features ST models use to assign a grammatical gender to the speaker.
📄 Preprint: arxiv.org/abs/2509.265...

The Unheard Alternative: Contrastive Explanations for Speech-to-Text Models

Contrastive explanations, which indicate why an AI system produced one output (the target) instead of another (the foil), are widely regarded in explainable AI as more informative and interpretable th...

arxiv.org

October 1, 2025 at 5:37 PM

MT Group at FBK

@fbk-mt.bsky.social

Our very own @sarapapi.bsky.social presenting FAMA at #clicit2025:

📗Paper: clic2025.unica.it/wp-content/u...
🔗 Models: hf.co/collections/...
📊 Data: hf.co/datasets/FBK...
💻 Code: github.com/hlt-mt/FBK-f...

Joint work with @speechtekfbk.bsky.social

September 25, 2025 at 2:49 PM

Reposted by MT Group at FBK

sarapapi.bsky.social

@sarapapi.bsky.social

🚀 Excited to present FAMA, the first large-scale #OpenScience #Speech foundation model for 🇮🇹 Italian & 🇬🇧 English, at #clicit2025 (17:30–18:45 oral session)!

🔗 Models: hf.co/collections/...
📊 Data: hf.co/datasets/FBK...
💻 Code: github.com/hlt-mt/FBK-f...
📄 Preprint: arxiv.org/pdf/2505.22759

September 24, 2025 at 1:20 PM

Reposted by MT Group at FBK

DH Group at FBK

@dh-fbk.bsky.social

We are on our way to Casteddu for #clicit2025 with a guest from @fbk-mt.bsky.social @ailc-nlp.bsky.social

September 24, 2025 at 8:43 AM

MT Group at FBK

@fbk-mt.bsky.social

Our pick of the week by @sarapapi.bsky.social: "Retrieval-Augmented Generation for AI-Generated Content: A Survey" by Penghao Zhao, Hailin Zhang, Qinhan Yu, Zhengren Wang, Yunteng Geng, Fangcheng Fu, Ling Yang, Wentao Zhang, Jie Jiang, Bin Cui.

arxiv.org/pdf/2402.19473

#RAG #survey

September 18, 2025 at 4:32 PM

MT Group at FBK

@fbk-mt.bsky.social

Our pick of the week by Marco Gaido: "Context-Driven Dynamic #Pruning for Large #Speech #Foundation Models" by Masao Someki, Shikhar Bharadwaj, Atharva Anand Joshi, Chyi-Jiunn Lin, Jinchuan Tian, Jee-weon Jung, @shinjiw.bsky.social, et al. #INTERSPEECH2025.

arxiv.org/abs/2505.18860

Context-Driven Dynamic Pruning for Large Speech Foundation Models

Speech foundation models achieve strong generalization across languages and acoustic conditions, but require significant computational resources for inference. In the context of speech foundation mode...

arxiv.org

September 12, 2025 at 3:52 PM

MT Group at FBK

@fbk-mt.bsky.social

Our pick of the week by @zhihangxie.bsky.social: "SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation" by Chenyang Le, Bing Han, Jinshun Li, Songyong Chen, and Yanmin Qian (2025)

#Speech #Simultaneous #Translation #MOE #SpeechTech

Zhihang Xie @zhihangxie.bsky.social · Sep 3

🚀 SimulMEGA: MoE Routers as advanced policy makers for Simultaneous Speech Translation 🎧🌍
Mixture-of-Experts routing → smarter decisions on when & how to translate, balancing latency vs quality in real-time speech. Paper link at arxiv.org/pdf/2509.012...

arxiv.org

September 3, 2025 at 10:54 AM

MT Group at FBK

@fbk-mt.bsky.social

Our pick of the week by @beomseok-lee.bsky.social: "Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in SpeechLLMs" by Dingdong Wang, Junan Li, Mingyu Cui, Dongchao Yang, Xueyuan Chen, and Helen Meng (EMNLP 2025)

Beomseok Lee @beomseok-lee.bsky.social · Aug 28

🤔 Ever wondered how discrete tokens vs. continuous features behave in SpeechLLMs?
This new work dives into 6 SLU tasks and reveals some interesting takeaways!
arxiv.org/abs/2508.17863

Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in SpeechLLMs

With the rise of Speech Large Language Models (SpeechLLMs), two dominant approaches have emerged for speech processing: discrete tokens and continuous features. Each approach has demonstrated strong c...

arxiv.org

August 28, 2025 at 9:33 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news