MT Group at FBK
banner
fbk-mt.bsky.social
MT Group at FBK
@fbk-mt.bsky.social
#MachineTranslation Research Unit @ Fondazione Bruno Kessler

#nlproc #deeplearning #ai
mt.fbk.eu
Heading home tired but very happy after a fantastic #EMNLP2025 (and some well-deserved vacation 😎).
We were super busy presenting 5 papers! It was fantastic catching up with colleagues, exchanging ideas, and seeing all the amazing work in the #NLProc community!

(1/5)
November 15, 2025 at 10:23 AM
Our pick of the week by @zhihangxie.bsky.social: "#Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in #SpeechLLMs" by Dingdong Wang, Junan Li, Mingyu Cui, et al. (#EMNLP2025)

aclanthology.org/2025.emnlp-m...

#SLU #SpeechTech
November 12, 2025 at 2:43 PM
🚀 Exciting news from the @fbk-mt.bsky.social group!
@bsavoldi.bsky.social , @linaconti.bsky.social, @matteo-negri.bsky.social & @luisabentivogli.bsky.social are attending #EMNLP2025 in Suzhou 🇨🇳!

Come to our sessions & let's connect:
🔗 mt.fbk.eu/fbk-mt-at-em...

We’re also hiring postdocs!⚡
November 4, 2025 at 9:02 AM
🎉🎓Congratulations to our PhD student @dennisfucci.bsky.social on a very successful thesis defense! 👏

Many thanks to the evaluation committee members @deboranozza.bsky.social, Mirco Ravanelli, and Leonardo Badino for their insightful feedback and appreciation of his work!

#nlproc
November 1, 2025 at 7:58 AM
Reposted by MT Group at FBK
@bsavoldi.bsky.social from @fbk-mt.bsky.social will present Translation in the Hands of Many: Centering Lay Users in Machine Translation Interactions at the poster session on Wed Nov. 5th, 11:00-12:30 in Hall C
October 31, 2025 at 2:56 PM
Our #PickOfTheWeek by @beomseok-lee.bsky.social: "Can Speech LLMs Think while Listening?" by Yi-Jen Shih, @rdesh26.bsky.social, Chunyang Wu, Wei Zhou, SK Bong, Yashesh Gaur, Jay Mahadeokar, Ozlem Kalinli, Mike Seltzer (2025).

#Speech #SpeechLLM #LLM #SpeechTech #AI
Can we make Speech LLMs actually think as they listen? 👂💭
This fascinating work applies CoT inspired by human “thinking while listening”, training models to find the inflection point when reasoning starts.
📄 arxiv.org/abs/2510.07497
Can Speech LLMs Think while Listening?
Recent advances in speech large language models (speech LLMs) have enabled seamless spoken interactions, but these systems still struggle with complex reasoning tasks. Previously, chain-of-thought (Co...
arxiv.org
October 29, 2025 at 1:30 PM
Our next presentation is by @sarapapi.bsky.social: "How real is your real-time simultaneous speech-to-text translation system?"

Look for the answer in her TACL paper: direct.mit.edu/tacl/article...

#lt2025fbk
October 28, 2025 at 1:08 PM
Our Marco Gaido presenting FAMA, the first family of large-scale open-science speech foundation models for English and Italian.

Joint work with the @speechtekfbk.bsky.social group.

Data, code, models publicly available, check all info in the paper:
clic2025.unica.it/wp-content/u...

#lt2025fbk
October 28, 2025 at 12:03 PM
@bsavoldi.bsky.social presenting our new multilingual benchmark for evaluating LLMs on gender-neutral translation.

Catch our paper at #EMNLP2025
ℹ️ arxiv.org/pdf/2501.09409

#lt2025fbk
October 28, 2025 at 10:44 AM
Now it's the turn of our @dennisfucci.bsky.social presenting the #ACL2025NLP paper on explaining gender bias in speech translation

📖 aclanthology.org/2025.acl-sho...
#lt2025fbk
October 28, 2025 at 10:43 AM
The Language Technology at FBK workshop has just started with a truly insightful talk by @deboranozza.bsky.social: "A Roadmap for the Everyday Use of LLMs: Emerging Risks and Research Directions" #LT2025FBK
October 28, 2025 at 10:41 AM
Our pick of the week by @linaconti.bsky.social: "Lost in Transcription, Found in Distribution Shift: Demystifying
Hallucination in Speech Foundation Models" by Hanin Atwany, @abdulwaheed.bsky.social, Rita Singh, Monojit Choudhury, and Bhiksha Raj (ACL Findings 2025)

aclanthology.org/2025.finding...
October 23, 2025 at 2:46 PM
🚀 Join us for the LT@FBK day 2025! Discover cutting-edge research and highlights in speech and language technologies from Fondazione Bruno Kessler (FBK)

📅 October 28, 2025
📍FBK, Trento
ℹ️ lt-highlights.fbk.eu
LT Highlights @ FBK 2025
lt-highlights.fbk.eu
October 21, 2025 at 10:15 AM
Our pick of the week by @bsavoldi.bsky.social: "Acoustic-based Gender Differentiation in Speech-aware Language Models" by Junhyuk Choi, Jihwan Seol, Nayeon Kim, Chanhee Cho, EunBin Cho, Bugeun Kim.

arxiv.org/abs/2509.21125

#Gender #SpeechLLM #Speech
#PickOfTheWeek 📚 @fbk-mt.bsky.social
For a read on gender bias in the speech domain ➡️"Acoustic-based Gender Differentiation in Speech-Aware Language Models" arxiv.org/pdf/2509.21125
arxiv.org
October 16, 2025 at 1:54 PM
Reposted by MT Group at FBK
🚀 Our annual, full-day Language Technologies showcase is back! Dive into the latest research highlights from FBK groups. Want in? We'd love to see you, but don't forget to register!

www.fbk.eu/en/event/346...
Language Technology Research Highlights 2025
The Language Technology Research Highlights 2025 (LT@FBK2025) event aims to bring together scientists, students, practitioners, and enthusiasts who are interested in language technologies and want to ...
www.fbk.eu
October 16, 2025 at 8:57 AM
Marco Gaido introducing SimulStream, an #OpenSource Tool for Simultaneous #Speech #Translation 🗣️🖥️📝 at the DI Center Demo Day at FBK!

The tool is going to be released soon. Stay tuned! 👀
October 10, 2025 at 8:42 AM
Marco Gaido and Roldano Cattoni presenting our SimulStream Demo at the DI Center Demo Day at FBK!

The open-source tool, which is going to be released soon, natively supports any speech-to-text #HuggingFace models! 🤖

#SpeechTech #Translation
October 10, 2025 at 8:39 AM
Reposted by MT Group at FBK
🎉 Excited to share that my paper "The Unheard Alternative" was accepted to @blackboxnlp.bsky.social 2025!
We introduce contrastive explanations for speech-to-text, identifying which audio features ST models use to assign a grammatical gender to the speaker.
📄 Preprint: arxiv.org/abs/2509.265...
The Unheard Alternative: Contrastive Explanations for Speech-to-Text Models
Contrastive explanations, which indicate why an AI system produced one output (the target) instead of another (the foil), are widely regarded in explainable AI as more informative and interpretable th...
arxiv.org
October 1, 2025 at 5:37 PM
September 25, 2025 at 2:49 PM
Reposted by MT Group at FBK
🚀 Excited to present FAMA, the first large-scale #OpenScience #Speech foundation model for 🇮🇹 Italian & 🇬🇧 English, at #clicit2025 (17:30–18:45 oral session)!

🔗 Models: hf.co/collections/...
📊 Data: hf.co/datasets/FBK...
💻 Code: github.com/hlt-mt/FBK-f...
📄 Preprint: arxiv.org/pdf/2505.22759
September 24, 2025 at 1:20 PM
Reposted by MT Group at FBK
We are on our way to Casteddu for #clicit2025 with a guest from @fbk-mt.bsky.social @ailc-nlp.bsky.social
September 24, 2025 at 8:43 AM
Our pick of the week by @sarapapi.bsky.social: "Retrieval-Augmented Generation for AI-Generated Content: A Survey" by Penghao Zhao, Hailin Zhang, Qinhan Yu, Zhengren Wang, Yunteng Geng, Fangcheng Fu, Ling Yang, Wentao Zhang, Jie Jiang, Bin Cui.

arxiv.org/pdf/2402.19473

#RAG #survey
September 18, 2025 at 4:32 PM
Our pick of the week by Marco Gaido: "Context-Driven Dynamic #Pruning for Large #Speech #Foundation Models" by Masao Someki, Shikhar Bharadwaj, Atharva Anand Joshi, Chyi-Jiunn Lin, Jinchuan Tian, Jee-weon Jung, @shinjiw.bsky.social, et al. #INTERSPEECH2025.

arxiv.org/abs/2505.18860
Context-Driven Dynamic Pruning for Large Speech Foundation Models
Speech foundation models achieve strong generalization across languages and acoustic conditions, but require significant computational resources for inference. In the context of speech foundation mode...
arxiv.org
September 12, 2025 at 3:52 PM
Our pick of the week by @zhihangxie.bsky.social: "SimulMEGA: MoE Routers are Advanced Policy Makers for Simultaneous Speech Translation" by Chenyang Le, Bing Han, Jinshun Li, Songyong Chen, and Yanmin Qian (2025)

#Speech #Simultaneous #Translation #MOE #SpeechTech
🚀 SimulMEGA: MoE Routers as advanced policy makers for Simultaneous Speech Translation 🎧🌍
Mixture-of-Experts routing → smarter decisions on when & how to translate, balancing latency vs quality in real-time speech. Paper link at arxiv.org/pdf/2509.012...
arxiv.org
September 3, 2025 at 10:54 AM
Our pick of the week by @beomseok-lee.bsky.social: "Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in SpeechLLMs" by Dingdong Wang, Junan Li, Mingyu Cui, Dongchao Yang, Xueyuan Chen, and Helen Meng (EMNLP 2025)
August 28, 2025 at 9:33 AM