#nlproc #deeplearning #ai
mt.fbk.eu
We were super busy presenting 5 papers! It was fantastic catching up with colleagues, exchanging ideas, and seeing all the amazing work in the #NLProc community!
(1/5)
We were super busy presenting 5 papers! It was fantastic catching up with colleagues, exchanging ideas, and seeing all the amazing work in the #NLProc community!
(1/5)
aclanthology.org/2025.emnlp-m...
#SLU #SpeechTech
aclanthology.org/2025.emnlp-m...
#SLU #SpeechTech
@bsavoldi.bsky.social , @linaconti.bsky.social, @matteo-negri.bsky.social & @luisabentivogli.bsky.social are attending #EMNLP2025 in Suzhou 🇨🇳!
Come to our sessions & let's connect:
🔗 mt.fbk.eu/fbk-mt-at-em...
We’re also hiring postdocs!⚡
@bsavoldi.bsky.social , @linaconti.bsky.social, @matteo-negri.bsky.social & @luisabentivogli.bsky.social are attending #EMNLP2025 in Suzhou 🇨🇳!
Come to our sessions & let's connect:
🔗 mt.fbk.eu/fbk-mt-at-em...
We’re also hiring postdocs!⚡
Many thanks to the evaluation committee members @deboranozza.bsky.social, Mirco Ravanelli, and Leonardo Badino for their insightful feedback and appreciation of his work!
#nlproc
Many thanks to the evaluation committee members @deboranozza.bsky.social, Mirco Ravanelli, and Leonardo Badino for their insightful feedback and appreciation of his work!
#nlproc
#Speech #SpeechLLM #LLM #SpeechTech #AI
This fascinating work applies CoT inspired by human “thinking while listening”, training models to find the inflection point when reasoning starts.
📄 arxiv.org/abs/2510.07497
#Speech #SpeechLLM #LLM #SpeechTech #AI
Look for the answer in her TACL paper: direct.mit.edu/tacl/article...
#lt2025fbk
Look for the answer in her TACL paper: direct.mit.edu/tacl/article...
#lt2025fbk
Joint work with the @speechtekfbk.bsky.social group.
Data, code, models publicly available, check all info in the paper:
clic2025.unica.it/wp-content/u...
#lt2025fbk
Joint work with the @speechtekfbk.bsky.social group.
Data, code, models publicly available, check all info in the paper:
clic2025.unica.it/wp-content/u...
#lt2025fbk
Catch our paper at #EMNLP2025
ℹ️ arxiv.org/pdf/2501.09409
#lt2025fbk
Catch our paper at #EMNLP2025
ℹ️ arxiv.org/pdf/2501.09409
#lt2025fbk
📖 aclanthology.org/2025.acl-sho...
#lt2025fbk
📖 aclanthology.org/2025.acl-sho...
#lt2025fbk
Hallucination in Speech Foundation Models" by Hanin Atwany, @abdulwaheed.bsky.social, Rita Singh, Monojit Choudhury, and Bhiksha Raj (ACL Findings 2025)
aclanthology.org/2025.finding...
Hallucination in Speech Foundation Models" by Hanin Atwany, @abdulwaheed.bsky.social, Rita Singh, Monojit Choudhury, and Bhiksha Raj (ACL Findings 2025)
aclanthology.org/2025.finding...
📅 October 28, 2025
📍FBK, Trento
ℹ️ lt-highlights.fbk.eu
📅 October 28, 2025
📍FBK, Trento
ℹ️ lt-highlights.fbk.eu
arxiv.org/abs/2509.21125
#Gender #SpeechLLM #Speech
For a read on gender bias in the speech domain ➡️"Acoustic-based Gender Differentiation in Speech-Aware Language Models" arxiv.org/pdf/2509.21125
arxiv.org/abs/2509.21125
#Gender #SpeechLLM #Speech
www.fbk.eu/en/event/346...
www.fbk.eu/en/event/346...
The tool is going to be released soon. Stay tuned! 👀
The tool is going to be released soon. Stay tuned! 👀
The open-source tool, which is going to be released soon, natively supports any speech-to-text #HuggingFace models! 🤖
#SpeechTech #Translation
The open-source tool, which is going to be released soon, natively supports any speech-to-text #HuggingFace models! 🤖
#SpeechTech #Translation
We introduce contrastive explanations for speech-to-text, identifying which audio features ST models use to assign a grammatical gender to the speaker.
📄 Preprint: arxiv.org/abs/2509.265...
We introduce contrastive explanations for speech-to-text, identifying which audio features ST models use to assign a grammatical gender to the speaker.
📄 Preprint: arxiv.org/abs/2509.265...
📗Paper: clic2025.unica.it/wp-content/u...
🔗 Models: hf.co/collections/...
📊 Data: hf.co/datasets/FBK...
💻 Code: github.com/hlt-mt/FBK-f...
Joint work with @speechtekfbk.bsky.social
📗Paper: clic2025.unica.it/wp-content/u...
🔗 Models: hf.co/collections/...
📊 Data: hf.co/datasets/FBK...
💻 Code: github.com/hlt-mt/FBK-f...
Joint work with @speechtekfbk.bsky.social
🔗 Models: hf.co/collections/...
📊 Data: hf.co/datasets/FBK...
💻 Code: github.com/hlt-mt/FBK-f...
📄 Preprint: arxiv.org/pdf/2505.22759
🔗 Models: hf.co/collections/...
📊 Data: hf.co/datasets/FBK...
💻 Code: github.com/hlt-mt/FBK-f...
📄 Preprint: arxiv.org/pdf/2505.22759
arxiv.org/pdf/2402.19473
#RAG #survey
arxiv.org/pdf/2402.19473
#RAG #survey
arxiv.org/abs/2505.18860
arxiv.org/abs/2505.18860
#Speech #Simultaneous #Translation #MOE #SpeechTech
Mixture-of-Experts routing → smarter decisions on when & how to translate, balancing latency vs quality in real-time speech. Paper link at arxiv.org/pdf/2509.012...
#Speech #Simultaneous #Translation #MOE #SpeechTech
This new work dives into 6 SLU tasks and reveals some interesting takeaways!
arxiv.org/abs/2508.17863