sarapapi.bsky.social
@sarapapi.bsky.social
(she/her) Postdoc at @fbk-mt.bsky.social | Working on speech translation
Reposted
Our next presentation is by @sarapapi.bsky.social: "How real is your real-time simultaneous speech-to-text translation system?"

Look for the answer in her TACL paper: direct.mit.edu/tacl/article...

#lt2025fbk
October 28, 2025 at 1:08 PM
Reposted
Thanks to all the participants! #clicit2025
September 26, 2025 at 5:28 PM
Reposted
September 25, 2025 at 2:49 PM
Reposted
Last oral session of the first #clicit2025 day! See you all at the welcome drink!
September 24, 2025 at 4:41 PM
I’m the guest 🙋🏻‍♀️
We are on our way to Casteddu for #clicit2025 with a guest from @fbk-mt.bsky.social @ailc-nlp.bsky.social
September 24, 2025 at 4:13 PM
🚀 Excited to present FAMA, the first large-scale #OpenScience #Speech foundation model for 🇮🇹 Italian & 🇬🇧 English, at #clicit2025 (17:30–18:45 oral session)!

🔗 Models: hf.co/collections/...
📊 Data: hf.co/datasets/FBK...
💻 Code: github.com/hlt-mt/FBK-f...
📄 Preprint: arxiv.org/pdf/2505.22759
September 24, 2025 at 1:20 PM
An interesting survey about #RAG and its interplay with #multimodality: Retrieval-Augmented Generation for AI-Generated Content: A Survey

arxiv.org/pdf/2402.19473

@fbk-mt.bsky.social
arxiv.org
September 18, 2025 at 4:26 PM
Reposted
MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks
Read more: https://arxiv.org/html/2507.19634v1
August 4, 2025 at 8:42 AM
Reposted
Sara Papi, Maike Z\"ufle, Marco Gaido, Beatrice Savoldi, Danni Liu, Ioannis Douros, Luisa Bentivogli, Jan Niehues
MCIF: Multimodal Crosslingual Instruction-Following Benchmark from Scientific Talks
https://arxiv.org/abs/2507.19634
July 29, 2025 at 9:12 AM
Reposted
@sarapapi.bsky.social presented her TACL paper: “How real is your real-time simultaneous speech-to-text translation system?”

👉 aclanthology.org/2025.tacl-1.14/
(2/6)
August 2, 2025 at 4:32 PM
🔥 Is your real-time SimulST system REAL?

Our TACL paper analyzes 110 works and reveals:
🚫 Overreliance on short-form speech
🌀 Terminology chaos
📉 Real-world deployment gaps
We bring order-New taxonomy, trends & recommendations!

📍#ACL2025 Poster: Monday 11-12:30, Hall 4/5

#Speech #SpeechTech
July 27, 2025 at 1:18 PM
Reposted
🔍 Stiamo studiando come l'AI viene usata in Italia e per farlo abbiamo costruito un sondaggio!

👉 bit.ly/sondaggio_ai...

(è anonimo, richiede ~10 minuti, e se partecipi o lo fai girare ci aiuti un sacco🙏)

Ci interessa anche raggiungere persone che non si occupano e non sono esperte di AI!
Qualtrics Survey | Qualtrics Experience Management
The most powerful, simple and trusted way to gather experience data. Start your journey to experience management and try a free account today.
bit.ly
June 3, 2025 at 10:24 AM
🚀 New tech report out! Meet FAMA, our open-science speech foundation model family for both ASR and ST in 🇬🇧 English and 🇮🇹 Italian.

The models are live and ready to try on @hf.co:
🔗 huggingface.co/collections/...

📄 Preprint: arxiv.org/abs/2505.22759

#ASR #ST #OpenScience #MultilingualAI
FAMA - a FBK-MT Collection
The First Large-Scale Open-Science Speech Foundation Model for English and Italian
huggingface.co
May 30, 2025 at 3:35 PM
Reposted
If you're finishing your camera-ready for ACL or ICML and want to cite co-first authors more fairly, I just made a simple fix to do this! Just add $^*$ to the authors' names in your bibtex, and the citations should change :)

github.com/tpimentelms/...
May 29, 2025 at 8:53 AM
I am honored to have received this award today! 🎊
🎉 Excited to share that our @sarapapi.bsky.social has won the 2024 Best PhD Award from the Information and Engineering Doctoral School for her thesis “Direct Speech Translation in Constrained Contexts: The Simultaneous and Subtitling Scenarios.”

#nlproc #speech #speechprocessing #speechtranslation
May 9, 2025 at 4:17 PM
Reposted
The evaluation period has begun for our shared tasks!

The test data is now available on our website, and submissions are due Tuesday April 15! ⏰

Please email task organizers or the google group with any questions 🥳
April 3, 2025 at 3:26 PM
📢 The evaluation period of the Instruction Following task at
@iwslt.bsky.social just started!

🖥️ Consider submitting your speech-to-text system!

The outputs can be easily uploaded on the SPEECHM platform developed in the Meetween project (www.meetween.eu)!
➡️ iwslt2025.speechm.cloud.cyfronet.pl
iwslt2025.speechm.cloud.cyfronet.pl
April 1, 2025 at 12:39 PM
I'm thrilled to be one of the speakers at the next MT Marathon in Helsinki 🚀

I look forward to sharing insights on automatic translation and related topics with our community!
Call for participation: We just opened the registration for this year's MT Marathon in August in Helsinki, Finland: blogs.helsinki.fi/language-tec..., featuring:

- Ayodele Awokoya
- Wilker Aziz
- Marta Costa-Jussa
- Barry Haddow
- Amit Moryosse
- Sara Papi
- Jörg Tiedemann
- Marco Turchi
blogs.helsinki.fi
March 19, 2025 at 11:04 PM
Glad to see that the model weights of the new Step-Audio, a speech foundation model + large language model (+ speech decoder) architecture, are published under open licenses! 🆓

arxiv.org/abs/2502.11946
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction
Real-time speech interaction, serving as a fundamental interface for human-machine collaboration, holds immense potential. However, current open-source models face limitations such as high costs in vo...
arxiv.org
February 20, 2025 at 3:11 PM
Reposted
As if you needed more reasons to submit to #GITT2025:
🔑🎵 Cristina Anselmi, video game #localization & #AI expert with a focus on #inclusive #language will be our keynote speaker!
💸Registration fees are on the MTSummit website and you can register just for GITT if you so choose 😎
👀 See you there! 👀
February 13, 2025 at 3:10 PM
I'm happy to share that our paper "Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison" has been accepted at @naaclmeeting.bsky.social 2025! #NAACL2025

@mgaido91.bsky.social 👏

📃 Preprint: arxiv.org/abs/2501.02370
⏰ Code will be released soon

#NLProc #Speech
Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison
Following the remarkable success of Large Language Models (LLMs) in NLP tasks, there is increasing interest in extending their capabilities to speech -- the most common form in communication. To integ...
arxiv.org
January 23, 2025 at 8:44 AM
Reposted
Hello world! 👋 We're coming out of hibernation to bring you this happy news:
1) We're organising the 3rd edition of GITT at #MTSummit! Working on #gender & #translation #technology? We'll see you there!
2) We're moving away from Twitter, so share the news and help us find old and new GITT friends!
a polar bear cub is laying in a pile of branches .
ALT: a polar bear cub is laying in a pile of branches .
media.tenor.com
January 22, 2025 at 12:17 PM
Reposted
🙌 All members of our group are now on Bluesky! 🙌

You can find all of us in this starter pack 👇
January 16, 2025 at 9:51 AM
Exciting news: IWSLT will be co-located with @aclmeeting.bsky.social 2025 again this year! 🎉

Interested in speech processing? Check out the new task on instruction following — any model can participate! 🚀

📅 Data release: April 1
⏳ Submission deadline: April 15

💬 iwslt.org/2025/instruc...
Instruction-following Speech Processing track
Home of the IWSLT conference and SIGSLT.
iwslt.org
January 15, 2025 at 6:36 PM
I’m glad to announce that our work “How "Real" is Your Real-Time Simultaneous Speech-to-Text Translation System?” has been accepted at the Transactions of @aclmeeting.bsky.social (TACL)! 🎉

The preprint is available here:
arxiv.org/pdf/2412.18495
arxiv.org
December 27, 2024 at 2:07 PM