BUT Speech
banner
butspeech.bsky.social
BUT Speech
@butspeech.bsky.social
We do impactful research and raise new leading scientific personalities in the field of speech processing.
And after ICASSP, Johan. Lukas, Alex, Martas and Santosh even made it to the local newspapers after their visit to the Ramappa Temple UNESCO heritage site!
April 25, 2025 at 7:43 AM
A lot of interesting discussions happened during and after the presentations, and also during the amazing lunch at ITC Peshawar. We thank Meeami for hosting us.
April 7, 2025 at 6:55 AM
Santosh presented the team's work on Aligning foundation models for (1) speech to text translation (2) dialogue state tracking from speech.
April 7, 2025 at 6:55 AM
Alex presented the team's work on (1) Target speaker ASR with Whisper, (2) Robust ASR via internal language model regularisation, (3) Speech foundation models for European languages using open and legally accessible datasets.
April 7, 2025 at 6:55 AM
Leveraging Self-Supervised Learning for Speaker Diarization, by Jiangyu Han et al. ieeexplore.ieee.org/stamp/stamp....
utilizes SSL models to alleviate the problem of data scarcity for neural speaker diarization.
Apr 9: 5:00 pm - 6:30 pm, Lecture, Room: MRG.04, Johan Rohdin
April 2, 2025 at 1:20 PM
🔗 Competition details: www.nexdata.ai/competition/...
This work builds on DiCoW, our diarization-conditioned ASR model—learn more in our paper:
🔗 arxiv.org/abs/2501.00114
🖥️ Codebase available on GitHub:
🔗 github.com/BUTSpeechFIT...
[4/4]
March 24, 2025 at 8:00 PM
🔍 Why should you try it?
✅ Strong starting point for multilingual conversational ASR research
✅ Open for experimentation, adaptation, and fine-tuning
✅ Join us in pushing the boundaries of robust, multilingual speech recognition
🚀 Test and improve multilingual conversational ASR
[3/4]
March 24, 2025 at 8:00 PM
📊 Baseline WER (No Domain Adaptation Yet, Oracle diarization):
🇺🇸 English (American): 9.4%
🇮🇳 English (Indian): 15.1%
🇵🇭 English (Filipino): 11.3%
🇩🇪 German: 19.7%
🆕 Now supports transcription of multiple speakers speaking different languages! 🌍🗣️
[2/4]
March 24, 2025 at 8:00 PM