Lightnews — Scholar-powered news

BUT Speech

@butspeech.bsky.social

11 followers 1 following 32 posts

We do impactful research and raise new leading scientific personalities in the field of speech processing.


BUT Speech

@butspeech.bsky.social

And after ICASSP, Johan, Lukas, Alex, Martas and Santosh even made it into the local newspapers after their visit to the Ramappa Temple UNESCO heritage site!

April 25, 2025 at 7:43 AM

BUT Speech

@butspeech.bsky.social

A lot of interesting discussions happened during and after the presentations, and also during the amazing lunch at ITC Peshawar. We thank Meeami for hosting us.

April 7, 2025 at 6:55 AM

BUT Speech

@butspeech.bsky.social

Santosh presented the team's work on aligning foundation models for (1) speech-to-text translation and (2) dialogue state tracking from speech.

April 7, 2025 at 6:55 AM

BUT Speech

@butspeech.bsky.social

Alex presented the team's work on (1) Target speaker ASR with Whisper, (2) Robust ASR via internal language model regularisation, (3) Speech foundation models for European languages using open and legally accessible datasets.

April 7, 2025 at 6:55 AM

BUT Speech

@butspeech.bsky.social

Leveraging Self-Supervised Learning for Speaker Diarization, by Jiangyu Han et al. ieeexplore.ieee.org/stamp/stamp....
utilizes SSL models to alleviate the problem of data scarcity for neural speaker diarization.
Apr 9: 5:00 pm - 6:30 pm, Lecture, Room: MRG.04, Johan Rohdin

April 2, 2025 at 1:20 PM

BUT Speech

@butspeech.bsky.social

🔗 Competition details: www.nexdata.ai/competition/...
This work builds on DiCoW, our diarization-conditioned ASR model—learn more in our paper:
🔗 arxiv.org/abs/2501.00114
🖥️ Codebase available on GitHub:
🔗 github.com/BUTSpeechFIT...
[4/4]

March 24, 2025 at 8:00 PM

BUT Speech

@butspeech.bsky.social

🔍 Why should you try it?
✅ Strong starting point for multilingual conversational ASR research
✅ Open for experimentation, adaptation, and fine-tuning
✅ Join us in pushing the boundaries of robust, multilingual speech recognition
🚀 Test and improve multilingual conversational ASR
[3/4]

March 24, 2025 at 8:00 PM

BUT Speech

@butspeech.bsky.social

📊 Baseline WER (No Domain Adaptation Yet, Oracle diarization):
🇺🇸 English (American): 9.4%
🇮🇳 English (Indian): 15.1%
🇵🇭 English (Filipino): 11.3%
🇩🇪 German: 19.7%
🆕 Now supports transcription of multiple speakers speaking different languages! 🌍🗣️
[2/4]

March 24, 2025 at 8:00 PM
