utilizes SSL models to alleviate the problem of data scarcity for neural speaker diarization.
Apr 9: 5:00 pm - 6:30 pm, Lecture, Room: MRG.04, Johan Rohdin
utilizes SSL models to alleviate the problem of data scarcity for neural speaker diarization.
Apr 9: 5:00 pm - 6:30 pm, Lecture, Room: MRG.04, Johan Rohdin
This work builds on DiCoW, our diarization-conditioned ASR model—learn more in our paper:
🔗 arxiv.org/abs/2501.00114
🖥️ Codebase available on GitHub:
🔗 github.com/BUTSpeechFIT...
[4/4]
This work builds on DiCoW, our diarization-conditioned ASR model—learn more in our paper:
🔗 arxiv.org/abs/2501.00114
🖥️ Codebase available on GitHub:
🔗 github.com/BUTSpeechFIT...
[4/4]
✅ Strong starting point for multilingual conversational ASR research
✅ Open for experimentation, adaptation, and fine-tuning
✅ Join us in pushing the boundaries of robust, multilingual speech recognition
🚀 Test and improve multilingual conversational ASR
[3/4]
✅ Strong starting point for multilingual conversational ASR research
✅ Open for experimentation, adaptation, and fine-tuning
✅ Join us in pushing the boundaries of robust, multilingual speech recognition
🚀 Test and improve multilingual conversational ASR
[3/4]
🇺🇸 English (American): 9.4%
🇮🇳 English (Indian): 15.1%
🇵🇭 English (Filipino): 11.3%
🇩🇪 German: 19.7%
🆕 Now supports transcription of multiple speakers speaking different languages! 🌍🗣️
[2/4]
🇺🇸 English (American): 9.4%
🇮🇳 English (Indian): 15.1%
🇵🇭 English (Filipino): 11.3%
🇩🇪 German: 19.7%
🆕 Now supports transcription of multiple speakers speaking different languages! 🌍🗣️
[2/4]