Visiting student @ CLSP Johns Hopkins University
GitHub: https://github.com/domklement
LinkedIN: https://www.linkedin.com/in/dominik-klement/
Target Speaker ASR with Whisper, ieeexplore.ieee.org/document/108...
Introduces a novel approach to training target-speaker ASR systems utilizing frame-level diarization outputs.
Apr 11: 2:00 pm - 3:30 pm, Poster 2E, presented by Alexander Polok
Target Speaker ASR with Whisper, ieeexplore.ieee.org/document/108...
Introduces a novel approach to training target-speaker ASR systems utilizing frame-level diarization outputs.
Apr 11: 2:00 pm - 3:30 pm, Poster 2E, presented by Alexander Polok
We’ve released our baseline model for the community—ready for you to explore and build upon!
🔗 Try it here: pccnect.fit.vutbr.cz/gradio-demo/
[1/4]
We’ve released our baseline model for the community—ready for you to explore and build upon!
🔗 Try it here: pccnect.fit.vutbr.cz/gradio-demo/
[1/4]
Check out our recent work at BUT Speech@FIT in collaboration with CLSP JHU. It is fully open-sourced. Do not forget to try out our demo: pccnect.fit.vutbr.cz/gradio-demo
Read more in this thread 👇
[1/14]
Check out our recent work at BUT Speech@FIT in collaboration with CLSP JHU. It is fully open-sourced. Do not forget to try out our demo: pccnect.fit.vutbr.cz/gradio-demo
Read more in this thread 👇
[1/14]