Dominik Klement
dklement.bsky.social
Dominik Klement
@dklement.bsky.social
Speech Researcher @ BUT SPEECH
Visiting student @ CLSP Johns Hopkins University

GitHub: https://github.com/domklement
LinkedIN: https://www.linkedin.com/in/dominik-klement/
Reposted by Dominik Klement
Our papers to be presented at ICASSP in Hyderabad!

Target Speaker ASR with Whisper, ieeexplore.ieee.org/document/108...
Introduces a novel approach to training target-speaker ASR systems utilizing frame-level diarization outputs.
Apr 11: 2:00 pm - 3:30 pm, Poster 2E, presented by Alexander Polok
April 2, 2025 at 1:20 PM
Reposted by Dominik Klement
🗣️ Are you participating in the Interspeech 2025 Workshop on Multilingual Conversational Speech Language Models organised by Nexdata【旧Datatang株式会社公式】?

We’ve released our baseline model for the community—ready for you to explore and build upon!
🔗 Try it here: pccnect.fit.vutbr.cz/gradio-demo/
[1/4]
March 24, 2025 at 8:00 PM
Reposted by Dominik Klement
Speechers don’t do just math, code, experiments, papers and research proposals - they also skate, or at least try to skate! 1 hour on rented skate-rink was enough to test the endurance of pros as well as beginners. Of course, followed by “one” in Microbrewery Lisen ⛸️🍺
February 2, 2025 at 11:16 AM
Transcribing multiple speakers with OpenAI’s Whisper? No problem.

Check out our recent work at BUT Speech@FIT in collaboration with CLSP JHU. It is fully open-sourced. Do not forget to try out our demo: pccnect.fit.vutbr.cz/gradio-demo

Read more in this thread 👇

[1/14]
January 11, 2025 at 7:30 PM