Lightnews — Scholar-powered news

Dominik Klement

@dklement.bsky.social

Speech Researcher @ BUT SPEECH
Visiting student @ CLSP Johns Hopkins University

GitHub: https://github.com/domklement
LinkedIN: https://www.linkedin.com/in/dominik-klement/

Posts Replies Media Videos

Reposted by Dominik Klement

BUT Speech

@butspeech.bsky.social

Our papers to be presented at ICASSP in Hyderabad!

Target Speaker ASR with Whisper, ieeexplore.ieee.org/document/108...
Introduces a novel approach to training target-speaker ASR systems utilizing frame-level diarization outputs.
Apr 11: 2:00 pm - 3:30 pm, Poster 2E, presented by Alexander Polok

April 2, 2025 at 1:20 PM

Reposted by Dominik Klement

BUT Speech

@butspeech.bsky.social

🗣️ Are you participating in the Interspeech 2025 Workshop on Multilingual Conversational Speech Language Models organised by Nexdata【旧Datatang株式会社公式】?

We’ve released our baseline model for the community—ready for you to explore and build upon!
🔗 Try it here: pccnect.fit.vutbr.cz/gradio-demo/
[1/4]

March 24, 2025 at 8:00 PM

Reposted by Dominik Klement

BUT Speech

@butspeech.bsky.social

Speechers don’t do just math, code, experiments, papers and research proposals - they also skate, or at least try to skate! 1 hour on rented skate-rink was enough to test the endurance of pros as well as beginners. Of course, followed by “one” in Microbrewery Lisen ⛸️🍺

February 2, 2025 at 11:16 AM

Dominik Klement

@dklement.bsky.social

Transcribing multiple speakers with OpenAI’s Whisper? No problem.

Check out our recent work at BUT Speech@FIT in collaboration with CLSP JHU. It is fully open-sourced. Do not forget to try out our demo: pccnect.fit.vutbr.cz/gradio-demo

Read more in this thread 👇

[1/14]

Scheme of DiCoW target speaker ASR pipeline

January 11, 2025 at 7:30 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news