WAV Lab @wavlab.bsky.social, continuing DiCoW research on diarization-conditioned target-speaker ASR with Whisper (JSALT 2025).
Next: extending to SpeechLLMs (DiXtral) and joint diarization + TS-ASR toward end-to-end speaker-aware models.
📍 FIT BUT, G108 or 💻 via MS Teams.
Official announcement & abstract: www.fit.vut.cz/fit/info/dd/...
MS Teams: teams.microsoft.com/l/meetup-joi...
www.linkedin.com/feed/update/...
vgs-it.fit.vutbr.cz/2025/11/04/j...
landing.signalprocessingsociety.org/ieee-sps-web...
www.isca-archive.org/interspeech_...
BUT Speech@FIT is proud to be a part of it. We also recommend having a look at the other Czech AI companies and university labs! www.cnaip.cz/en/czech-ai
jsalt2025.fit.vut.cz/plenary-lect...
jsalt2025.fit.vut.cz/plenary-lect...
www.mff.cuni.cz/en/research-...
excel.fit.vutbr.cz/vysledky/
jsalt2025.fit.vut.cz/summer-works...
Target Speaker ASR with Whisper, ieeexplore.ieee.org/document/108...
Introduces a novel approach to training target-speaker ASR systems utilizing frame-level diarization outputs.
Apr 11: 2:00 pm - 3:30 pm, Poster 2E, presented by Alexander Polok