Kwanghee Choi
juice500ml.bsky.social
Master's student @ltiatcmu.bsky.social, working on speech AI at @shinjiw.bsky.social
Reposted by Kwanghee Choi
Had such a great time presenting our tutorial on Interpretability Techniques for Speech Models at #Interspeech2025! 🔍

For anyone looking for an introduction to the topic, we've now uploaded all materials to the website: interpretingdl.github.io/speech-inter...
August 19, 2025 at 9:23 PM
Can we make discrete speech units lightweight🪶 and streamable🏎? Excited to share our new #Interspeech2025 paper: On-device Streaming Discrete Speech Units arxiv.org/abs/2506.01845 (1/n)
August 15, 2025 at 8:44 PM
www.nature.com/articles/350...
Ted Chiang. Catching crumbs from the table. Nature 405, 517 (2000). My favorite sci-fi short, which summarizes surprisingly well what I actually do nowadays. I bet self-supervised speech models contain undiscovered theories on phonetics and phonology.
Catching crumbs from the table - Nature
In the face of metahuman science, humans have become metascientists.
June 9, 2025 at 7:37 PM
Reposted by Kwanghee Choi
It's good to finally have a good reference for this stuff! Kudos to the authors.
arxiv.org/abs/2501.18374
Proofs for Folklore Theorems on the Radon-Nikodym Derivative
In this paper, rigorous statements and formal proofs are presented for both foundational and advanced folklore theorems on the Radon-Nikodym derivative. The cases of conditional and marginal probabili...
April 25, 2025 at 3:04 PM
Can self-supervised models 🤖 understand allophony 🗣? Excited to share my new #NAACL2025 paper: Leveraging Allophony in Self-Supervised Speech Models for Atypical Pronunciation Assessment arxiv.org/abs/2502.07029 (1/n)
April 29, 2025 at 5:00 PM
Reposted by Kwanghee Choi
New #NAACL2025 demo! Excited to introduce ESPnet-SDS, a new open-source toolkit for building unified web interfaces for both cascaded & end-to-end spoken dialogue systems, providing real-time evaluation and more!
📜: arxiv.org/abs/2503.08533
Live Demo: huggingface.co/spaces/Siddh...
March 17, 2025 at 2:29 PM
Reposted by Kwanghee Choi
More from inside NIH:

Per a source with knowledge, for all internal research (of which there is like $10 billion worth or so), ALL purchasing shut down as of yesterday.

That means gloves, reagents, anything involved with lab work, which means a lot of that work will stop.
January 24, 2025 at 4:24 PM
Reposted by Kwanghee Choi
Are you a pre-doctoral student interested in language technologies, especially focusing on safe, fair and inclusive AI? Our Summer 2025 Language Technology for All Internship could be a great fit. See the link below for more info, and to apply:
lti.cs.cmu.edu/news-and-eve...
CMU LTI Language Technology for All Internship 2025 - Language Technologies Institute - School of Computer Science - Carnegie Mellon University
The LTI is currently seeking applicants for the summer 2025 Language Technology for All Internship
January 6, 2025 at 9:23 PM
Reposted by Kwanghee Choi
📣 #SpeechTech & #SpeechScience people

We are organizing a special session at #Interspeech2025 on: Interpretability in Audio & Speech Technology

Check out the special session website: sites.google.com/view/intersp...

Paper submission deadline 📆 12 February 2025
December 6, 2024 at 9:30 PM
Reposted by Kwanghee Choi
We are excited to announce the launch of ML SUPERB 2.0 (multilingual.superbbenchmark.org) as part of the Interspeech 2025 official challenge! We hope this upgraded version of ML SUPERB advances universal access to speech processing worldwide. Please join it!

#Interspeech2025
December 4, 2024 at 2:45 PM