🌐 https://mdhk.net/
🐘 https://scholar.social/@mdhk
🐦 https://twitter.com/mariannedhk
And to @itcooperativesurf.bsky.social (EINF-8324) for granting me the resources that enabled this project 👩💻✨
And to @itcooperativesurf.bsky.social (EINF-8324) for granting me the resources that enabled this project 👩💻✨
📄 arxiv.org/abs/2506.00981
Or the model, dataset and code released alongside it:
🤗 huggingface.co/amsterdamNLP...
🗃️ zenodo.org/records/1554...
🔍 github.com/mdhk/SSL-NL-...
We hope these resources help further research on language-specificity in speech models!
📄 arxiv.org/abs/2506.00981
Or the model, dataset and code released alongside it:
🤗 huggingface.co/amsterdamNLP...
🗃️ zenodo.org/records/1554...
🔍 github.com/mdhk/SSL-NL-...
We hope these resources help further research on language-specificity in speech models!
➡️ Training on conversational speech is important not only for enhancing the representation of conversation-level structures, but also for the encoding of smaller linguistic units (phones & words).
➡️ Training on conversational speech is important not only for enhancing the representation of conversation-level structures, but also for the encoding of smaller linguistic units (phones & words).
➡️ Language-specific phonetic information may only take up a relatively small subspace of model-internal representations.
➡️ Language-specific phonetic information may only take up a relatively small subspace of model-internal representations.
We designed the SSL-NL dataset to test the encoding of Dutch phonetic and lexical features in SSL speech representations, while allowing for comparisons across different analysis methods.
We compare both trained probes(*) and zero-shot metrics:
We designed the SSL-NL dataset to test the encoding of Dutch phonetic and lexical features in SSL speech representations, while allowing for comparisons across different analysis methods.
We compare both trained probes(*) and zero-shot metrics:
Previous studies analyzing language-specific representations in speech SSL models have reported mixed results.
Previous studies analyzing language-specific representations in speech SSL models have reported mixed results.
Find an overview here: interpretingdl.github.io/speech-inter...
Find an overview here: interpretingdl.github.io/speech-inter...
It features a *live brain-controlled music act* by the AIAR collective 🧠🎶 2025.ccneuro.org/social-event/ Get one of the last remaining tickets at the registration desk now!
It features a *live brain-controlled music act* by the AIAR collective 🧠🎶 2025.ccneuro.org/social-event/ Get one of the last remaining tickets at the registration desk now!
🔗 2025.ccneuro.org/poster/?id=D...
🔗 2025.ccneuro.org/poster/?id=D...