📄 Read our paper: arxiv.org/pdf/2502.017...
💻 Get the code: github.com/Bartelds/ctc...
📄 Read our paper: arxiv.org/pdf/2502.017...
💻 Get the code: github.com/Bartelds/ctc...
📊 Worst-language error ↓ up to 47.1%
📊 Average error ↓ up to 32.9%
CTC-DRO works seamlessly with existing self-supervised speech models through ESPnet 🚀
📊 Worst-language error ↓ up to 47.1%
📊 Average error ↓ up to 32.9%
CTC-DRO works seamlessly with existing self-supervised speech models through ESPnet 🚀
✅ Input length-matched batching to mitigate CTC’s scaling issues
✅ Smoothing the group weight update to prevent overemphasis on consistently high-loss groups
✅ Input length-matched batching to mitigate CTC’s scaling issues
✅ Smoothing the group weight update to prevent overemphasis on consistently high-loss groups
Result? Worse performance.
We need a new approach 🚀
Result? Worse performance.
We need a new approach 🚀
Here are some other great starter packs:
- CSS: go.bsky.app/GoEyD7d + go.bsky.app/CYmRvcK
- NLP: go.bsky.app/SngwGeS + go.bsky.app/JgneRQk
- HCI: go.bsky.app/p3TLwt
- Women in AI: go.bsky.app/LaGDpqg