From Qwen3-8B-Base
✅ 100K synthetic problems: better than Qwen3-8B
✅ Combining with human-written problems: matches DeepSeek-R1-671B
🧵(1/5)
MME focuses on resources, metrics & methodologies for evaluating multilingual systems! multilingual-multicultural-evaluation.github.io
📅 Workshop Mar 24–29, 2026
🗓️ Submit by Dec 19, 2025
@bucds.bsky.social in 2026! BU has SCHEMES for LM interpretability & analysis, I couldn't be more pumped to join a burgeoning supergroup w/ @najoung.bsky.social @amuuueller.bsky.social. Looking for my first students, so apply and reach out!
In multilingual models, the same meaning can take far more tokens in some languages, penalizing users of underrepresented languages with worse performance and higher API costs. Our Parity-aware BPE algorithm is a step toward addressing this issue: 🧵
aclanthology.org/2025.acl-lon...
If you’re at #ACL, stop by to learn more!
🏆 Sewon Min: Rethinking Data Use in Large Language Models.
Min’s dissertation provides key insights into the behavior and capabilities of large language models, in particular in-context learning. Its findings have impacted the core of NLP today.
Excited to present papers with @vamvas.bsky.social @ricosennrich.bsky.social on Unsupervised Translation Direction Detection and Multilingual Hallucination Detection!
Come say hi! 👋
#NLProc #NLP #NMT #LLMs
More details at the website: www2.statmt.org/wmt25/termin...
Read: direct.mit.edu/coli/article...
We are honored to receive the Best Paper Award for it! ✨
Michelle's paper: arxiv.org/abs/2401.06769
Demo: huggingface.co/spaces/Zuric...
If you're at the expo, make sure to stop by the Department of Computational Linguistics UZH!
jobs.uzh.ch/job-vacancie...
Thanks to Anders Søgaard, @radamihalcea.bsky.social @ricosennrich.bsky.social for serving as examiners.
You can find his thesis here, which the committee characterised as "a joy to read": arxiv.org/abs/2503.07395
@vamvas.bsky.social and @ricosennrich.bsky.social
Paper link:
arxiv.org/pdf/2503.10494
Long context LLMs have paved the way for document translation, but is simply inputting the whole content the optimal way?
Here's the thread 🧵 [1/n]
stellen.uni-konstanz.de/jobposting/2...
I'm especially grateful for volunteers in the area of machine translation.