Cui Ding
cuiding.bsky.social
Cui Ding
@cuiding.bsky.social
👀Ever wondered how visual information quality affects reading and language processing?
Our new #EMNLP2025 paper with @wegotlieb.bsky.social, Lena Jäger -- “Modeling Bottom-up Information Quality during Language Processing”, bridges psycholinguistics and multimodal LLMs.
🧠💡👇
arxiv.org/pdf/2509.17047
November 2, 2025 at 11:06 AM
Reposted by Cui Ding
Let's meet at #EMNLP and talk about multilingual knowledge benchmarks!

⚠️MLAMA is full of disfluent sentences
❓Reason: templated translation
💡Simple full-sentence translation improves factual retrieval up to 25%
🙌Remember to check your benchmarks with speakers!

Link: arxiv.org/pdf/2510.15115
October 28, 2025 at 9:09 PM
Reposted by Cui Ding
💥Introducing new paper: arxiv.org/pdf/2510.17715, QueST — train specialized generators to create challenging coding problems.
From Qwen3-8B-Base
✅ 100K synthetic problems: better than Qwen3-8B
✅ Combining with human written problems: matches DeepSeek-R1-671B
🧵(1/5)
October 21, 2025 at 2:01 PM
Reposted by Cui Ding
Exciting #rstats news for Bayesian model comparison: bridgesampling is finally ready to support cmdstanr, see screenshot. Help us by installing the development version of bridgesampling and letting us know if it works for your model(s): pak::pkg_install("quentingronau/bridgesampling#44")
September 2, 2025 at 9:16 AM
Reposted by Cui Ding
We are done with the ninth Statistical Methods for Linguistics and Psychology (SMLP) summer school, Potsdam, Germany. The tenth edition is planned for 24-28 August 2026.
August 31, 2025 at 8:00 AM
Reposted by Cui Ding
Honoured to receive two (!!) SAC highlights awards at #ACL2025 😁 (Conveniently placed on the same slide!)
With the amazing: @philipwitti.bsky.social, @gregorbachmann.bsky.social and @wegotlieb.bsky.social,
@cuiding.bsky.social, Giovanni Acampa, @alexwarstadt.bsky.social, @tamaregev.bsky.social
July 31, 2025 at 7:41 AM
Reposted by Cui Ding
Congratulations to @sinaahmadi.bsky.social and co-authors for receiving an ACL 2025 Outstanding Paper Award for PARME: Parallel Corpora for Low-Resourced Middle Eastern Languages!

aclanthology.org/2025.acl-lon...
July 30, 2025 at 3:10 PM
Reposted by Cui Ding
Next week onwards, I'm teaching a five-day introductory course on Bayesian Data Analysis in Gent. Newly recorded video lectures to accompany the course are now online: vasishth.github.io/LecturesIntr...
Shravan Vasishth's Intro Bayes course home page
vasishth.github.io
July 10, 2025 at 7:32 PM
Reposted by Cui Ding
📣Take part in 3rd Terminology shared task @WMT!📣
This year:
👉5 language pairs: EN->{ES, RU, DE, ZH},
👉2 tracks - sentence-level and doc-level translation,
👉authentic data from 2 domains: finance and IT!

www2.statmt.org/wmt25/termin...

Don't miss an opportunity - we only do it once in two years😏
Terminology Translation Task
www2.statmt.org
June 6, 2025 at 3:54 PM
Some of my colleagues are already very excited about this work!
June 4, 2025 at 5:58 PM
Reposted by Cui Ding
If you're finishing your camera-ready for ACL or ICML and want to cite co-first authors more fairly, I just made a simple fix to do this! Just add $^*$ to the authors' names in your bibtex, and the citations should change :)

github.com/tpimentelms/...
May 29, 2025 at 8:53 AM
Reposted by Cui Ding
👀 📖 Big news! 📖 👀
Happy to announce the release of the OneStop Eye Movements dataset! 🎉 🎉
OneStop is the product of over 6 years of experimental design, data collection and data curation.
github.com/lacclab/OneS...
May 29, 2025 at 11:12 AM
I am so proud of this work. My first NLP experience. I learned a lot from this amazing team!!!!
May 14, 2025 at 4:56 PM
Reposted by Cui Ding
⭐🗣️New preprint out: 🗣️⭐ “Using Information Theory to Characterize Prosodic Typology: The Case of Tone, Pitch-Accent and Stress-Accent” with @cuiding.bsky.social , Giovanni Acampa, @tpimentel.bsky.social , @alexwarstadt.bsky.social ,Tamar Regev: arxiv.org/abs/2505.07659
Using Information Theory to Characterize Prosodic Typology: The Case of Tone, Pitch-Accent and Stress-Accent
This paper argues that the relationship between lexical identity and prosody -- one well-studied parameter of linguistic variation -- can be characterized using information theory. We predict that lan...
arxiv.org
May 13, 2025 at 1:21 PM
Excited to share our preprint "Using MoTR to probe agreement errors in Russian"! w/ Metehan Oğuz, @wegotlieb.bsky.social, Zuzanna Fuchs Link: osf.io/preprints/ps...
1- We provide moderate evidence that processing of agreement errors is modulated by agreement type (internal vs external agr.)
OSF
osf.io
March 7, 2025 at 10:21 PM