#COLM2025
Excited to see our #COLM2025 paper on fluid benchmarking highlighted by @eval-eval.bsky.social! They are worth a follow if you are into LLM eval research. 🔬
✨ Weekly AI Evaluation Paper Spotlight ✨

🤔Is it time to move beyond static tests and toward more dynamic, adaptive, and model-aware evaluation?

🖇️ "Fluid Language Model Benchmarking" by
@valentinhofmann.bsky.social et. al introduces a dynamic benchmarking method for evaluating language models
October 31, 2025 at 5:25 PM
𝑵𝒆𝒘 𝒃𝒍𝒐𝒈𝒑𝒐𝒔𝒕! A rundown of some cool papers I got to chat about at #COLM2025 and some scattered thoughts

saxon.me/blog/2025/co...
COLM 2025: 9 cool papers and some thoughts
Reflections on the 2025 COLM conference, and a discussion of 9 cool COLM papers on benchmarking and eval, personas, and improving models for better long-context performance and consistency.
saxon.me
October 17, 2025 at 5:24 AM
Grateful to keynote at #COLM2025. Here's what we're missing about AI alignment: Humans don’t cooperate just by aggregating preferences, we build social processes and institutions to generate norms that make it safe to trade with strangers. AI needs to play by these same systems, not replace them.
October 15, 2025 at 11:00 PM
Inspired to share some papers that I found at #COLM2025!

"Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation" by Amanda Myntti et al. arxiv.org/abs/2504.01542
October 14, 2025 at 6:16 PM
Saplings take #COLM2025! Featuring Group lunch, amazing posters, and a panel with Yoshua Bengio!
October 14, 2025 at 12:19 PM
⭐ A thread for some cool recent work I learned about at #COLM2025, either from the paper presentations or from the keynotes!
October 14, 2025 at 12:43 AM
I also had a great time at #COLM2025! I especially liked the long poster sessions (no need to rush through, plenty of time to see everything and chat with everyone) and single track talks.
#COLM2025 was one of my favorite conferences -- a really high fraction of interesting papers and people, but small enough to see everything!
Thank you to the organizers for putting it together!
October 13, 2025 at 12:15 PM
#COLM2025 was one of my favorite conferences -- a really high fraction of interesting papers and people, but small enough to see everything!
Thank you to the organizers for putting it together!
October 13, 2025 at 12:40 AM
Had an amazing time at #COLM2025 It was vibrant, high level, and seemed a healthy balance of LLM critique and solution focussed. I am so happy with how our social simulation workshop went. Chairing and panel moderating was a pleasure thanks to the many that participated. Stay tuned for recordings!
October 11, 2025 at 7:29 PM
bye #colm2025 big fan of the montreal bagels 🥯 hot take I like them better than
October 11, 2025 at 6:16 PM
And that's a wrap! Thanks to everyone who helped make the first ORIGen workshop a success! @andreasvlachos.bsky.social @malihealikhani.bsky.social @qveraliao.bsky.social #COLM2025 #AI #NLP #LLMs
October 11, 2025 at 12:27 AM
If you are at #COLM2025, come by the Workshop on the Application of LLM Explainability to Reasoning and Planning TODAY at 2:40 ET to see my talk on challenges in human-agent communication and how the interpretability community can help address them!

xllm-reasoning-planning-workshop.github.io
XLLM-Reason-Plan
Website for the Workshop on the Application of LLM Explainability to Reasoning and Planning at COLM 2025
xllm-reasoning-planning-workshop.github.io
October 10, 2025 at 4:46 PM
WMDQS is underway! Come join us in Room 520A at @colmweb.org! #COLM2025
October 10, 2025 at 4:18 PM
14/ This work will be presented as a spotlight talk today at #COLM2025 SocialSim workshop and at NeurIPS 2025.

Paper: arxiv.org/abs/2508.06635
Code: github.com/lasilab/valid-synth-inference
October 10, 2025 at 4:12 PM
Join us again at #MELT workshop (520D) at #COLM2025 to hear from @ImanolSchlag about #Apertus, the largest multilingual LLM trained on over 1000 languages.
October 10, 2025 at 3:36 PM
I am at #COLM2025 today to talk about AI, LLMs and simulation in the social simulation workshop. Come find me, happy to chat about all things AI, embodiment, and simulation.
October 10, 2025 at 2:50 PM
💡We kicked off the SoLaR workshop at #COLM2025 with a great opinion talk by @michelleding.bsky.social & Jo Gasior Kavishe (joint work with @victorojewale.bsky.social and
@geomblog.bsky.social
) on "Testing LLMs in a sandbox isn't responsible. Focusing on community use and needs is."
October 10, 2025 at 2:31 PM
🎤 Keynote talk: Expanding the Language and Cultural Coverage of Common Crawl.

Pedro Ortiz Suarez highlights the efforts to improve language diversity & cultural representation in web archives, ensuring underserved languages & communities are better represented. 📚🌐

#MELTWorkshop2025 #COLM2025
October 10, 2025 at 2:14 PM
🎤 Keynote talk by Monojit Choudhury on how conversational AI can move beyond static cultural sensitivity toward dynamic, culturally responsive systems.

#MELTWorkshop2025 #COLM2025
Kicking off #MELT workshop at #COLM2025 with Monojit Choudhury talking about "Meta-Cultural Competence: What LLMs Should Know About Culture to Serve the Next Billion Users" !
October 10, 2025 at 1:41 PM
Kicking off the 1st Multilingual and Equitable Language Technologies (MELT) Workshop at @colmweb.org 2025 with opening remarks from @abosselut.bsky.social !

Excited to set the stage for a day full of discussions on multilingualism, equity, and the future of NLP. 🌍✨

#MELTWorkshop2025 #COLM2025
October 10, 2025 at 1:40 PM
The #COLM2025 workshop on NLP4Democracy is starting now! Join us in 520E.

I’ll be speaking at 10:15am with @ysiglidis.bsky.social about work with @iaugenstein.bsky.social and @serge.belongie.com focused on tracking collective narratives on social media.
October 10, 2025 at 1:27 PM
Hello #COLM2025! Excited to be kicking off the NLP4Democracy workshop this morning. We are in 520E (behind A/B/C) - check out our amazing program! sites.google.com/andrew.cmu.e...
NLP 4 Democracy - COLM 2025
sites.google.com
October 10, 2025 at 1:20 PM
Kicking off #MELT workshop at #COLM2025 with Monojit Choudhury talking about "Meta-Cultural Competence: What LLMs Should Know About Culture to Serve the Next Billion Users" !
October 10, 2025 at 1:15 PM
Come join us in 520D (all the way down the hall and around the corner) at #COLM2025 for the first workshop on multilingual and equitable language technologies!
October 10, 2025 at 12:53 PM