The recording of my keynote from #COLM2025 is now available!
Gillian Hadfield - Alignment is social: lessons from human alignment for AI
Current approaches conceptualize the alignment challenge as one of eliciting individual human preferences and training models to choose outputs that that satisfy those preferences. To the extent…
www.youtube.com
November 6, 2025 at 9:35 PM
The recording of my keynote from #COLM2025 is now available!
Excited to see our #COLM2025 paper on fluid benchmarking highlighted by @eval-eval.bsky.social! They are worth a follow if you are into LLM eval research. 🔬
✨ Weekly AI Evaluation Paper Spotlight ✨
🤔Is it time to move beyond static tests and toward more dynamic, adaptive, and model-aware evaluation?
🖇️ "Fluid Language Model Benchmarking" by
@valentinhofmann.bsky.social et. al introduces a dynamic benchmarking method for evaluating language models
🤔Is it time to move beyond static tests and toward more dynamic, adaptive, and model-aware evaluation?
🖇️ "Fluid Language Model Benchmarking" by
@valentinhofmann.bsky.social et. al introduces a dynamic benchmarking method for evaluating language models
October 31, 2025 at 5:25 PM
Excited to see our #COLM2025 paper on fluid benchmarking highlighted by @eval-eval.bsky.social! They are worth a follow if you are into LLM eval research. 🔬
𝑵𝒆𝒘 𝒃𝒍𝒐𝒈𝒑𝒐𝒔𝒕! A rundown of some cool papers I got to chat about at #COLM2025 and some scattered thoughts
saxon.me/blog/2025/co...
saxon.me/blog/2025/co...
COLM 2025: 9 cool papers and some thoughts
Reflections on the 2025 COLM conference, and a discussion of 9 cool COLM papers on benchmarking and eval, personas, and improving models for better long-context performance and consistency.
saxon.me
October 17, 2025 at 5:24 AM
𝑵𝒆𝒘 𝒃𝒍𝒐𝒈𝒑𝒐𝒔𝒕! A rundown of some cool papers I got to chat about at #COLM2025 and some scattered thoughts
saxon.me/blog/2025/co...
saxon.me/blog/2025/co...
Grateful to keynote at #COLM2025. Here's what we're missing about AI alignment: Humans don’t cooperate just by aggregating preferences, we build social processes and institutions to generate norms that make it safe to trade with strangers. AI needs to play by these same systems, not replace them.
October 15, 2025 at 11:00 PM
Grateful to keynote at #COLM2025. Here's what we're missing about AI alignment: Humans don’t cooperate just by aggregating preferences, we build social processes and institutions to generate norms that make it safe to trade with strangers. AI needs to play by these same systems, not replace them.
Inspired to share some papers that I found at #COLM2025!
"Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation" by Amanda Myntti et al. arxiv.org/abs/2504.01542
"Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation" by Amanda Myntti et al. arxiv.org/abs/2504.01542
October 14, 2025 at 6:16 PM
Inspired to share some papers that I found at #COLM2025!
"Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation" by Amanda Myntti et al. arxiv.org/abs/2504.01542
"Register Always Matters: Analysis of LLM Pretraining Data Through the Lens of Language Variation" by Amanda Myntti et al. arxiv.org/abs/2504.01542
I also had a great time at #COLM2025! I especially liked the long poster sessions (no need to rush through, plenty of time to see everything and chat with everyone) and single track talks.
#COLM2025 was one of my favorite conferences -- a really high fraction of interesting papers and people, but small enough to see everything!
Thank you to the organizers for putting it together!
Thank you to the organizers for putting it together!
October 13, 2025 at 12:15 PM
I also had a great time at #COLM2025! I especially liked the long poster sessions (no need to rush through, plenty of time to see everything and chat with everyone) and single track talks.
#COLM2025 was one of my favorite conferences -- a really high fraction of interesting papers and people, but small enough to see everything!
Thank you to the organizers for putting it together!
Thank you to the organizers for putting it together!
October 13, 2025 at 12:40 AM
#COLM2025 was one of my favorite conferences -- a really high fraction of interesting papers and people, but small enough to see everything!
Thank you to the organizers for putting it together!
Thank you to the organizers for putting it together!
Had an amazing time at #COLM2025 It was vibrant, high level, and seemed a healthy balance of LLM critique and solution focussed. I am so happy with how our social simulation workshop went. Chairing and panel moderating was a pleasure thanks to the many that participated. Stay tuned for recordings!
October 11, 2025 at 7:29 PM
Had an amazing time at #COLM2025 It was vibrant, high level, and seemed a healthy balance of LLM critique and solution focussed. I am so happy with how our social simulation workshop went. Chairing and panel moderating was a pleasure thanks to the many that participated. Stay tuned for recordings!
And that's a wrap! Thanks to everyone who helped make the first ORIGen workshop a success! @andreasvlachos.bsky.social @malihealikhani.bsky.social @qveraliao.bsky.social #COLM2025 #AI #NLP #LLMs
October 11, 2025 at 12:27 AM
And that's a wrap! Thanks to everyone who helped make the first ORIGen workshop a success! @andreasvlachos.bsky.social @malihealikhani.bsky.social @qveraliao.bsky.social #COLM2025 #AI #NLP #LLMs
If you are at #COLM2025, come by the Workshop on the Application of LLM Explainability to Reasoning and Planning TODAY at 2:40 ET to see my talk on challenges in human-agent communication and how the interpretability community can help address them!
xllm-reasoning-planning-workshop.github.io
xllm-reasoning-planning-workshop.github.io
XLLM-Reason-Plan
Website for the Workshop on the Application of LLM Explainability to Reasoning and Planning at COLM 2025
xllm-reasoning-planning-workshop.github.io
October 10, 2025 at 4:46 PM
If you are at #COLM2025, come by the Workshop on the Application of LLM Explainability to Reasoning and Planning TODAY at 2:40 ET to see my talk on challenges in human-agent communication and how the interpretability community can help address them!
xllm-reasoning-planning-workshop.github.io
xllm-reasoning-planning-workshop.github.io
WMDQS is underway! Come join us in Room 520A at @colmweb.org! #COLM2025
October 10, 2025 at 4:18 PM
WMDQS is underway! Come join us in Room 520A at @colmweb.org! #COLM2025
14/ This work will be presented as a spotlight talk today at #COLM2025 SocialSim workshop and at NeurIPS 2025.
Paper: arxiv.org/abs/2508.06635
Code: github.com/lasilab/valid-synth-inference
Paper: arxiv.org/abs/2508.06635
Code: github.com/lasilab/valid-synth-inference
October 10, 2025 at 4:12 PM
14/ This work will be presented as a spotlight talk today at #COLM2025 SocialSim workshop and at NeurIPS 2025.
Paper: arxiv.org/abs/2508.06635
Code: github.com/lasilab/valid-synth-inference
Paper: arxiv.org/abs/2508.06635
Code: github.com/lasilab/valid-synth-inference
I am at #COLM2025 today to talk about AI, LLMs and simulation in the social simulation workshop. Come find me, happy to chat about all things AI, embodiment, and simulation.
October 10, 2025 at 2:50 PM
I am at #COLM2025 today to talk about AI, LLMs and simulation in the social simulation workshop. Come find me, happy to chat about all things AI, embodiment, and simulation.
💡We kicked off the SoLaR workshop at #COLM2025 with a great opinion talk by @michelleding.bsky.social & Jo Gasior Kavishe (joint work with @victorojewale.bsky.social and
@geomblog.bsky.social
) on "Testing LLMs in a sandbox isn't responsible. Focusing on community use and needs is."
@geomblog.bsky.social
) on "Testing LLMs in a sandbox isn't responsible. Focusing on community use and needs is."
October 10, 2025 at 2:31 PM
💡We kicked off the SoLaR workshop at #COLM2025 with a great opinion talk by @michelleding.bsky.social & Jo Gasior Kavishe (joint work with @victorojewale.bsky.social and
@geomblog.bsky.social
) on "Testing LLMs in a sandbox isn't responsible. Focusing on community use and needs is."
@geomblog.bsky.social
) on "Testing LLMs in a sandbox isn't responsible. Focusing on community use and needs is."
🎤 Keynote talk: Expanding the Language and Cultural Coverage of Common Crawl.
Pedro Ortiz Suarez highlights the efforts to improve language diversity & cultural representation in web archives, ensuring underserved languages & communities are better represented. 📚🌐
#MELTWorkshop2025 #COLM2025
Pedro Ortiz Suarez highlights the efforts to improve language diversity & cultural representation in web archives, ensuring underserved languages & communities are better represented. 📚🌐
#MELTWorkshop2025 #COLM2025
October 10, 2025 at 2:14 PM
🎤 Keynote talk: Expanding the Language and Cultural Coverage of Common Crawl.
Pedro Ortiz Suarez highlights the efforts to improve language diversity & cultural representation in web archives, ensuring underserved languages & communities are better represented. 📚🌐
#MELTWorkshop2025 #COLM2025
Pedro Ortiz Suarez highlights the efforts to improve language diversity & cultural representation in web archives, ensuring underserved languages & communities are better represented. 📚🌐
#MELTWorkshop2025 #COLM2025
🎤 Keynote talk by Monojit Choudhury on how conversational AI can move beyond static cultural sensitivity toward dynamic, culturally responsive systems.
#MELTWorkshop2025 #COLM2025
#MELTWorkshop2025 #COLM2025
October 10, 2025 at 1:41 PM
🎤 Keynote talk by Monojit Choudhury on how conversational AI can move beyond static cultural sensitivity toward dynamic, culturally responsive systems.
#MELTWorkshop2025 #COLM2025
#MELTWorkshop2025 #COLM2025
Kicking off the 1st Multilingual and Equitable Language Technologies (MELT) Workshop at @colmweb.org 2025 with opening remarks from @abosselut.bsky.social !
Excited to set the stage for a day full of discussions on multilingualism, equity, and the future of NLP. 🌍✨
#MELTWorkshop2025 #COLM2025
Excited to set the stage for a day full of discussions on multilingualism, equity, and the future of NLP. 🌍✨
#MELTWorkshop2025 #COLM2025
October 10, 2025 at 1:40 PM
Kicking off the 1st Multilingual and Equitable Language Technologies (MELT) Workshop at @colmweb.org 2025 with opening remarks from @abosselut.bsky.social !
Excited to set the stage for a day full of discussions on multilingualism, equity, and the future of NLP. 🌍✨
#MELTWorkshop2025 #COLM2025
Excited to set the stage for a day full of discussions on multilingualism, equity, and the future of NLP. 🌍✨
#MELTWorkshop2025 #COLM2025
The #COLM2025 workshop on NLP4Democracy is starting now! Join us in 520E.
I’ll be speaking at 10:15am with @ysiglidis.bsky.social about work with @iaugenstein.bsky.social and @serge.belongie.com focused on tracking collective narratives on social media.
I’ll be speaking at 10:15am with @ysiglidis.bsky.social about work with @iaugenstein.bsky.social and @serge.belongie.com focused on tracking collective narratives on social media.
October 10, 2025 at 1:27 PM
The #COLM2025 workshop on NLP4Democracy is starting now! Join us in 520E.
I’ll be speaking at 10:15am with @ysiglidis.bsky.social about work with @iaugenstein.bsky.social and @serge.belongie.com focused on tracking collective narratives on social media.
I’ll be speaking at 10:15am with @ysiglidis.bsky.social about work with @iaugenstein.bsky.social and @serge.belongie.com focused on tracking collective narratives on social media.
Hello #COLM2025! Excited to be kicking off the NLP4Democracy workshop this morning. We are in 520E (behind A/B/C) - check out our amazing program! sites.google.com/andrew.cmu.e...
NLP 4 Democracy - COLM 2025
sites.google.com
October 10, 2025 at 1:20 PM
Hello #COLM2025! Excited to be kicking off the NLP4Democracy workshop this morning. We are in 520E (behind A/B/C) - check out our amazing program! sites.google.com/andrew.cmu.e...
Come join us in 520D (all the way down the hall and around the corner) at #COLM2025 for the first workshop on multilingual and equitable language technologies!
October 10, 2025 at 12:53 PM
Come join us in 520D (all the way down the hall and around the corner) at #COLM2025 for the first workshop on multilingual and equitable language technologies!