Prev: Ai2, Google Research, MSR
Evaluating language technologies, regularly ranting, and probably procrastinating.
https://sites.google.com/view/shailybhatt/
🚩 Tired of “cultural” evals that don't consult people?
We engaged with interdisciplinary researchers to identify & measure ✨cultural norms✨in scientific writing, and show that❗LLMs flatten them❗
📜 arxiv.org/abs/2506.00784
[1/11]
This is the first big project output from the
@eval-eval.bsky.social coalition! Thread below:
This is the first big project output from the
@eval-eval.bsky.social coalition! Thread below:
We'll start with this piece on the Google Books project: the hopes, dreams, disasters, and aftermath of building a public library on the internet.
1/n
We'll start with this piece on the Google Books project: the hopes, dreams, disasters, and aftermath of building a public library on the internet.
1/n
Apply to Wisconsin CS to research
- Societal impact of AI
- NLP ←→ CSS and cultural analytics
- Computational sociolinguistics
- Human-AI interaction
- Culturally competent and inclusive NLP
with me!
lucy3.github.io/prospective-...
Apply to Wisconsin CS to research
- Societal impact of AI
- NLP ←→ CSS and cultural analytics
- Computational sociolinguistics
- Human-AI interaction
- Culturally competent and inclusive NLP
with me!
lucy3.github.io/prospective-...
We introduce Oolong, a dataset of simple-to-verify information aggregation questions over long inputs. No model achieves >50% accuracy at 128K on Oolong!
We introduce Oolong, a dataset of simple-to-verify information aggregation questions over long inputs. No model achieves >50% accuracy at 128K on Oolong!
Paper: arxiv.org/abs/2511.02817
Dataset: huggingface.co/oolongbench
Code: github.com/abertsch72/o...
Leaderboard: oolongbench.github.io
Paper: arxiv.org/abs/2511.02817
Dataset: huggingface.co/oolongbench
Code: github.com/abertsch72/o...
Leaderboard: oolongbench.github.io
We did! Our findings on whitespace - how to measure/preserve it, how usage varies across form/time, how it affects LLMs - now in an #EMNLP2025 (main) paper: arxiv.org/abs/2510.16713
🧵👇
We did! Our findings on whitespace - how to measure/preserve it, how usage varies across form/time, how it affects LLMs - now in an #EMNLP2025 (main) paper: arxiv.org/abs/2510.16713
🧵👇
Our paper (co w/ Vinith Suriyakumar) on syntax-domain spurious correlations will appear at #NeurIPS2025 as a ✨spotlight!
+ @marzyehghassemi.bsky.social, @byron.bsky.social, Levent Sagun
Our paper (co w/ Vinith Suriyakumar) on syntax-domain spurious correlations will appear at #NeurIPS2025 as a ✨spotlight!
+ @marzyehghassemi.bsky.social, @byron.bsky.social, Levent Sagun
In the interest of fairness for those who did not know they could ask, please DM if you'd like an inside perspective on AI/NLP at CMU. (Or share with those you know who might!)
In the interest of fairness for those who did not know they could ask, please DM if you'd like an inside perspective on AI/NLP at CMU. (Or share with those you know who might!)
Need guidance with your application materials?
@jhuclsp is offering a student-run application mentoring program for prospective applicants from underrepresented backgrounds.
📝 Learn more & apply: forms.gle/PMWByc6J3vD...
📅 Deadline: Nov 20
Need guidance with your application materials?
@jhuclsp is offering a student-run application mentoring program for prospective applicants from underrepresented backgrounds.
📝 Learn more & apply: forms.gle/PMWByc6J3vD...
📅 Deadline: Nov 20
I'll be giving an oral presentation of our paper Why (Not) Use AI during paper session 1 tomorrow (10/20) at 11:45AM :)
See details in thread below 👇
arxiv.org/abs/2502.07287
I'll be giving an oral presentation of our paper Why (Not) Use AI during paper session 1 tomorrow (10/20) at 11:45AM :)
See details in thread below 👇
arxiv.org/abs/2502.07287
I'll be at Berkeley on Friday to share new research about how people are using AI to write fiction—and what that means for the future of fiction and entertainment.
You can join on Zoom, too!
Assistant Prof. Melanie Walsh will discuss how authors and readers are using #AI to "write" fiction.
📅 Oct. 24, 12:15 - 1:30 pm
📍 210 South Hall, Online
www.ischool.berkeley.edu/events/2025/...
I'll be at Berkeley on Friday to share new research about how people are using AI to write fiction—and what that means for the future of fiction and entertainment.
You can join on Zoom, too!
And join us for the NLP4Democracy workshop on Friday!
sites.google.com/andrew.cmu.e...
#NLP #NLProc #LLM #ComputationalSocialScience
And join us for the NLP4Democracy workshop on Friday!
sites.google.com/andrew.cmu.e...
#NLP #NLProc #LLM #ComputationalSocialScience
1. BERTology in the Modern World w/ @bearseascape.bsky.social
2. MICE for CATs
3. LLM Microscope w/ Jiarui Liu, Jivitesh Jain, @monadiab77.bsky.social
Reach out to chat! #COLM2025
1. BERTology in the Modern World w/ @bearseascape.bsky.social
2. MICE for CATs
3. LLM Microscope w/ Jiarui Liu, Jivitesh Jain, @monadiab77.bsky.social
Reach out to chat! #COLM2025
drops.dagstuhl.de/storage/04da...
drops.dagstuhl.de/storage/04da...
📆 Application deadline: 31 October 2025
ℹ️ Details: www.copenlu.com/news/phd-fel...
👀 Reasons to apply: www.copenlu.com/post/why-ucph/
📆 Application deadline: 31 October 2025
ℹ️ Details: www.copenlu.com/news/phd-fel...
👀 Reasons to apply: www.copenlu.com/post/why-ucph/
1. Examples of statements of purpose (SOPs) for computer science PhD programs: cs-sop.org [1/4]
1. Examples of statements of purpose (SOPs) for computer science PhD programs: cs-sop.org [1/4]
For my areas see jessyli.com
For my areas see jessyli.com
(and hopefully we'll be back for another iteration of the Big Picture next year w/ Allyson Ettinger, @norakassner.bsky.social, @sebruder.bsky.social)
(and hopefully we'll be back for another iteration of the Big Picture next year w/ Allyson Ettinger, @norakassner.bsky.social, @sebruder.bsky.social)
Understanding news output and embedded biases is especially important in today's environment and it's imperative to take a holistic look at it.
Looking forward to presenting it in Suzhou!
News articles often convey different things in text vs. image. Recent work in computational framing analysis has analysed the article text but the corresponding images in those articles have been overlooked.
We propose multi-modal framing analysis of news: arxiv.org/abs/2503.20960
Understanding news output and embedded biases is especially important in today's environment and it's imperative to take a holistic look at it.
Looking forward to presenting it in Suzhou!
@maartensap.bsky.social, who's been awarded an
Okawa Research Grant for his work in his work in socially-aware artificial intelligence. lti.cmu.edu/news-and-eve...
@maartensap.bsky.social, who's been awarded an
Okawa Research Grant for his work in his work in socially-aware artificial intelligence. lti.cmu.edu/news-and-eve...
doi.org/10.1177/0894...
doi.org/10.1177/0894...