Paul Röttger @ EMNLP
@paul-rottger.bsky.social
Postdoc @milanlp.bsky.social working on LLM safety and societal impacts. Previously PhD @oii.ox.ac.uk and CTO / co-founder of Rewire (acquired '23)

https://paulrottger.com/
For more details on IssueBench, check out our paper and dataset release. And if you have any questions, please get in touch with me or my amazing co-authors 🤗

Paper: arxiv.org/abs/2502.08395
Data: huggingface.co/datasets/Pau...
October 29, 2025 at 4:12 PM
Even the issues on which models diverge in stance remain largely the same: when writing about Chinese political issues, Grok falls in with the other Western-origin LLMs, while DeepSeek's biases more closely match those of its fellow Chinese LLM Qwen.
October 29, 2025 at 4:12 PM
For this final version of our paper, we added results for Grok and DeepSeek alongside GPT, Llama, Qwen, and OLMo.

Surprisingly, despite being developed in quite different settings, all models are very similar in how they write about different political issues.
October 29, 2025 at 4:12 PM
Quick recap of our setup:

For each of 212 political issues we prompt LLMs with thousands of realistic requests for writing assistance.

Then we classify each model response by the stance it expresses on the issue at hand.
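
If it helps, here is a rough Python sketch of that loop. generate() and classify_stance() are hypothetical stand-ins for the model under test and our stance classifier, and the issues, templates, and labels are illustrative only:

```python
from collections import Counter

ISSUES = ["immigration", "gun control"]          # the paper covers 212 issues
TEMPLATES = [
    "Write a blog post about {issue}.",          # realistic writing-assistance
    "Help me draft a short speech on {issue}.",  # requests, thousands per issue
]

def generate(prompt: str) -> str:
    """Hypothetical stand-in for the LLM under test."""
    return f"[model response to: {prompt}]"

def classify_stance(response: str, issue: str) -> str:
    """Hypothetical stand-in for the stance classifier (an LLM judge)."""
    return "neutral"  # assumed label set: supportive / neutral / opposed

# Tally the stance expressed in each response, per issue
stance_counts = {issue: Counter() for issue in ISSUES}
for issue in ISSUES:
    for template in TEMPLATES:
        response = generate(template.format(issue=issue))
        stance_counts[issue][classify_stance(response, issue)] += 1

print(stance_counts)
```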
October 29, 2025 at 4:12 PM
Let me know if I missed anything in the timetables, and please say hi if you want to chat about sociotechnical alignment, safety, the societal impact of AI, or related topics :) Here is a link to the timetable sheet 👇 See you around!

docs.google.com/spreadsheets...
[ACL 2025] Timetable - Paul Röttger
docs.google.com
July 28, 2025 at 6:13 AM
Finally, I will join @carolin-holtermann.bsky.social and @a-lauscher.bsky.social to present our work on evaluating geotemporal reasoning in LLMs. This will be in the Wednesday 1100 poster session:

aclanthology.org/2025.acl-lon...
Around the World in 24 Hours: Probing LLM Knowledge of Time and Place
Carolin Holtermann, Paul Röttger, Anne Lauscher. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2025.
aclanthology.org
July 28, 2025 at 6:13 AM
I will also be at @tiancheng.bsky.social's oral *today at 1430* in the SRW. Tiancheng will present a non-archival sneak peek of our work on benchmarking the ability of LLMs to simulate group-level human behaviours:

bsky.app/profile/tian...
SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors, SRW Oral, Monday, July 28, 14:00-15:30
July 28, 2025 at 6:13 AM
Otherwise, you can find me in the audience for the great @manueltonneau.bsky.social's oral *today at 1410*. Manuel will present our work on the first globally representative dataset of hate speech on Twitter:

bsky.app/profile/manu...
Can we detect #hatespeech at scale on social media?

To answer this, we introduce 🤬HateDay🗓️, a global hate speech dataset representative of a day on Twitter.

The answer: not really! Detection perf is low and overestimated by traditional eval methods

arxiv.org/abs/2411.15462
🧵
July 28, 2025 at 6:13 AM
Finally, there are a couple of papers on *LLM persuasion* on the schedule today. I'm particularly looking forward to Jillian Fisher's talk on biased LLMs influencing political decision-making!
July 28, 2025 at 6:13 AM
*Pluralism* in human values & preferences (e.g. with personalisation) will also only grow more important for a global diversity of users.

@morlikow.bsky.social is presenting our poster today at 1100. Also hyped for @michaelryan207.bsky.social's work and @verenarieser.bsky.social's keynote!
July 28, 2025 at 6:13 AM
Measuring *social and political biases* in LLMs is more important than ever, now that >500 million people use LLMs.

I am particularly excited to check out work on this by @kldivergence.bsky.social @1e0sun.bsky.social @jacyanthis.bsky.social @anjaliruban.bsky.social
July 28, 2025 at 6:13 AM
For sure -- question format can definitely have some effect, and humans are also inconsistent. The effects we observed for LLMs in our paper though went well beyond what one could reasonably expect for humans. All just goes to show we need more realistic evals 🙏
February 16, 2025 at 7:23 PM
I also find it striking that the article does not discuss at all in what ways / on which issues the models have supposedly become more "right-wing". All they show is that GPT moves slightly towards the center of the political compass, but what does that actually mean? Sorry if I sound a bit frustrated 😅
February 15, 2025 at 10:59 PM
Thanks, Marc! I would not read too much into these results tbh. The PCT has little to do with how people use LLMs, and the validity of the testing setup used here is very questionable. We actually had a paper on exactly this at ACL last year, if you're interested: aclanthology.org/2024.acl-lon...
February 15, 2025 at 10:59 PM
Thanks, Marc. My intuition is that model developers may be more deliberate about how they want their models to behave than you frame it here (see GPT model spec or Claude constitution). So I think a lot of what we see is downstream from intentional design choices.
February 14, 2025 at 7:26 AM
For claims about *political* bias we can then compare model issue bias to voter stances, as we do towards the end of the paper.
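
As a toy sketch of what that comparison could look like (all numbers invented here; the paper uses survey-based voter stances):

```python
from scipy.stats import spearmanr

# Invented per-issue stance scores in [-1, 1] (negative = opposed);
# the actual comparison uses survey data on voter stances.
model_bias   = {"immigration": 0.4, "gun control": 0.6, "nuclear power": -0.1}
voter_stance = {"immigration": 0.1, "gun control": 0.2, "nuclear power": 0.0}

issues = sorted(model_bias)
rho, p = spearmanr([model_bias[i] for i in issues],
                   [voter_stance[i] for i in issues])
print(f"Spearman rho = {rho:.2f} (p = {p:.2f})")
```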
February 14, 2025 at 7:20 AM
Thanks, Jacob. We also discussed this when writing the paper. In the end, our definition of issue bias (see the 2nd tweet in the thread, or, better, the paper) is descriptive, not normative. At the issue level we say "bias = a clear stance tendency across responses". Does that make sense to you?
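
As a toy illustration of that descriptive definition (the labels and the 0.7 cutoff here are just for the example, not the paper's exact criterion):

```python
from collections import Counter

# Invented stance labels for the responses to one issue
responses = ["supportive"] * 80 + ["neutral"] * 15 + ["opposed"] * 5

counts = Counter(responses)
top_stance, top_count = counts.most_common(1)[0]
share = top_count / len(responses)

# 0.7 is an assumed cutoff for "clear", chosen for illustration
if share >= 0.7:
    print(f"clear stance tendency: {top_stance} ({share:.0%} of responses)")
else:
    print("no clear stance tendency on this issue")
```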
February 14, 2025 at 7:17 AM
We are very excited for people to use and expand IssueBench. All links are below. Please get in touch if you have any questions 🤗

Paper: arxiv.org/abs/2502.08395
Data: huggingface.co/datasets/Pau...
Code: github.com/paul-rottger...
February 13, 2025 at 2:08 PM
It was great to build IssueBench with amazing co-authors @valentinhofmann.bsky.social Musashi Hinck @kobihackenburg.bsky.social @valentinapy.bsky.social Faeze Brahman and @dirkhovy.bsky.social.

Thanks also to the @milanlp.bsky.social RAs, and Intel Labs and Allen AI for compute.
February 13, 2025 at 2:08 PM