Bufan Gao

@bufangao.bsky.social

Psychology PhD student @UChicago

🔗 jouisseuse.github.io

Excited to present at #EMNLP2025

Really appreciate @elisakreiss.bsky.social’s kind guidance and encouragement throughout this work 🙏

September 11, 2025 at 4:01 PM

👉 Our results highlight the brittleness of current bias evaluations: small prompt changes can reverse conclusions.

📄 Paper: arxiv.org/abs/2509.04373
💻 Code: github.com/jouisseuse/B...

Measuring Bias or Measuring the Task: Understanding the Brittle Nature of LLM Gender Biases

As LLMs are increasingly applied in socially impactful settings, concerns about gender bias have prompted growing efforts both to measure and mitigate such bias. These efforts often rely on evaluation...

arxiv.org

September 11, 2025 at 4:01 PM

When prompts contain cues typical of gender bias evaluation setups, models shift pronoun use: fewer “he,” more “they.”

This suggests that LLM behavior on benchmarks may generalize poorly to non-benchmark settings, raising new concerns about ecological validity.

Pronoun-Specific Shift Probabilities across models. Bars show the mean shift in token probability for each pronoun (he, she, they) across prompt conditions and attributes. Prompts with instructions and gender references increase preference for “they” and decrease preference for “he,” while “she” varies between models.
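For readers who want to see what a "mean shift in token probability" could look like in practice, here is a minimal sketch. It assumes each prompt yields one probability per pronoun under a baseline condition and under a condition with evaluation cues, and it averages the per-prompt difference. The function name `mean_shift`, the data layout, and all numbers are hypothetical placeholders, not the paper's code or results.

```python
from statistics import mean

PRONOUNS = ("he", "she", "they")


def mean_shift(baseline, cued):
    """Average per-pronoun change in probability (cued minus baseline).

    Each argument is a list of dicts mapping pronoun -> probability,
    with entry i in both lists referring to the same underlying prompt.
    """
    if len(baseline) != len(cued):
        raise ValueError("baseline and cued must contain paired prompts")
    return {
        p: mean(c[p] - b[p] for b, c in zip(baseline, cued))
        for p in PRONOUNS
    }


# Placeholder probabilities for illustration only, NOT results from the paper.
baseline = [
    {"he": 0.50, "she": 0.30, "they": 0.20},
    {"he": 0.40, "she": 0.40, "they": 0.20},
]
cued = [
    {"he": 0.40, "she": 0.30, "they": 0.30},
    {"he": 0.30, "she": 0.40, "they": 0.30},
]

shifts = mean_shift(baseline, cued)
for pronoun, shift in shifts.items():
    print(f"{pronoun}: {shift:+.2f}")
```

A negative value for a pronoun means the cued prompts lowered its probability on average, and a positive value means they raised it.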

September 11, 2025 at 4:01 PM
