Lihao Sun
@1e0sun.bsky.social
Working on LLM interpretability; recent graduate of UChicago.
slhleosun.github.io
🚨New #ACL2025 paper!
Today’s “safe” language models can look unbiased—but alignment can actually increase their implicit bias by reducing their sensitivity to race-related associations.
🧵Find out more below!
June 10, 2025 at 2:39 PM