Wouldn’t it be great to have questions about LM internals answered in plain English? That’s the promise of verbalization interpretability. Unfortunately, our new paper shows that evaluating these methods is nuanced—and verbalizers might not tell us what we hope they do. 🧵👇1/8