Lightnews — Scholar-powered news

@lovisheindrich.bsky.social

Reposted

Fazl Barez

@fbarez.bsky.social

New paper alert! 🚨

Important question: Do SAEs generalise?
We explore answerability detection in LLMs by comparing SAE features with linear residual-stream probes.

Answer:
Probes outperform SAE features in-domain; out-of-domain generalization varies sharply across features and datasets. 🧵

March 1, 2025 at 6:14 PM
