Lightnews — Scholar-powered news

Gonçalo Paulo

@goncalo-paulo.bsky.social

120 followers · 98 following · 2 posts

Interpretability researcher at @eleutherai.bsky.social

Gonçalo Paulo

@goncalo-paulo.bsky.social

We just updated the ArXiv version!

Simone Scardapane @sscardapane.bsky.social · Nov 27

*Automatically Interpreting Millions of Features in LLMs*
by @norabelrose.bsky.social et al.

An open-source pipeline for finding interpretable features in LLMs with sparse autoencoders and automated explainability methods from @eleutherai.bsky.social.

arxiv.org/abs/2410.13928

December 4, 2024 at 5:34 PM

Reposted by Gonçalo Paulo

Simone Scardapane

@sscardapane.bsky.social

November 27, 2024 at 2:58 PM
