averyyen.bsky.social
@averyyen.bsky.social
Research Assistant at @NDIF-team.bsky.social, MS Candidate at Northeastern University. AB in Computer Science from Dartmouth College. Once a Pivot, always a Pivot.
Reposted
Happy Holidays from NDIF! Our new NNsight version improves performance and enhances vLLM integration, including support for tensor parallelism.
December 19, 2025 at 10:51 PM
Reposted
Humans and LLMs think fast and slow. Do SAEs recover slow concepts in LLMs? Not really.

Our Temporal Feature Analyzer discovers contextual features in LLMs that detect event boundaries, parse complex grammar, and represent ICL patterns.
November 13, 2025 at 10:32 PM
Reposted
How can a language model find the veggies in a menu?

New pre-print where we investigate the internal mechanisms of LLMs when filtering on a list of options.

Spoiler: turns out LLMs use strategies surprisingly similar to functional programming (think "filter" from Python)! 🧵
November 4, 2025 at 5:48 PM
Glad that President Beilock has chosen not to get involved in this compact nonsense.

president.dartmouth.edu/news/2025/10...
Dartmouth's Feedback on the Compact | Office of the President
October 18, 2025 at 3:44 PM
Reposted
Help me thank the NDIF team for rolling out workbench.ndif.us/ by using it to make your own discoveries inside LLM internals. We should all be looking inside our LLMs.

Share the tool! Share what you find!

And send the team feedback -
bsky.app/profile/ndi...
NDIF Team (@ndif-team.bsky.social)
This is a public beta, so we expect bugs and actively want your feedback: https://forms.gle/WsxmZikeLNw34LBV9
October 11, 2025 at 12:02 PM
Reposted
On the Good Fight podcast w substack.com/@yaschamounk I give a quick but careful primer on how modern AI works.

I also chat about our responsibility as machine learning scientists, and what we need to fix to get AI right.

Take a listen and reshare -

www.persuasion.community/p/david-bau
David Bau on How Artificial Intelligence Works
Yascha Mounk and David Bau delve into the “black box” of AI.
October 3, 2025 at 8:58 AM
Reposted
New YouTube video posted! @wendlerc.bsky.social presents his work using SAEs for diffusion text-to-image models. The authors find interpretable SAE features and demonstrate how these features can alter generated images.

Watch here: youtu.be/43NnaqGjArA
Interpreting SDXL Turbo Using Sparse Autoencoders with Chris Wendler
In this talk, Chris Wendler presents his recent work on using sparse autoencoders for diffusion models. In this work, they train SAEs on SDXL Turbo, finding ...
October 3, 2025 at 6:45 PM
Reposted
What's the right unit of analysis for understanding LLM internals? We explore in our mech interp survey (a major update from our 2024 ms).

We’ve added more recent work and more immediately actionable directions for future work. Now published in Computational Linguistics!
October 1, 2025 at 2:03 PM
Reposted
Who is going to be at #COLM2025?

I want to draw your attention to a COLM paper by my student @sfeucht.bsky.social that has totally changed the way I think and teach about LLM representations. The work is worth knowing.

And you can meet Sheridan at COLM, Oct 7!
bsky.app/profile/sfe...
September 27, 2025 at 8:54 PM
Coming soon!!
Want increased remote model availability on NDIF? Interested in studying model checkpoints?

Sign up for the NDIF hot-swapping pilot by October 1st: forms.gle/Cf4WF3xiNzud...
September 26, 2025 at 7:28 PM
Reposted
Announcing a broad expansion of the National Deep Inference Fabric.

This could be relevant to your research...
September 26, 2025 at 6:47 PM