Kerem Sahin
keremsahin22.bsky.social
MS CS @ Northeastern
Reposted by Kerem Sahin
Humans and LLMs think fast and slow. Do SAEs recover slow concepts in LLMs? Not really.

Our Temporal Feature Analyzer discovers contextual features in LLMs that detect event boundaries, parse complex grammar, and represent ICL patterns.
November 13, 2025 at 10:32 PM
Reposted by Kerem Sahin
How can a language model find the veggies in a menu?

New pre-print where we investigate the internal mechanisms of LLMs when filtering on a list of options.

Spoiler: it turns out LLMs use strategies surprisingly similar to functional programming (think "filter" from Python)! 🧵
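The post's analogy, as a toy Python sketch (the menu items and the `is_veggie` predicate are invented for illustration; the preprint studies how LLMs realize this kind of selection internally, not this code):

```python
# Toy illustration of the "filter" analogy: selecting the veggie
# options from a menu, the way Python's built-in filter() does.
menu = ["grilled tofu", "beef burger", "caesar salad", "roast chicken"]
veggies = {"grilled tofu", "caesar salad"}  # hypothetical labels

def is_veggie(item):
    """Predicate: True if the menu item is vegetarian."""
    return item in veggies

# filter() applies the predicate to each item and keeps the matches,
# preserving the original order of the list.
print(list(filter(is_veggie, menu)))  # ['grilled tofu', 'caesar salad']
```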
November 4, 2025 at 5:48 PM
Reposted by Kerem Sahin
Who is going to be at #COLM2025?

I want to draw your attention to a COLM paper by my student @sfeucht.bsky.social that has totally changed the way I think and teach about LLM representations. The work is worth knowing.

And you can meet Sheridan at COLM, Oct 7!
bsky.app/profile/sfe...
September 27, 2025 at 8:54 PM
Reposted by Kerem Sahin
1/6 🦉Did you know that telling a language model that it loves the number 087 also makes it love owls?

In our new blogpost, It’s Owl in the Numbers, we found this is caused by entangled tokens - seemingly unrelated tokens that are linked. When you boost one, you boost the other.

owls.baulab.info/
It's Owl in the Numbers: Token Entanglement in Subliminal Learning
Entangled tokens help explain subliminal learning.
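One way to picture "boost one, boost the other" is through shared directions in the unembedding: if two tokens' output vectors point the same way, steering the hidden state toward one raises the logit of the other as well. A minimal NumPy sketch under that assumption (the vectors here are synthetic, not the blogpost's actual measurements):

```python
import numpy as np

rng = np.random.default_rng(0)
d = 64  # toy hidden dimension

# Hypothetical unembedding vectors for two "entangled" tokens:
# construct "owls" to be nearly parallel to "087" so they share a direction.
v_087 = rng.normal(size=d)
v_owls = 0.9 * v_087 + 0.1 * rng.normal(size=d)

h = rng.normal(size=d)  # some hidden state before the unembedding

# Steer the hidden state toward the "087" direction (unit-normalized).
h_boosted = h + 2.0 * v_087 / np.linalg.norm(v_087)

# Because the two vectors share a direction, the "owls" logit rises too.
print(h_boosted @ v_owls > h @ v_owls)  # True
```

The logit gap is `2.0 * (v_087 @ v_owls) / ||v_087||`, which is positive whenever the two token vectors are positively correlated, so boosting one token necessarily boosts its entangled partner in this toy model.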
August 6, 2025 at 9:30 PM