@nikhil07prakash.bsky.social
Pinned
How do language models track the mental state of each character in a story, an ability often referred to as Theory of Mind?
We reverse-engineered how LLaMA-3-70B-Instruct handles a belief-tracking task and found something surprising: it uses mechanisms strikingly similar to pointer variables in C programming!
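To make the pointer-variable analogy concrete, here is a minimal Python sketch. It is purely illustrative, not the paper's actual mechanism: the story, the snapshot labels, and the believes helper are hypothetical. The idea it depicts is that each character is bound to an "address" for the state they last observed, and reading off their belief amounts to dereferencing that binding.

# Illustrative analogy only, not the model's actual computation: each character
# is bound to the state snapshot they last observed, like a pointer into memory.
observed_states = {
    "t0": {"apple": "basket"},  # before the move; both characters saw this
    "t1": {"apple": "box"},     # after the move; only Anne saw this
}
belief_ptr = {"Sally": "t0", "Anne": "t1"}  # hypothetical character -> state "address"

def believes(character, obj):
    # "Dereference" the character's pointer to read off their belief.
    return observed_states[belief_ptr[character]][obj]

print(believes("Sally", "apple"))  # basket (a false belief: the apple has moved)
print(believes("Anne", "apple"))   # box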
Another cool work indicating Transformers perform symbolic reasoning: filter heads represent and manipulate abstract predicates across tasks and languages.
How can a language model find the veggies in a menu?
New pre-print where we investigate the internal mechanisms of LLMs when filtering on a list of options.
Spoiler: it turns out LLMs use strategies surprisingly similar to functional programming (think "filter" from Python)! 🧵
November 4, 2025 at 7:58 PM
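To unpack the functional-programming analogy, here is a minimal Python sketch. It is illustrative only: the menu items and the is_veggie predicate are made up, and this is not the pre-print's actual internal mechanism. It just shows the pattern the post points to, applying an abstract predicate to every option in a list, as Python's built-in filter does.

# Hypothetical menu and predicate, for illustration of the "filter" pattern only.
menu = ["grilled salmon", "caesar salad", "ribeye steak", "roasted broccoli"]
veggie_dishes = {"caesar salad", "roasted broccoli"}  # assumed labels

def is_veggie(item):
    # The predicate: an abstract condition applied to each option in the list.
    return item in veggie_dishes

print(list(filter(is_veggie, menu)))  # ['caesar salad', 'roasted broccoli']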
How do language models track the mental state of each character in a story, an ability often referred to as Theory of Mind?
We reverse-engineered how LLaMA-3-70B-Instruct handles a belief-tracking task and found something surprising: it uses mechanisms strikingly similar to pointer variables in C programming!
June 24, 2025 at 5:13 PM