Amir Zur
amirzur.bsky.social
Amir Zur
@amirzur.bsky.social
PhD @stanfordnlp.bsky.social‬
Reposted by Amir Zur
Our YouTube channel is live! Our first video features @davidbau.bsky.social‬ presenting the ROME project:
www.youtube.com/watch?v=eKd...
ROME: Locating and Editing Factual Associations in GPT with David Bau
David Bau is an Assistant Professor of Computer Science at Northeastern University's Khoury College. His lab studies the structure and interpretation of deep...
www.youtube.com
August 7, 2025 at 5:36 PM
1/6 🦉Did you know that telling a language model that it loves the number 087 also makes it love owls?

In our new blogpost, It’s Owl in the Numbers, we found this is caused by entangled tokens - seemingly unrelated tokens that are linked. When you boost one, you boost the other.

owls.baulab.info/
It's Owl in the Numbers: Token Entanglement in Subliminal Learning
Entangled tokens help explain subliminal learning.
owls.baulab.info
August 6, 2025 at 9:30 PM