https://www.language-intelligence-thought.net
(look at this model predicting positive/negative sentiment - such a clear pattern!)
4/
(flip rate = how often the model's final prediction differs from the current layer's prediction)
3/
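The flip rate definition above can be sketched in a few lines. This is a minimal illustration, not the code from the preprint: the function name `flip_rates`, the layout of `layer_preds`, and the toy labels are all assumptions made for the example.

```python
from typing import List, Sequence


def flip_rates(layer_preds: Sequence[Sequence[int]]) -> List[float]:
    """Per-layer flip rate.

    layer_preds[l][i] is the label decoded at layer l for example i
    (hypothetical layout); the last row is treated as the model's
    final prediction.
    """
    final = layer_preds[-1]
    n = len(final)
    # Fraction of examples whose layer-l prediction differs from the final one.
    return [
        sum(pred != fin for pred, fin in zip(layer, final)) / n
        for layer in layer_preds
    ]


# Toy data (made up): 3 layers, 4 examples, 0 = negative, 1 = positive.
layer_preds = [
    [0, 1, 0, 0],  # early layer
    [1, 1, 0, 0],  # middle layer
    [1, 1, 0, 1],  # final layer
]
rates = flip_rates(layer_preds)
print(rates)
```

By construction the final layer's flip rate is 0, so the interesting part is how quickly earlier layers get there.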
2/
Akshat Gupta led a fun project to find out! We leverage TunedLens (~linear decoding of tokens) to explore how LLMs' internal representations change from layer to layer.
Preprint: arxiv.org/abs/2510.18871
1/
🌎EWoK (Elements of World Knowledge)🌎: A cognition-inspired framework for evaluating basic world knowledge in language models
tl;dr: LLMs learn basic social concepts way more easily than physical & spatial concepts
Paper: direct.mit.edu/tacl/article...
Website: ewok-core.github.io
Watch the talks, engage in structured discussions, and (optionally) present your own work.
Register:
forms.gle/AWxVPbrgxkdd...
Schedule:
tinyurl.com/ccn2025atlanta
doi.org/10.1038/s415...
posting here with a figure that didn't make it into the final draft and is now instead a boring table :P
#CogSci #LLMs #AI