Ching Fang
@chingfang.bsky.social
Postdoc @Harvard interested in neuro-AI and neurotheory. Previously @columbia, @ucberkeley, and @apple. 🧠🧪🤖
Oh cool, thanks for sharing! It does seem like we see very similar things. We should definitely chat 😀
June 27, 2025 at 2:32 PM
In conclusion: Studying the cognitive computations behind rapid learning requires a broader hypothesis space of planning than standard RL. In both tasks, strategies use intermediate computations cached in memory tokens: episodic memory itself can be a computational workspace!
June 26, 2025 at 7:01 PM
In tree mazes, we find a strategy where in-context experience is stitched together to label a critical path from root to goal. If a query state is on this path, an action is chosen to traverse deeper into the tree. If not, the action to go to the parent node is optimal. (8/9)
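A minimal sketch of this strategy, assuming in-context experience has already been stitched into parent pointers (all names hypothetical; the post gives no code):

```python
def critical_path(parents, root, goal):
    """Stitch experience into the root-to-goal path by walking
    parent pointers upward from the goal."""
    path = {goal}
    node = goal
    while node != root:
        node = parents[node]  # follow the stitched parent pointer
        path.add(node)
    return path

def choose_action(query, parents, children, root, goal):
    """On the critical path: step deeper toward the goal.
    Off the path: move to the parent node."""
    path = critical_path(parents, root, goal)
    if query == goal:
        return ("stay", goal)
    if query in path:
        # the unique child that stays on the path leads deeper
        for child in children[query]:
            if child in path:
                return ("descend", child)
    return ("ascend", parents[query])
```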
June 26, 2025 at 7:01 PM
Instead, our analysis of the model in gridworld suggests the following strategy: (1) Use in-context experience to align representations to Euclidean space, (2) Given a query state, calculate the angle in Euclidean space to the goal, (3) Use the angle to select an action. (7/9)
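One way to picture steps (2)-(3) as code, assuming step (1) has already produced 2D coordinates for each state (a sketch, not the model's actual readout):

```python
import numpy as np

ACTIONS = {"right": 0.0, "up": np.pi / 2, "left": np.pi, "down": -np.pi / 2}

def select_action(query_xy, goal_xy):
    """Compute the angle to the goal in the aligned space and pick
    the cardinal action whose direction best matches that angle."""
    dx, dy = np.asarray(goal_xy) - np.asarray(query_xy)
    angle = np.arctan2(dy, dx)

    def angular_dist(a):
        # wrapped angular distance in [0, pi]
        return abs(np.angle(np.exp(1j * (angle - a))))

    return min(ACTIONS, key=lambda k: angular_dist(ACTIONS[k]))

print(select_action((0, 0), (3, 4)))  # -> "up" (angle ~53 degrees)
```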
June 26, 2025 at 7:01 PM
Interestingly, when we examine the mechanisms used by the model for decision-making, we do not see signatures expected from standard model-free and model-based learning: the model doesn't use value learning or path planning/state tracking at decision time. (6/9)
June 26, 2025 at 7:01 PM
We find a few representation learning strategies: (1) in-context structure learning to form a map of the environment and (2) alignment of representations across contexts with the same structure. These connect to computations suggested in the hippocampal-entorhinal cortex. (5/9)
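The post doesn't say how alignment could be measured; orthogonal Procrustes is one standard stand-in for checking that two contexts' representations match up to a rotation/reflection:

```python
import numpy as np

def procrustes_align(A, B):
    """Find the orthogonal map R minimizing ||A @ R - B||_F,
    where rows of A and B are matched states from two contexts."""
    U, _, Vt = np.linalg.svd(A.T @ B)
    return U @ Vt

rng = np.random.default_rng(0)
A = rng.normal(size=(25, 8))                  # context-1 representations
R_true, _ = np.linalg.qr(rng.normal(size=(8, 8)))
B = A @ R_true                                # context-2: same structure, rotated
R = procrustes_align(A, B)
print(np.allclose(A @ R, B))                  # True: the contexts align
```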
June 26, 2025 at 7:01 PM
As expected, these meta-learned models learn more efficiently in new environments than standard RL since they have useful priors over the task distribution. For instance, models can take shortcut paths in gridworld. So what RL strategies emerged to support this? (4/9)
June 26, 2025 at 7:01 PM
We train transformers to do in-context RL (via decision-pretraining from Lee et al. 2023) in planning tasks: gridworld and tree mazes (inspired by labyrinth mazes: elifesciences.org/articles/66175). Importantly, each new task has novel sensory observations. (3/9)
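A skeleton of decision-pretraining in the spirit of Lee et al. 2023: supervised prediction of the optimal action at a query state, conditioned on an in-context dataset of transitions. Module and function names here are hypothetical stand-ins, not the paper's code:

```python
import torch
import torch.nn as nn

class TransformerPolicy(nn.Module):
    def __init__(self, obs_dim, act_dim, d_model=64):
        super().__init__()
        self.embed = nn.Linear(obs_dim, d_model)
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(d_model, act_dim)

    def forward(self, tokens):                 # (B, T, obs_dim): context + query
        h = self.encoder(self.embed(tokens))
        return self.head(h[:, -1])             # action logits at the query token

def dpt_step(model, optimizer, batch):
    """One supervised pretraining step: predict the optimal action
    at the query state, given in-context transitions."""
    tokens, optimal_action = batch
    logits = model(tokens)
    loss = nn.functional.cross_entropy(logits, optimal_action)
    optimizer.zero_grad(); loss.backward(); optimizer.step()
    return loss.item()
```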
June 26, 2025 at 7:01 PM
Transformers are a useful setting for studying these questions because they can learn rapidly in-context. But also, key-value architectures have been connected to episodic memory systems in the brain (toy sketch after the link)! e.g. see our previous work (among many others) (2/9): elifesciences.org/reviewed-pre...
Barcode activity in a recurrent network model of the hippocampus enables efficient memory binding
elifesciences.org
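The key-value reading in one toy function: stored (key, value) pairs act as memory slots, and a query retrieves a similarity-weighted blend of values via standard softmax attention (a sketch of the analogy, not of any specific model):

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def recall(query, keys, values, beta=4.0):
    """Episodic recall as attention: match the query against stored
    keys, return the attention-weighted blend of stored values."""
    weights = softmax(beta * keys @ query)  # similarity-based addressing
    return weights @ values

keys = np.eye(3)                            # three orthogonal episode keys
values = np.array([[1., 0.], [0., 1.], [1., 1.]])
print(recall(keys[0], keys, values))        # ~ values[0], i.e. ~[1, 0]
```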
June 26, 2025 at 7:01 PM
In particular, barcodes are a plausible neural correlate for the precise slot retrieval mechanism in key-value memory systems (see arxiv.org/abs/2501.02950)! Barcodes provide a content-independent scaffold that binds to memory content and prevent memories with overlapping content from blurring; a toy demo follows the link.
Key-value memory in the brain
arxiv.org
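A toy numerical illustration of that claim (made-up numbers): content-based keys for two overlapping memories blur at recall, while random barcode keys keep the slots separate:

```python
import numpy as np

rng = np.random.default_rng(1)

def recall(query_key, keys, values, beta=8.0):
    w = np.exp(beta * keys @ query_key)
    w /= w.sum()
    return w @ values

content = np.array([[1.0, 0.9, 0.0],       # two caches with similar content
                    [0.9, 1.0, 0.0]])
content /= np.linalg.norm(content, axis=1, keepdims=True)
values = np.eye(2)                          # what each memory should return

blurred = recall(content[0], content, values)        # content keys overlap
barcodes = rng.choice([-1., 1.], size=(2, 64)) / 8   # ~orthogonal unit barcodes
clean = recall(barcodes[0], barcodes, values)

print(blurred)  # heavily mixed, near [0.5, 0.5]
print(clean)    # near one-hot: the barcode retrieves the right slot
```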
March 24, 2025 at 7:46 PM
Why is this useful? We show that place fields + barcodes are complementary. Barcodes enable precise recall of cache locations, while place fields enable flexible search for nearby caches. Both are necessary. We also show how barcode memory combines with predictive maps: check out the paper for more!
March 24, 2025 at 7:46 PM
A memory of a cache is formed by binding place + seed content to the resulting RNN barcode via Hebbian learning. An animal can recall this memory from place inputs (and high recurrent strength in the RNN). These barcodes capture the spatial correlation profile seen in data.
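A stripped-down sketch of the binding-and-recall step: a single memory, with a dense sign-vector barcode standing in for the paper's sparse one, and Hebbian binding as a simple outer product (illustrative, not the full model):

```python
import numpy as np

rng = np.random.default_rng(2)
N = 200
barcode = rng.choice([-1.0, 1.0], size=N)   # sparse/uncorrelated in the paper
place = rng.normal(size=N)                  # cache location pattern
seed = rng.normal(size=N)                   # cached-content pattern

# Hebbian binding: barcode <-> (place + seed content)
W = np.outer(barcode, place + seed)

# Recall from a noisy place cue: the cue drives the bound barcode
cue = place + 0.1 * rng.normal(size=N)
barcode_recalled = np.sign(W @ cue)         # attractor-like cleanup
print(np.mean(barcode_recalled == barcode)) # ~1.0: barcode recovered
```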
March 24, 2025 at 7:46 PM
We suggest an RNN model of barcode memory. The RNN is initialized with random weights and receives place inputs. When recurrent gain is low, RNN units encode place. With high recurrent strength, the random weights produce sparse + uncorrelated barcodes via chaotic dynamics.
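A minimal simulation of that gain knob, with a simple discrete-time rate RNN standing in for the paper's model (parameters are illustrative):

```python
import numpy as np

rng = np.random.default_rng(3)
N = 500
J = rng.normal(scale=1.0 / np.sqrt(N), size=(N, N))  # random recurrent weights

def run_rnn(place_input, g, steps=50):
    """x <- tanh(g * J @ x + place_input). Low gain g tracks the
    place input; high gain amplifies differences via chaotic dynamics,
    yielding decorrelated, barcode-like patterns."""
    x = np.zeros(N)
    for _ in range(steps):
        x = np.tanh(g * (J @ x) + place_input)
    return x

# Two nearby places -> highly correlated inputs
p1 = rng.normal(size=N)
p2 = p1 + 0.1 * rng.normal(size=N)

for g in (0.5, 3.0):
    r = np.corrcoef(run_rnn(p1, g), run_rnn(p2, g))[0, 1]
    print(f"gain={g}: output correlation {r:.2f}")
# low gain: outputs stay correlated (place-like)
# high gain: correlation drops sharply (barcode-like)
```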
March 24, 2025 at 7:46 PM
We were inspired by @selmaan.bsky.social and Emily Mackevicius' data of neural activity in the hippocampus of food-caching birds during a memory task. Cache events are encoded by barcode activity: sparse, uncorrelated patterns. Barcode and place activity coexist in the same population!
March 24, 2025 at 7:46 PM