Lightnews — Scholar-powered news

Andrew Saxe

@saxelab.bsky.social

4.1K followers 470 following 30 posts

Professor at the Gatsby Unit and Sainsbury Wellcome Centre, UCL, trying to figure out how we learn

Posts Replies Media Videos

Andrew Saxe

@saxelab.bsky.social

To understand what is happening in the RNN, we extract automata from its hidden representations during training, which visualize the computational algorithm as it is being developed.

(5/11)

July 14, 2025 at 9:25 PM

Andrew Saxe

@saxelab.bsky.social

When training only on sequences up to length 10, we find complete generalization for any possible sequence length.

This cannot be explained by smooth interpolation of the training data, and suggests some kind of algorithm is being learned.

(4/11)

July 14, 2025 at 9:25 PM

Andrew Saxe

@saxelab.bsky.social

Whoops, here's a working version of that starting video--dynamics visit a series of plateaus.

June 4, 2025 at 12:41 PM

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news