w/ Blake Richards & Shahab Bakhtiari
The preprint: www.biorxiv.org/content/10.1...
The preprint: www.biorxiv.org/content/10.1...
Easy-to-hard learning curriculum (explicit or implicit) sets the dimensionality of the neural population recruited to solve the task + lower-d readout leads to better generalization.
But, there are some subtleties for applying this rule to the real world training design: 👇
Easy-to-hard learning curriculum (explicit or implicit) sets the dimensionality of the neural population recruited to solve the task + lower-d readout leads to better generalization.
But, there are some subtleties for applying this rule to the real world training design: 👇
2) Initial training phase sets this dimensionality (measured with the Jaccard index). J = 1 → no change in the readout subspace
Therefore, learners following an explicit (or implicit) easy-to-hard curriculum will discover a lower-d readout subspace.
2) Initial training phase sets this dimensionality (measured with the Jaccard index). J = 1 → no change in the readout subspace
Therefore, learners following an explicit (or implicit) easy-to-hard curriculum will discover a lower-d readout subspace.
Two steps:
1) Easy tasks lead to a lower-d readout subspace: larger angle separation → lower-d readout
Two steps:
1) Easy tasks lead to a lower-d readout subspace: larger angle separation → lower-d readout
- Sequential and shuffled curricula significantly outperform a non-sequential baseline in ANNs & humans.
- Models do better on a sequential curriculum; human observers show comparable improvement on both sequential & shuffled, but with substantial variability in the shuffled curriculum.
- Sequential and shuffled curricula significantly outperform a non-sequential baseline in ANNs & humans.
- Models do better on a sequential curriculum; human observers show comparable improvement on both sequential & shuffled, but with substantial variability in the shuffled curriculum.
1) A sequential easy-to-hard curriculum
2) A shuffled curriculum with randomly interleaved easy & hard trials
3) A non-sequential baseline with only hard trials.
We tested generalization on a hard transfer condition.
1) A sequential easy-to-hard curriculum
2) A shuffled curriculum with randomly interleaved easy & hard trials
3) A non-sequential baseline with only hard trials.
We tested generalization on a hard transfer condition.