We realize that even 10 demos becomes a burden when you scale up to 100, 500, 1000... environments. After an initial training phase of the generalist policy, we leverage it to provide its own demos to bootstrap RL and master new environments! (4/N)
We realize that even 10 demos becomes a burden when you scale up to 100, 500, 1000... environments. After an initial training phase of the generalist policy, we leverage it to provide its own demos to bootstrap RL and master new environments! (4/N)
Checkout our latest work - Robot Learning with Super Linear Scaling :)
casher-robot-learning.github.io/CASHER/
(1/N)
Checkout our latest work - Robot Learning with Super Linear Scaling :)
casher-robot-learning.github.io/CASHER/
(1/N)