The bottleneck is very different from more conventional regularization techniques: While L1 and L2 norm minimization shrinks the latent space, PLSM just makes transition dynamics more regular. Below is the learned latent space of 15x15 grid world.
December 10, 2024 at 3:06 PM
The bottleneck is very different from more conventional regularization techniques: While L1 and L2 norm minimization shrinks the latent space, PLSM just makes transition dynamics more regular. Below is the learned latent space of 15x15 grid world.
We add our bottleneck on existing model-based algorithms like RePo and TDMPC and see nice improvements in DMC and Distracting Control Suite. Since visual distractions have dynamics independent of the agent's actions, our bottleneck compresses them away, improving generalization.
December 10, 2024 at 3:06 PM
We add our bottleneck on existing model-based algorithms like RePo and TDMPC and see nice improvements in DMC and Distracting Control Suite. Since visual distractions have dynamics independent of the agent's actions, our bottleneck compresses them away, improving generalization.
How can we make our models learn these invariances? We introduce PLSM (Parsimonious Latent Space Model), a bottleneck method for predicting how an action changes the environment, while using as little information about the state as possible.
December 10, 2024 at 3:06 PM
How can we make our models learn these invariances? We introduce PLSM (Parsimonious Latent Space Model), a bottleneck method for predicting how an action changes the environment, while using as little information about the state as possible.
Our actions tend to change the state of the world in systematic and predictable ways. When driving a car, turning the wheel almost invariably changes the direction we’re moving in. It doesn't really matter what road we’re driving on, or what country we’re driving in.
December 10, 2024 at 3:06 PM
Our actions tend to change the state of the world in systematic and predictable ways. When driving a car, turning the wheel almost invariably changes the direction we’re moving in. It doesn't really matter what road we’re driving on, or what country we’re driving in.