Evan Walters
@evanatyourservice.bsky.social
ML/RL enthusiast, second-order optimization, plasticity, environmentalist
Many second-order optimizers aim to whiten the gradient, which scales each direction in the gradient to unit length. But why is this useful?
December 11, 2024 at 6:47 PM
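To make "whitening" concrete, here is a minimal NumPy sketch, not how PSGD or any particular optimizer does it. It assumes we have a batch of synthetic gradient samples and whitens them with the inverse square root of their second-moment matrix, computed by a plain eigendecomposition. The `whiten` helper, the `mixing` matrix, and the sample sizes are all made up for illustration.

```python
import numpy as np

def whiten(grads, eps=1e-12):
    """Whiten the rows of `grads` (n_samples, dim) with the inverse
    square root of their second-moment matrix."""
    second_moment = grads.T @ grads / grads.shape[0]
    # eigh is appropriate because the second-moment matrix is symmetric
    eigvals, eigvecs = np.linalg.eigh(second_moment)
    inv_sqrt = eigvecs @ np.diag(1.0 / np.sqrt(eigvals + eps)) @ eigvecs.T
    return grads @ inv_sqrt

rng = np.random.default_rng(0)

# Synthetic "gradients": badly scaled and correlated across the two coordinates
mixing = np.array([[10.0, 0.0],
                   [3.0, 0.1]])
grads = rng.normal(size=(1000, 2)) @ mixing.T

white = whiten(grads)
white_second_moment = white.T @ white / white.shape[0]

print("before:\n", grads.T @ grads / grads.shape[0])
print("after:\n", white_second_moment)
```

After whitening, the second-moment matrix is approximately the identity, so no direction dominates just because its raw gradient scale happens to be larger.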
In a world of tuning, I wanted to see how PSGD kron would fare without any tuning whatsoever on some Atari RL. I plugged it into CleanRL PPO with the defaults and the same LR as Adam, and it did quite well. Check out some graphs! W&B report: api.wandb.ai/links/evanat...
December 11, 2024 at 4:19 PM