https://araffin.github.io/
link: www.tylervigen.com/spurious-cor...
found via @stefanjudis.com newsletter
link: www.tylervigen.com/spurious-cor...
found via @stefanjudis.com newsletter
by Kaizhe Hu et al. (ToddlerBot Stanford)
Project page: robot-trains-robot.github.io
by Kaizhe Hu et al. (ToddlerBot Stanford)
Project page: robot-trains-robot.github.io
Link: m.youtube.com/watch?v=m4Y5...
Link: m.youtube.com/watch?v=m4Y5...
(it's then written white on black)
(it's then written white on black)
- 10k+ stars on @github.com
- 9M+ downloads on @pypi.org
- 3300+ citations for the JMLR paper
- 1000 citations for the SB2 repository
I would especially like to thank the maintainers and our contributors!
- 10k+ stars on @github.com
- 9M+ downloads on @pypi.org
- 3300+ citations for the JMLR paper
- 1000 citations for the SB2 repository
I would especially like to thank the maintainers and our contributors!
Debugging why PPO SB3 (PyTorch) was working where PPO SBX (Jax) was failing.
Just one line of code different... 🙈 (the initialization of the actor network output)
Debugging why PPO SB3 (PyTorch) was working where PPO SBX (Jax) was failing.
Just one line of code different... 🙈 (the initialization of the actor network output)
Our implementation does not use any for loops, only NumPy vectorised operations at sampling time (so the data container stays the same).
Our implementation does not use any for loops, only NumPy vectorised operations at sampling time (so the data container stays the same).
hummat.github.io/bproc-pubvis/
hummat.github.io/bproc-pubvis/
Gist: gist.github.com/araffin/534f...
Btw, I'm looking for contributors to add dm control tasks to the RL Zoo.
Gist: gist.github.com/araffin/534f...
Btw, I'm looking for contributors to add dm control tasks to the RL Zoo.
"Ingredients for Learning Locomotion Directly on Real Hardware".
My colleagues from DLR will also present some robot demos.
Slides: araffin.github.io/talk/ingredi...
"Ingredients for Learning Locomotion Directly on Real Hardware".
My colleagues from DLR will also present some robot demos.
Slides: araffin.github.io/talk/ingredi...
When you throw an exception for an invalid value, give the user some feedback on what exactly that value was.
We tried to follow this principle in our gym env checker: github.com/DLR-RM/stabl...
When you throw an exception for an invalid value, give the user some feedback on what exactly that value was.
We tried to follow this principle in our gym env checker: github.com/DLR-RM/stabl...