JaxGCRL
Reinforcement Learning / Continual Learning / Neural Networks Plasticity
Most RL methods’ performance saturates at ~5 layers. In this work led by Kevin Wang, we crack the right configuration for scaling Contrastive RL and go beyond 1000-layer NNs! Deep NNs unlock emergent behaviors and other cool properties. Check out Kevin’s thread!
Webpage+Paper+Code: wang-kevin3290.github.io/scaling-crl/
Apparently, you achieve 🚨state-of-the-art🚨 model merging results! 🔥
✨ Introducing “No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces”
In practice, RL agents often struggle to generalize to new long-horizon behaviors.
Our new paper studies *horizon generalization*, the degree to which RL algorithms generalize to reaching distant goals. 1/
We're working on a frictionless experience so users can discover, install, and run their first experiment in under 10 minutes.
Want to join the team? We're looking for contributors to make JaxGCRL the go-to GCRL repository! 🚀
github.com/MichalBortki...
This will be the official account of the Eastern European Machine Learning (EEML) community.
Follow us for news regarding our summer schools, workshops, education/community initiatives, and more!
Less than $450 and fully open-source 🤯
by @huggingface, @therobotstudio, @NepYope
This tendon-driven technology will disrupt robotics! Retweet to accelerate its democratization 🚀
A thread 🧵
📍 Poster #6302
📅 West Ballroom A-D
🕚 Friday, 11:00-14:00
Come chat with Michał Nauman and me. Let’s talk SOTA in RL! 💪
🧵👇