All posts are my own.
go.bsky.app/21nFz12
@ericeaton.bsky.social @mkearnsphilly.bsky.social @aaroth.bsky.social @sikatasengupta.bsky.social @optimistsinc.bsky.social
📖 Replicable Reinforcement Learning with Linear Function Approximation
🔗 arxiv.org/abs/2509.08660
In this paper, we study formal replicability in RL with linear function approximation. The... (1/6)
😮 Want to have stable on-policy RL without filling your GPU with an enormous replay buffer? 😮
🤖 Are you a roboticist and just want your RL code to run? 🤖
🎉 Fear not, we have started adding new REPPO versions! 🎉
github.com/cvoelcker/rs...
prontotriage.com
Also recently featured by Meta:
ai.meta.com/blog/upenn-d...
(1/5)
P.S. Quan is currently looking for PhD positions, so keep an eye out for him!