Jake Tuero
banner
tuero.ca
Jake Tuero
@tuero.ca
PhD Candidate @ University of Alberta
Reinforcement Learning and Policy Tree Search
C++ | Math | Caffiene | Video Games | Hockey | 🇨🇦

github.com/tuero
This enables our method to learn from failed attempts where we do not solve the problem outright, but solve some of the subgoals. The results show that this approach helps the system learn faster and more efficiently, without sacrificing the quality of the policy.
June 10, 2025 at 4:34 PM
This paper looks at a class of algorithms called policy tree search, which combines policies from reinforcement learning with traditional tree search. We show how one can decompose a problem into learnable subgoals, without any prior knowledge of the environment.
June 10, 2025 at 4:34 PM
Thanks for these lists! Is the grumpy list an inside joke or am I missing something 🤣
November 22, 2024 at 9:28 AM