Lightnews — Scholar-powered news

Jake Tuero

@tuero.ca

This enables our method to learn from failed attempts where we do not solve the problem outright, but solve some of the subgoals. The results show that this approach helps the system learn faster and more efficiently, without sacrificing the quality of the policy.

June 10, 2025 at 4:34 PM

This paper looks at a class of algorithms called policy tree search, which combines policies from reinforcement learning with traditional tree search. We show how one can decompose a problem into learnable subgoals, without any prior knowledge of the environment.

June 10, 2025 at 4:34 PM

Thanks for these lists! Is the grumpy list an inside joke or am I missing something 🤣

November 22, 2024 at 9:28 AM
