Adrien Bolland
@adrienbolland.bsky.social
Researcher in RL at the University of Liège
Check our work on max entropy RL! We introduce an off-policy method to maximize the entropy of the future state-action visitation distribution, leading to policies that explore effectively and achieve high performance 🎯
Link 📑 arxiv.org/abs/2412.06655
#RL #MaxEntRL #Exploration
Link 📑 arxiv.org/abs/2412.06655
#RL #MaxEntRL #Exploration
Off-Policy Maximum Entropy RL with Future State and Action Visitation Measures
We introduce a new maximum entropy reinforcement learning framework based on the distribution of states and actions visited by a policy. More precisely, an intrinsic reward function is added to the re...
arxiv.org
December 13, 2024 at 9:22 AM
Check our work on max entropy RL! We introduce an off-policy method to maximize the entropy of the future state-action visitation distribution, leading to policies that explore effectively and achieve high performance 🎯
Link 📑 arxiv.org/abs/2412.06655
#RL #MaxEntRL #Exploration
Link 📑 arxiv.org/abs/2412.06655
#RL #MaxEntRL #Exploration