It addresses the exploration challenge when the agent can only interact with a perturbed MDP, i.e., robust MDP
Full paper: openreview.net/pdf?id=pJdMO...
It addresses the exploration challenge when the agent can only interact with a perturbed MDP, i.e., robust MDP
Full paper: openreview.net/pdf?id=pJdMO...
awards.acm.org/about/2024-t...
awards.acm.org/about/2024-t...