Pan Xu
pan-xu.bsky.social
Pan Xu
@pan-xu.bsky.social
Assistant Professor @DukeU | Previously @Caltech @UCLA | Machine Learning | Views are my own | he/him/his. 🌈
homepage: https://panxulab.github.io/
My amazing student @zhishuai_liu will be presenting this paper at #ICML2025 on Tue 15 Jul from 4:30 p.m. PDT — 7 p.m. PDT in West Exhibition Hall B2-B3 #W-715.

While I won't attend ICML this time, I encourage you to chat with Zhishuai if you're interested in robust MDPs!
July 15, 2025 at 1:28 PM
We propose a computationally efficient algorithm that achieves sublinear regret in online robust MDPs when the transition dynamics falls in f-divergence based uncertainty sets, and also establish matching regret lower bounds.
July 15, 2025 at 1:28 PM
We identify a unique information deficit issue inherent to this setting. We show that when the supremal visitation ratio, a quantity that measures the mismatch between the training dynamics and the deployment dynamics, is unbounded, online learning becomes exponentially hard.
July 15, 2025 at 1:28 PM