Amir Mesbah
@amirmesbah.bsky.social
Graduate Student - Interested in RL and its mathematics 👾

> https://amirhosein-mesbah.github.io/
I came across a couple of other definitions that might be helpful to mention (apologies if you’re already considering these).
The first is from Csaba Szepesvári’s RL theory lecture notes (Lecture 2, planning in MDPs), and the second is from Puterman’s MDP book (Chapter 1).
August 4, 2025 at 9:45 AM