But genuine question, how does it differ from,
Csaba's Algorithms for Reinforcement Learning, or Lattimore's Bandit Algorithms, Sutton & Barto, or even "Reinforcement Learning: Theory and Algorithms" by Agarwal et. al.? (well the latter is unfinished anyway...)
But genuine question, how does it differ from,
Csaba's Algorithms for Reinforcement Learning, or Lattimore's Bandit Algorithms, Sutton & Barto, or even "Reinforcement Learning: Theory and Algorithms" by Agarwal et. al.? (well the latter is unfinished anyway...)