Lightnews — Scholar-powered news

I came across a couple of other definitions that might be helpful to mention (apologies if you’re already considering these).
The first one is from Csaba Szepesvári’s RL theory lecture notes (lecture 2, planning in MDPs), and the second one is from Puterman's MDP book (chapter 1).

Definition of dynamic programming in RL, from Csaba Szepesvári’s RL theory lecture notes (Lecture 2, "Planning in MDPs")

Definition of dynamic programming, from Puterman’s Markov Decision Processes — chapter 1.

August 4, 2025 at 9:45 AM

Amir Mesbah

@amirmesbah.bsky.social

I wanted to send you the link just now but hopefully you have found it =)

March 18, 2025 at 9:08 PM

Amir Mesbah

@amirmesbah.bsky.social

Sure *_*
Looking forward to it :)

March 17, 2025 at 8:55 PM

Amir Mesbah

@amirmesbah.bsky.social

Not yet. Just the classical claim that they're trying to learn the distribution of the return =))
Do you have any insights?

March 17, 2025 at 6:37 PM

Amir Mesbah

@amirmesbah.bsky.social

I was reading about ways to enhance the performance of DQN on a real-world problem. One of the candidates was C51, but I haven't implemented it yet because of the computational cost. It was interesting to me because I hadn't read the papers before.

March 17, 2025 at 2:24 PM

Amir Mesbah

@amirmesbah.bsky.social

I didn't know until last week that using it with DQN can give a huge performance boost.

March 17, 2025 at 2:06 PM
