Lightnews — Scholar-powered news

Faisal Mohamed

@thefirstfaisal.bsky.social

12 followers 88 following 0 posts

MSc at Mila, Reinforcement learning, representation learning and probabilistic inference.

Posts Replies Media Videos

Reposted by Faisal Mohamed

Glen Berseth

@glenberseth.bsky.social

Training #deepRL agents has always been a tricky and unstable process. What is the cause of these instabilities? We study the coupling effects of policy training and value estimation and find a chain effect of the value and policy churn in popular DRL agents.

December 11, 2024 at 5:34 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news