External lecturer at Johannes Kepler University Linz, Austria
Drug Discovery | Deep Learning | RL
http://www.arjonamedina.com
I don't see the credit assignment mechanism from future rewards to current actions in this formulation, which is the key factor in RL.
I don't see the credit assignment mechanism from future rewards to current actions in this formulation, which is the key factor in RL.