https://larsvanderlaan.github.io
Calibrating nuisance estimates in DML protects against model misspecification and slow convergence.
Just one line of code is all it takes.
Calibrating nuisance estimates in DML protects against model misspecification and slow convergence.
Just one line of code is all it takes.
My talk will be on Automatic Double Reinforcement Learning and long term causal inference!
I’ll discuss Markov decision processes, Q-functions, and a new form of calibration for RL!
My talk will be on Automatic Double Reinforcement Learning and long term causal inference!
I’ll discuss Markov decision processes, Q-functions, and a new form of calibration for RL!