
Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation
Published on 2009-09-177463 Views
Sutton, Szepesvari and Maei (2009) recently introduced the first temporal-difference learning algorithm compatible with both linear function approximation and off-policy training, and whose complexity