Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation
Published on Sep 17, 2009
Sutton, Szepesvari and Maei (2009) recently introduced the first temporal-difference learning algorithm compatible with both linear function approximation and off-policy training, and whose complexity