Fast Gradient-Descent Methods for Temporal-Difference Learning with Linear Function Approximation

Published on 2009-09-17

Richard S. Sutton

Sutton, Szepesvari and Maei (2009) recently introduced the first temporal-difference learning algorithm compatible with both linear function approximation and off-policy training, and whose complexity
