A Worst-Case Comparison Between Temporal Difference and Residual Gradient with Linear Function Approximation
Published on Sep 04, 2019170 Views
Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them except in very limited