A Worst-Case Comparison Between Temporal Difference and Residual Gradient with Linear Function Approximation thumbnail
Pause
Mute
Subtitles not available
Playback speed
0.25
0.5
0.75
1
1.25
1.5
1.75
2
Full screen

A Worst-Case Comparison Between Temporal Difference and Residual Gradient with Linear Function Approximation

Published on Sep 04, 2019169 Views

Residual gradient (RG) was proposed as an alternative to TD(0) for policy evaluation when function approximation is used, but there exists little formal analysis comparing them except in very limited