Regularization and Feature Selection in Least Squares Temporal-Difference Learning

Published on Sep 17, 2009; 4043 views

We consider the task of reinforcement learning with linear value function approximation. Temporal difference algorithms, and in particular the Least-Squares Temporal Difference (LSTD) algorithm, provi
