A Semi-parametric Statistical Approach to Model-free Policy Evaluation

Published on 2008-08-12, 3000 Views

Tsuyoshi Ueno

Reinforcement learning (RL) methods based on least-squares temporal difference (LSTD) have been developed recently and have shown good practical performance. However, the quality of their estimation h

Reinforcement Learning
