Preconditioned Temporal Difference Learning
Published on Aug 12, 20083275 Views
This paper extends many of the recent popular reinforcement learning (RL) algorithms to a generalized framework that includes least-squares temporal difference (LSTD) learning, least-squares policy ev