LSTD with Random Projections
Published on Mar 25, 20113133 Views
We consider the problem of reinforcement learning in high-dimensional spaces when the number of features is bigger than the number of samples. In particular, we study the least-squares temporal differ