
0.25
0.5
0.75
1.25
1.5
1.75
2
On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient
Published on 2011-03-253587 Views
Likelihood ratio policy gradient methods have been some of the most successful reinforcement learning algorithms, especially for learning on physical systems. We describe how the likelihood ratio poli