
0.25
0.5
0.75
1.25
1.5
1.75
2
On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient
Published on Feb 4, 20253586 Views
Likelihood ratio policy gradient methods have been some of the most successful reinforcement learning algorithms, especially for learning on physical systems. We describe how the likelihood ratio poli