On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient

Published on 2011-03-253598 Views

Jie Tang

Likelihood ratio policy gradient methods have been some of the most successful reinforcement learning algorithms, especially for learning on physical systems. We describe how the likelihood ratio poli

Knowledge 4 All Foundation Video Journal Volume 1

Related categories

Reinforcement Learning

On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient

Jie Tang

Knowledge 4 All Foundation Video Journal Volume 1

Related categories

Presentation