Trust Region Policy Optimization
Published on Dec 05, 2015
2785 Views
In this article, we describe a method for optimizing control policies, with guaranteed monotonic improvement. By making several approximations to the theoretically-justified scheme, we develop a pract