Non-Parametric Policy Gradients: A Unified Treatment of Propositional and Relational Domains

Published on 2008-08-043083 Views

Kurt Driessens

Policy gradient approaches are a powerful instrument for learning how to interact with the environment.Existing approaches have focused on propositional and continuous domains only. Without extensive

Non-Parametric Policy Gradients: A Unified Treatment of Propositional and Relational Domains

Kurt Driessens

Reinforcement Learning

Presentation