
en-fr
en-de
en-pt
en-sl
en
en-zh
en-es
0.25
0.5
0.75
1.25
1.5
1.75
2
Non-Parametric Policy Gradients: A Unified Treatment of Propositional and Relational Domains
Published on Feb 4, 20253077 Views
Policy gradient approaches are a powerful instrument for learning how to interact with the environment.Existing approaches have focused on propositional and continuous domains only. Without extensive