Off-policy Model-based Learning under Unknown Factored Dynamics

Published on 2015-09-271863 Views

Assaf Hallak

Off-policy learning in dynamic decision problems is essential for providing strong evidence that a new policy is better than the one in use. But how can we prove superiority without testing the new po

ICML 2015 - Lille

Related categories

Off-policy Model-based Learning under Unknown Factored Dynamics

Assaf Hallak

ICML 2015 - Lille

Related categories

Presentation