The Fixed Points of Off-Policy TD

Published on 2012-09-062848 Views

J. Zico Kolter

Off-policy learning, the ability for an agent to learn about a policy other than the one it is following, is a key element of Reinforcement Learning, and in recent years there has been much work on de

Knowledge 4 All Foundation Video Journal Volume 2

Related categories

Reinforcement Learning

Presentation

The Fixed Points of Off-Policy TD00:00

Can be solved, in principle, by Temporal Difference learning00:32

This work is about fixing off-policy TD01:37

Guarantees on resulting solution quality02:47