Action-Gap Phenomenon in Reinforcement Learning

Published on 2012-09-063013 Views

Amir-massoud Farahmand

Many practitioners of reinforcement learning problems have observed that oftentimes the performance of the agent reaches very close to the optimal performance even though the estimated (action-)value

Knowledge 4 All Foundation Video Journal Volume 2

Related categories

Reinforcement Learning

Presentation

Action-Gap Phenomenon in Reinforcement Learning00:00

Easy choice!00:12

Not a big deal if we choose the wrong one!00:46

Finite-action discounted MDP with general state space.01:27