
Action-Gap Phenomenon in Reinforcement Learning
Published on 2012-09-06
3001 Views
Many reinforcement learning practitioners have observed that the agent's performance often comes very close to optimal even though the estimated (action-)value…
Presentation
00:00 Action-Gap Phenomenon in Reinforcement Learning
00:12 Easy choice!
00:46 Not a big deal if we choose the wrong one!
01:27 Finite-action discounted MDP with general state space.