Improved Regret Bounds for Undiscounted Continuous Reinforcement Learning
Published on Dec 05, 2015
We consider the problem of undiscounted reinforcement learning in continuous state space. Regret bounds in this setting usually hold under various assumptions on the structure of the reward and transi