The optimistic principle for online planning in Markov decision processes
Published on May 28, 20132643 Views
Given an initial state, what is the best possible action that can be returned by a planning algorithm that is given a finite numerical budget (e.g. number of calls to a model of the state-transition