Simple regret for infinitely many armed bandits
Published on Dec 05, 20152056 Views
We consider a stochastic bandit problem with infinitely many arms. In this setting, the learner has no chance of trying all the arms even once and has to dedicate its limited number of samples only to