
Simple regret for infinitely many armed bandits
Published on 2015-12-05, 2059 Views
We consider a stochastic bandit problem with infinitely many arms. In this setting, the learner has no chance of trying all the arms even once and has to dedicate its limited number of samples only to