
Lipschitz Bandits: Regret Lower Bounds and Optimal Algorithms
Published on 2014-07-152324 Views
We consider stochastic multi-armed bandit problems where the expected reward is a Lipschitz function of the arm, and where the set of arms is either discrete or continuous. For discrete Lipschitz band