Lipschitz Bandits: Regret Lower Bounds and Optimal Algorithms
Published on Jul 15, 20142320 Views
We consider stochastic multi-armed bandit problems where the expected reward is a Lipschitz function of the arm, and where the set of arms is either discrete or continuous. For discrete Lipschitz band