Lipschitz Bandits: Regret Lower Bounds and Optimal Algorithms

Published on 2014-07-15 · 2336 Views

Stefan Magureanu

We consider stochastic multi-armed bandit problems where the expected reward is a Lipschitz function of the arm, and where the set of arms is either discrete or continuous. For discrete Lipschitz band

COLT 2014 - Barcelona
