Bounded regret in stochastic multi-armed bandits

Published on 2013-08-095579 Views

Sébastien Bubeck

We study the stochastic multi-armed bandit problem when one knows the value μ(⋆) of an optimal arm, as a well as a positive lower bound on the smallest positive gap Δ. We propose a new randomized poli

COLT 2013 - Princeton

Related categories

Presentation

Bounded regret in stochastic multi-armed bandits - 100:00

Bounded regret in stochastic multi-armed bandits - 200:10

Bounded regret in stochastic multi-armed bandits - 300:21