Fast rates for the multi-armed bandit

Published on 2014-10-06 · 2253 views

Sébastien Bubeck

Since the seminal work of Lai and Robbins (1985) we know bandit strategies with normalized regret of order (i) 1/sqrt(T) for any stochastic bandit, and (ii) log(T) / T for 'benign' distributions. In B

NIPS Workshops 2013 - Lake Tahoe
