Information Complexity in Bandit Subset Selection

Published on 2013-08-092894 Views

Emilie Kaufmann

We consider the problem of efficiently exploring the arms of a stochastic bandit to identify the best subset. Under the PAC and the fixed-budget formulations, we derive improved bounds by using KL-div

COLT 2013 - Princeton

Related categories

Presentation

Information complexity in bandit subset selection00:00

Stochastic Bandit: One statistical model00:11

Stochastic Bandit: Two objectives - 100:54

Stochastic Bandit: Two objectives - 201:20

The Explore-m problem - 101:43

The Explore-m problem - 202:11

The Explore-m problem - 302:33

Open questions for Explore-m - 102:44

Open questions for Explore-m - 203:20

Our contributions - 103:46

Our contributions - 204:09

Our contributions - 304:21

KL-Racing: uniform sampling and eliminations04:33

KL-LUCB: adaptive sampling05:18

Sample Complexity bound for KL-LUCB05:45