Cheap Bandits

Published on 2015-12-051471 Views

Manjesh Kumar Hanawal

We consider stochastic sequential learning problems where the learner can observe the average reward of several actions. Such a setting is interesting in many applications involving monitoring and sur

ICML 2015 - Lille

Related categories

Cheap Bandits

Manjesh Kumar Hanawal

ICML 2015 - Lille

Related categories

Presentation