
Qualitative Multi-Armed Bandits: A Quantile-Based Approach
Published on 2015-12-05, 1418 views
We formalize and study the multi-armed bandit (MAB) problem in a generalized stochastic setting, in which rewards are not assumed to be numerical. Instead, rewards are measured on a qualitative scale