Qualitative Multi-Armed Bandits: A Quantile-Based Approach
Published on Dec 05, 20151415 Views
We formalize and study the multi-armed bandit (MAB) problem in a generalized stochastic setting, in which rewards are not assumed to be numerical. Instead, rewards are measured on a qualitative scale