Qualitative Multi-Armed Bandits: A Quantile-Based Approach

Published on Dec 05, 2015 · 1414 Views

We formalize and study the multi-armed bandit (MAB) problem in a generalized stochastic setting in which rewards are not assumed to be numerical. Instead, rewards are measured on a qualitative scale.
