A Relative Exponential Weighing Algorithm for Adversarial Utility-based Dueling Bandits

Published on 2015-09-272316 Views

Pratik Gajane

We study the K-armed dueling bandit problem which is a variation of the classical Multi-Armed Bandit (MAB) problem in which the learner receives only relative feedback about the selected pairs of arms

ICML 2015 - Lille

Related categories

A Relative Exponential Weighing Algorithm for Adversarial Utility-based Dueling Bandits

Pratik Gajane

ICML 2015 - Lille

Related categories

Presentation