From Bandits to Experts: On the Value of Side-Observations

Published on 2012-09-062428 Views

Ohad Shamir

We consider an adversarial online learning setting where a decision maker can choose an action in every stage of the game. In addition to observing the reward of the chosen action, the decision maker

Knowledge 4 All Foundation Video Journal Volume 2

Related categories

On-line Learning

Presentation

From Bandits to Experts: On the Value of Side-Observations00:00

What We Do00:09

Why01:15

Results02:34