First-order regret bounds for combinatorial semi-bandits

Published on 2015-08-201722 Views

Gergely Neu

We consider the problem of online combinatorial optimization under semi-bandit feedback, where a learner has to repeatedly pick actions from a combinatorial decision set in order to minimize the total

COLT 2015 - Paris

Related categories

Presentation

First-order regret bounds for combinatorial semi-bandits00:00

Combinatorial semi-bandits - 100:10

Combinatorial semi-bandits - 200:21

Combinatorial semi-bandits - 300:47

Regret - 100:55

Regret - 201:20

First-order bounds - 101:24

First-order bounds - 202:19

This paper02:43

Thanks!04:04