Minimax Regret of Finite Partial-Monitoring Games in Stochastic Environments

Published on 2011-08-02

3826 Views

Gábor Bartók

In a partial monitoring game, the learner repeatedly chooses an action, the environment responds with an outcome, and then the learner suffers a loss and receives a feedback signal, both of which ar

COLT 2011 - Budapest

Presentation

Minimax Regret of Finite Partial-Monitoring Games in Stochastic Environments 00:00

Finite Stochastic Partial-Monitoring Games 00:06

Examples 01:27

Goal 03:16

Previous work 04:33

Our contribution 06:57

Main tools 1: using L 08:14

Main tools 2: using H 09:50

What makes a game easy? 12:36

Algorithm outline 13:54

Lower bound for hard games 15:45

Discussion 17:33

Thank you 21:03