Monotone multi-armed bandit allocations
published: Aug. 2, 2011, recorded: July 2011, views: 3297
Report a problem or upload filesIf you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
We present a novel angle for multi-armed bandits (henceforth abbreviated MAB) which follows from the recent work on MAB mechanisms (Devanur and Kakade, 2009, Babaio et al., 2009, 2010). The new problem is, essentially, about designing MAB algorithms under an additional constraint motivated by their application to MAB mechanisms. This note is self-contained, although some familiarity with MAB is assumed; we refer the reader to Cesa-Bianchi and Lugosi (2006) for more background.
Link this pageWould you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !