TP3 Partial or Delayed Feedback
Author: Nicolò Cesa-Bianchi, Università degli Studi di Milano
| Time | Slide |
| 0:00 | Partial and Delayed Feedback |
| 0:33 | Motivation - 1 |
| 1:40 | Motivation - 2 |
| 1:59 | Motivation - 3 |
| 2:40 | Motivation - 4 |
| 3:12 | Motivation - 5 |
| 4:47 | Bandit problems |
| 6:12 | Some applications in PASCAL - 1 |
| 7:10 | Some applications in PASCAL - 2 |
| 7:38 | Lightweight reinforcement learning models - 1 |
| 8:25 | Lightweight reinforcement learning models - 2 |
| 9:29 | Lightweight reinforcement learning models - 3 |
| 10:28 | Game-theoretic models |
| 13:45 | Connection with Multicomponent Learning |
| 16:58 | Clustering |
| 17:56 | Preliminary list of events |
| 18:48 | Questions |