event thumbnail image
Machine Learning Summer School 2006 - Taipei
Pascal

Reinforcement Learning Theory

author: John Langford, Yahoo Research, Yahoo!

Description

The tutorial is on several new pieces of Reinforcement learning theory developed in the last 7 years. This includes:
1. Sample based analysis of RL including E3 and sparse sampling.
2. Generalization based analysis of RL including conservative policy iteration and RL-to-Classification reductions.
For each of these forms of theory, we cover the basic results and cover the weaknesses and strengths of the approach in context.

You might be experiencing some problems with Your Video player.
Slides
0:05 Modern Reinforcement Learning Theory
0:22 Reinforcement Learning is Always Relevant
2:47 The answer to: \"Is this an RL problem?\" is always \"yes\"
6:22 Outline
7:08 What is a sample complexity guarantee?
12:04 The E3 guarantee
17:23 E3 Theorem Statement
29:03 The Known(h) MDP
31:10 The Known(h) MDP 01
32:07 The Known(h) MDP 02
32:17 The Known(h) MDP 03
33:05 The Unknown(h) MDP
36:01 Dynamic Program
41:46 E3(h) Explicit Explore or Exploit Algorithm
46:04 The proof uses (5!) MDPs
48:05 Proof Sketch:
55:48 R-Max(h) Modification
61:00 Delayed Q-learning
64:35 Outline
64:57 The Limits of Sample Complexity: A lower bound
65:36 Proof
68:43 Implications
69:35 Attempt 1: Factored - E3
73:57 Attempt 2: Metric - E3
79:21 Do we really want the guarantee these algorithms provide?

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

 Watch videos:   (click on thumbnail to launch)

Watch Part 1
Part 1 1:21:22
Flash video Slide Synchronization Windows Media video

!NOW PLAYING
Watch Part 2
Part 2 0:54:20
Flash video Slide Synchronization Windows Media video

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment: