Reinforcement Learning Theory

author: John Langford, Yahoo! Research
published: Feb. 25, 2007, recorded: July 2006, views: 908



Videos:
Part 1 (1:21:22)
Part 2 (0:54:20)

Description

This tutorial covers several pieces of reinforcement learning (RL) theory developed in the last 7 years, including:
1. Sample-based analysis of RL, including E3 and sparse sampling.
2. Generalization-based analysis of RL, including conservative policy iteration and RL-to-classification reductions.
For each of these forms of theory, we present the basic results and discuss the strengths and weaknesses of the approach in context.
