Reinforcement Learning Theory

Published on 2007-02-2513392 Views

John Langford

The tutorial is on several new pieces of Reinforcement learning theory developed in the last 7 years. This includes:\\ 1. Sample based analysis of RL including E3 and sparse sampling.\\ 2. Generalizat

MLSS 2006 - Taipei

Related categories

Reinforcement Learning

Presentation

Modern Reinforcement Learning Theory00:05

Reinforcement Learning is Always Relevant00:22

The answer to: \"Is this an RL problem?\" is always \"yes\"02:47

Outline06:22

What is a sample complexity guarantee?07:08

The E3 guarantee12:04

E3 Theorem Statement17:23

The Known(h) MDP29:03

The Known(h) MDP 0131:10

The Known(h) MDP 0232:07

The Known(h) MDP 0332:17

The Unknown(h) MDP 33:05

Dynamic Program36:01

E3(h) Explicit Explore or Exploit Algorithm41:46

The proof uses (5!) MDPs46:04

Proof Sketch:48:05

R-Max(h) Modification55:48

Delayed Q-learning01:01:00

The Limits of Sample Complexity: A lower bound01:04:57

Proof01:05:36

Implications01:08:43

Attempt 1: Factored - E301:09:35

Attempt 2: Metric - E301:13:57

Do we really want the guarantee these algorithms provide?01:19:21