event thumbnail image
Sessions

The Adaptive k-Meteorologists Problem and Its Application to Structure Learning and Feature Selection in Reinforcement Learning

author: Carlos Diuk, Department of Computer Science, Rutgers, The State University of New Jersey

Description

The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an algorithm, known as the Adaptive k-Meteorologists Algorithm, analyze its sample complexity upper bound, and give a matching lower bound. Second, this algorithm is used to create a new reinforcement learning algorithm for factoredstate problems that enjoys significant improvement over the previous state-of-the-art algorithm. Finally, we apply the Adaptive k-Meteorologists Algorithm to remove a limiting assumption in an existing reinforcement-learning algorithm. The effectiveness of our approaches are demonstrated empirically in a couple benchmark domains as well as a robotics navigation problem.

You might be experiencing some problems with Your Video player.
Slides
0:00 The Adaptive k-Meteorologists Problem and Applications to Reinforcement Learning
0:05 Let’s start with a problem we all have…
0:49 Our paper
1:30 The k-Meteorologists metaphor
1:54 The k-Meteorologists
2:30 Probabilistic concepts
3:22 KWIK Definition
4:50 The Adaptive k-Meteorologist
5:58 Analysis
6:46 Application I: Structure learning
7:03 Application I
7:54 Sample Domains
8:30 Stocks results
9:30 SysAdmin results
10:30 Application II
11:06 Artificial Dynamics
11:42 Features tested
12:46 Results
13:38 Conclusions
14:18 Current/Future work
15:10 The Adaptive k-Meteorologists Problem and Applications to Reinforcement Learning

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: