The Adaptive k-Meteorologists Problem and Its Application to Structure Learning and Feature Selection in Reinforcement Learning
Description
The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an algorithm, known as the Adaptive k-Meteorologists Algorithm, analyze its sample complexity upper bound, and give a matching lower bound. Second, this algorithm is used to create a new reinforcement learning algorithm for factoredstate problems that enjoys significant improvement over the previous state-of-the-art algorithm. Finally, we apply the Adaptive k-Meteorologists Algorithm to remove a limiting assumption in an existing reinforcement-learning algorithm. The effectiveness of our approaches are demonstrated empirically in a couple benchmark domains as well as a robotics navigation problem.
| Slides | |
| 0:00 | The Adaptive k-Meteorologists Problem and Applications to Reinforcement Learning |
| 0:05 | Let’s start with a problem we all have… |
| 0:49 | Our paper |
| 1:30 | The k-Meteorologists metaphor |
| 1:54 | The k-Meteorologists |
| 2:30 | Probabilistic concepts |
| 3:22 | KWIK Definition |
| 4:50 | The Adaptive k-Meteorologist |
| 5:58 | Analysis |
| 6:46 | Application I: Structure learning |
| 7:03 | Application I |
| 7:54 | Sample Domains |
| 8:30 | Stocks results |
| 9:30 | SysAdmin results |
| 10:30 | Application II |
| 11:06 | Artificial Dynamics |
| 11:42 | Features tested |
| 12:46 | Results |
| 13:38 | Conclusions |
| 14:18 | Current/Future work |
| 15:10 | The Adaptive k-Meteorologists Problem and Applications to Reinforcement Learning |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !




