An Analysis of Linear Models, Linear Value-Function Approximation, and Feature Selection for Reinforcement Learning
author:
Ronald Parr,
Department of Computer Science, Duke University
Description
We show that linear value function approximation is equivalent to a form of linear model approximation. We derive a relationship between the model approximation error and the Bellman error, and show how this relationship can guide feature selection for model improvement and/or value function improvement. We also show how these results give insight into the behavior of existing feature-selection algorithms.
You might be experiencing some problems with Your Video player.
| Slides | |
| 0:00 | An Analysis of Linear Models, Linear Value-Function Approximation, and Feature Selection for Reinforcement Learning |
| 0:10 | A Walk Through Our Paper (1) |
| 1:20 | A Walk Through Our Paper (2) |
| 3:03 | Outline |
| 3:14 | Basic Terminology |
| 3:39 | Linear Value Function Approximation |
| 4:12 | Bellman Operator |
| 4:43 | Linear Fixed Point |
| 6:02 | Outline |
| 6:08 | Linear Model Approximation |
| 7:18 | Value Function of the Linear Model |
| 8:20 | Linear Model, Linear Fixed Point Equivalence (1) |
| 8:56 | Linear Model, Linear Fixed Point Equivalence (2) |
| 9:12 | Outline |
| 9:20 | Model Error |
| 9:53 | Bellman Error |
| 10:19 | Outline |
| 10:28 | Insights into Feature Selection I |
| 10:56 | Insights into Feature Selection II |
| 12:06 | Achieving Zero Feature Error (DF = 0) |
| 13:13 | Insight into Adding Features |
| 15:09 | Insight into Proto Value Functions |
| 16:05 | Outline |
| 16:09 | Experimental Results |
| 17:35 | Chain Results |
| 20:28 | Conclusions From Experiments |
| 21:21 | Ground Covered |
| 21:43 | Thank you! |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
Visitors who watched this lecture also watched...
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





