Manifold Embeddings for Model-Based Reinforcement Learning of Neurostimulation Policies
Description
Real-world reinforcement learning problems often exhibit nonlinear, continuous-valued,
noisy, partially-observable state-spaces that are prohibitively expensive to explore. The
formal reinforcement learning framework, unfortunately, has not been successfully demonstrated in a real-world domain having all of these constraints. We approach this domain
with a two-part solution. First, we overcome continuous-valued, partially observable state-spaces by constructing manifold embeddings of the system’s underlying dynamics, which
substitute as a complete state-space representation. We then define a generative model
over this manifold to learn a policy off-line. The model-based approach is preferred because it enables simplification of the learning problem by domain knowledge. In this work we formally integrate manifold embeddings into the reinforcement learning framework, summarize a spectral method for estimating embedding parameters, and demonstrate the model-based approach in a complex domain-adaptive seizure suppression of an epileptic neural system.
| Slides | |
| 0:00 | Manifold Embeddings for Model-Based Reinforcement Learning of Neurostimulation Policies |
| 0:05 | Motivation - 1 |
| 0:48 | Motivation - 2 |
| 1:34 | Idea - 1 |
| 2:11 | Idea - 2 |
| 2:14 | Summary of Approach: Step 1 |
| 3:15 | Summary of Approach: Step 2 |
| 5:03 | Summary of Approach: Step 3 |
| 5:41 | Summary of Approach: Step 4 |
| 6:24 | Summary of Approach: Step 5 |
| 7:08 | Model Validation and Learning |
| 8:30 | Unmasking Neurostimulation Policies |
| 9:52 | Discussion |
| 10:46 | Future Work - 1 |
| 12:19 | Future Work - 2 |
| 12:48 | Questions? |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !



