event thumbnail image
Workshops

Manifold Embeddings for Model-Based Reinforcement Learning of Neurostimulation Policies

author: Keith Bush, School of Computer Science, McGill University

Description

Real-world reinforcement learning problems often exhibit nonlinear, continuous-valued, noisy, partially-observable state-spaces that are prohibitively expensive to explore. The formal reinforcement learning framework, unfortunately, has not been successfully demonstrated in a real-world domain having all of these constraints. We approach this domain with a two-part solution. First, we overcome continuous-valued, partially observable state-spaces by constructing manifold embeddings of the system’s underlying dynamics, which substitute as a complete state-space representation. We then define a generative model over this manifold to learn a policy off-line. The model-based approach is preferred because it enables simplification of the learning problem by domain knowledge. In this work we formally integrate manifold embeddings into the reinforcement learning framework, summarize a spectral method for estimating embedding parameters, and demonstrate the model-based approach in a complex domain-adaptive seizure suppression of an epileptic neural system.

You might be experiencing some problems with Your Video player.
Slides
0:00 Manifold Embeddings for Model-Based Reinforcement Learning of Neurostimulation Policies
0:05 Motivation - 1
0:48 Motivation - 2
1:34 Idea - 1
2:11 Idea - 2
2:14 Summary of Approach: Step 1
3:15 Summary of Approach: Step 2
5:03 Summary of Approach: Step 3
5:41 Summary of Approach: Step 4
6:24 Summary of Approach: Step 5
7:08 Model Validation and Learning
8:30 Unmasking Neurostimulation Policies
9:52 Discussion
10:46 Future Work - 1
12:19 Future Work - 2
12:48 Questions?

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: