event thumbnail image
Uncertainty in Artificial Intelligence (UAI 2008)

Hierarchical POMDP Controller Optimization by Likelihood Maximization

author: Marc Toussaint, TU Berlin, TU Berlin

Description

Planning can often be simplified by decomposing the task into smaller tasks arranged hierarchically. Charlin et al. recently showed that the hierarchy discovery problem can be framed as a non-convex optimization problem. However, the inherent computational difficulty of solving such an optimization problem makes it hard to scale to real world problems. In another line of research, Toussaint et al. developed a method to solve planning problems by maximum likelihood estimation. In this paper, we show how the hierarchy discovery problem in partially observable domains can be tackled using a similar maximum likelihood approach. Our technique first transforms the problem into a dynamic Bayesian network through which a hierarchical structure can naturally be discovered while optimizing the policy. Experimental results demonstrate that this approach scales better than previous techniques based on non-convex optimization.

You might be experiencing some problems with Your Video player.
Slides
0:00 Hierarchical POMDP Controller Optimization by Likelihood Maximization
0:59 POMDPs - 1
2:06 POMDPs - 2
2:22 Hierarchical FSCs - 1
4:36 Hierarchical FSCs - 2
4:56 Hierarchical FSCs - 3
5:28 Representing Hierarchies in DBNs - 1
7:13 Representing Hierarchies in DBNs - 2
8:33 Representing Hierarchies in DBNs - 3
8:43 Outline - Expectation-Maximization for Controller Optimization
8:45 Expectation-Maximization for Controller Optimization - 1
9:21 Expectation-Maximization for Controller Optimization - 2
9:55 Expectation-Maximization for Controller Optimization - 3
11:31 Expectation-Maximization for Controller Optimization - 4
11:56 Expectation-Maximization for Controller Optimization - 5
12:10 Expectation-Maximization for Controller Optimization - 6
13:13 Inference in the Mixture of DBNs - 1
14:23 Inference in the Mixture of DBNs - 2
14:47 Inference in the Mixture of DBNs - 3
15:00 Inference in the Mixture of DBNs - 4
15:03 Inference in the Mixture of DBNs - 3
15:06 Inference in the Mixture of DBNs - 4
15:07 Inference in the Mixture of DBNs - 5
15:08 Inference in the Mixture of DBNs - 6
15:09 Inference in the Mixture of DBNs - 7
15:23 Outline - Results
15:25 Results - 1
17:20 Results - 2
19:09 Chain-of-Chains
20:43 Conclusions - 1
21:04 Conclusions - 2
21:26 Conclusions - 3
21:43 - Questions
23:39 - Questions

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: