Learning to Control an Octopus Arm with Gaussian Process Temporal Difference Methods
author:Yaakov Engel,
University of Alberta
published: Feb. 25, 2007, recorded: June 2006, views: 176
published: Feb. 25, 2007, recorded: June 2006, views: 176
You might be experiencing some problems with Your Video player.
See Also:
Launch in a standalone WM Player
Switch to Windows Media Player
Download slides:
gpip06_engel_lcoag_01.pdf (456.2 KB)
Related content
Visitors who watched this lecture also watched...
14:15
323 views - Yaakov Engel, 2007
57:45
439 views - Yaakov Engel, 2007
01:00:47
15129 views - David MacKay, 2006
05:47:38
1140 views - Csaba Szepesvari, 2008
04:59:19
21468 views - Sam Roweis, 2006
24:32
94 views - Hengshuai Yao, 2008
33:48
426 views - Brian Ferris, 2006
04:22:56
1559 views - Carl Edward Rasmussen, 2003
22:56
80 views - 2006
23:11
102 views - J. Zico Kolter, 2009
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity:
You need to login to cast your vote.
We are currently conducting a short survey. We value your feedback, and would appreciate if you took a few moments to respond to some questions. Click here to take the survey.
We are currently conducting a short survey. We value your feedback, and would appreciate if you took a few moments to respond to some questions. Click here to take the survey.
Description
The Octopus arm is a highly versatile and complex limb. How the Octopus controls such a hyper-redundant arm (not to mention eight of them!) is as yet unknown. Robotic arms based on the same mechanical principles may render present day robotic arms obsolete. In this talk, I will describe how we tackle this problem using an online reinforcement learning algorithm, based on a Bayesian approach to policy evaluation known as Gaussian process temporal difference (GPTD) learning.
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !



Write your own review or comment: