Machine Learning for Robotics

author: Pieter Abbeel, Department of Electrical Engineering and Computer Sciences, UC Berkeley
published: Oct. 29, 2012,   recorded: September 2012,   views: 828
Categories
You might be experiencing some problems with Your Video player.

Slides

Slides
0:00 Machine Learning for Robotics
0:27 Outline - 1
0:55 Challenges in helicopter control
1:32 Many success stories in hover and forward flight regime
2:07 Example result - 1
2:27 Example result - 2
3:46 One of our first attempts at autonomous flips
5:13 Aggressive, non-stationary regimes
6:13 Stationary vs. aggressive flight
7:58 Learning to perform dynamic maneuvers: outline
8:00 Target trajectory
8:46 Expert demonstrations: Airshow
9:39 Learning Trajectory - 1
10:56 Learning Trajectory - 2
12:03 Results: Time-aligned demonstrations
12:39 Results: Loops
13:17 Learning to perform dynamic maneuvers: outline
13:20 Baseline dynamics model
14:22 Empirical evaluation of standard modeling approach
15:19 ecmlpkdd2012_abbeel_learning_roboti.jpg
15:35 Key observation - 1
15:58 Key observation - 2
16:54 Trajectory-specific local models
17:45 Learning to perform dynamic maneuvers: outline
17:48 Experimental Setup
18:56 Experimental procedure
23:14 Results: Autonomous airshow
25:56 Results: Flight accuracy
26:34 Thus far
27:29 Surgical knot tie - 1
27:36 Thus far
28:30 Surgical knot tie - 1
28:57 Surgical knot tie - 2
30:00 Surgical knot tie - 3
30:47 Generalizing Trajectories
31:17 Cartoon Problem Setting - 1
32:03 Cartoon Problem Setting - 2
32:39 Cartoon Problem Setting - 3
32:44 Cartoon Problem Setting - 4
33:04 Cartoon Problem Setting - 5
33:42 Learning f : R3 -> R3 from samples - 1
35:20 Learning f : R3 -> R3 from samples - 2
36:03 Experiments: Plate Pick-Up
36:58 Experiments: Scooping
37:07 Experiment: Knot-Tie
37:12 Autonomous tying of a knot for a previously unseen situation
38:21 Outline - 2
38:30 Problem Structure
39:41 Inverse RL History
40:26 Inverse RL Examples
40:34 Inverse RL Examples (ctd)
40:36 Quadruped
42:14 Experimental setup
42:56 Without learning
43:47 With learned reward function
44:35 Safe exploration
45:37 Safe exploration --- towards
46:36 Safe exploration – Key idea
47:40 Perception and clothes manipulation
49:26 Conclusion
50:14 Thank you

Related content

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.
 
    Delicious Bibliography

Description

Robots are typically far less capable in autonomous mode than in tele-operated mode. The few exceptions tend to stem from long days (and more often weeks, or even years) of expert engineering for a specific robot and its operating environment. Current control methodology is quite slow and labor intensive. I believe advances in machine learning have the potential to revolutionize robotics. In this talk, I will present new machine learning techniques we have developed that are tailored to robotics. I will describe in depth “Apprenticeship learning,” a new approach to high-performance robot control based on learning for control from ensembles of expert human demonstrations. Our initial work in apprenticeship learning has enabled the most advanced helicopter aerobatics to-date, including maneuvers such as chaos, tic-tocs, and auto-rotation landings which only exceptional expert human pilots can fly. Our most recent work in apprenticeship learning is providing traction on learning to perform challenging robotic manipulation tasks, such as knot-tying. I will also briefly highlight three other machine learning for robotics developments: Inverse reinforcement learning and its application to quadruped locomotion, Safe exploration in reinforcement learning which enables robots to learn on their own, and Learning for perception with application to robotic laundry.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: