Regret Bounds for the Adaptive Control of Linear Quadratic Systems thumbnail
slide-image
Pause
Mute
Subtitles not available
Playback speed
0.25
0.5
0.75
1
1.25
1.5
1.75
2
Full screen

Regret Bounds for the Adaptive Control of Linear Quadratic Systems

Published on Aug 02, 20113799 Views

We study the average cost Linear Quadratic (LQ) problem with unknown model parameters, also known as the adaptive control problem in the control community. We design an algorithm and prove that its

Related categories

Chapter list

Regret Bounds for the Adaptive Control of Linear Quadratic Systems00:00
Outline - 101:01
Control problems01:24
Learning to control02:20
Measure of performance of the learner02:48
This talk: Linear Quadratic Regulation04:00
The goal and why should we care?05:30
Previous works06:20
Outline - 207:46
The main ideas of the algorithm07:47
Estimation08:11
Optimism principle09:01
Avoiding frequent changes11:19
Outline - 311:36
How to choose the confidence set?11:47
Construction of confidence sets12:47
The algorithm13:10
Proof sketch13:21
Regret decomposition13:51
Term R114:34
Term R314:51
Term R215:04
Change the policy only when the determinant of confidence ellipsoid doubles.15:54
The algorithm16:13
Theorem16:25
Conclusions16:39
References19:19