Vision-Based Control, Control-Based Vision, and the Information Knot That Ties Them

author: Stefano Soatto, Computer Science Department, University of California, Los Angeles, UCLA
published: Jan. 12, 2011,   recorded: December 2010,   views: 888
Categories
You might be experiencing some problems with Your Video player.

Slides

Slides
0:00 Perception, action and the information knot that ties them
0:39 - Control? Vision? Learning? Nips? (missing slide)
3:43 - Outline (missing slide)
4:05 part I why?
4:28 How does a radio work?
7:56 - How does the brain "treat the data"? (missing slide)
9:15 Is data analysis necessary for intelligent behavior?
10:55 Why ?
12:09 The epistemological gap
13:00 Signal-symbol barrier
13:55 Intelligence
14:03 Many tasks
14:43 Plants?
15:07 - Motivation (missing slide)
15:55 - background (missing slide)
16:39 Groups
17:48 Orbits equivalence classes base/quotient
19:26 E.g. shape space
21:04 - mod-out/canonization (missing slide)
22:52 Mod-out/canonization - Graph
23:44 Singular perturbations
24:17 Singular perturbations - SEM
24:55 Infinite-dim space, finite-dim group
26:23 Infinite-dim space, infinite-dim group?
26:53 Semi-orbits (1)
27:34 Semi-orbits example
27:47 Semi-orbits (2)
28:12 Marginalization, max-out, canonization
29:57 Basic diff. topology
31:19 - sampling, sparsity (missing slide)
32:19 - Motivation (missing slide)
32:28 Shannon’s “equivocation”, “reproduction” shannon’s “equivocation” (graph)
33:04 Gibson’s information
37:24 Is a “gibsonian information theory” viable? (take I)
38:18 Is a “gibsonian information theory” viable? (take II)
40:15 Is a “gibsonian information theory” viable? (take III)
41:08 “The set of images modulo viewpoint and contrast changes”
42:04 The ART
42:44 Is a “gibsonian information theory” viable? (take IV)
45:00 Some notation (1)
47:31 Some notation (2)
48:07 Some notation (3)
48:57 Example of change
49:12 Some definitions
50:52 Representation and hallucination
52:23 Representation
53:04 Information gap

Related content

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.
 
    Delicious Bibliography

 Watch videos:   (click on thumbnail to launch)

Watch Part 1
Part 1 54:56
!NOW PLAYING
Watch Part 2
Part 2 1:02:44

Description

The purpose of this tutorial is to explore the interplay between sensing and control, to highlight the "information knot" that ties them, and to design inference and learning algorithms to compute "representations" from data that are optimal, by design, for decision and control tasks. We will focus on visual sensing, but the analysis developed extends to other modalities.

We will first review various notions of information proposed in different fields from economic theory to perception psychology, and adapt them to decision and control tasks, as opposed to transmission and storage of data. We will see that for complex sensing phenomena, such as vision, nuisance factors play an important role, especially those that are not "invertible" such as occlusions of line-of-sight and quantization-scale. Handling of the nuisances brings forward a notion of "representation," whose complexity measures the amount of "actionable information" contained in the data. We will discuss how to build representations that are optimal by design, in the sense of retaining all and only the statistics that matter to the task. For "invertible" nuisances, such representations can be made lossless (not in the classical sense of distortion, but in the sense of optimal performance in a decision or control task). In some cases, these representations are supported on a thin-set, which can help elucidate the "signal-to-symbol barrier" problem, and relate to a topology-based notion of "sparsity". However, non-invertible nuisances spoil the picture, requiring the introduction of a notion of "stability" of the representation with respect to non-invertible nuisances. This is not the classical notion of (bounded-input-bounded-output) stability from control theory, but instead relates to "structural stability" from catastrophe theory. The design of maximally stable statistics brings forward a notion of "proper sampling" of the data. However, this is not the traditional notion of proper sampling from Nyquist, but one related to persistent topology. Once an optimal representation is constructed, a bound on the risk or control functional can be derived, analog to distortion in communications. The "currency" that trades off this error (the equivalent of the bit-rate in communication) is not the amount of data, but instead the "control authority" over the sensing process. Thus, sensing and control are intimately tied: Actionable information drives the control process, and control of the sensing process is what allows computing a representation.

We will present case studies in which formulating visual decision problems (e.g. detection, localization, recognition, categorization) in the context of vision-based control leads to improved performance and reduced computational burden. They include established low-level vision tools (e.g. tracking, local invariant descriptors), robotic exploration, and action and activity recognition. We will describe some of these in detail and distribute source code at the workshop, together with course notes.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: