Probabilistic models for understanding images
Description
Getting a computer to understand an image is challenging due to the numerous sources of variability that influence the imaging process. The pixels of a typical photograph will depend on the scene type and geometry, the number, shape and appearance of objects present in the scene, their 3D positions and orientations, as well as effects such as occlusion, shading and shadows. The good news is that research into physics and computer graphics has given us a detailed understanding of how these variables affect the resulting image. This understanding can help us to build the right prior knowledge into our probabilistic models of images. In theory, building a model containing all of this knowledge would solve the image understanding problem. In practice, such a model would be intractable for current inference methods. The open challenge for machine learning and machine vision researchers is to create a model which captures the imaging process as accurately as possible, whilst remaining tractable for accurate inference. To illustrate this challenge, I will show how different aspects of the imaging process can be incorporated into models for object detection and segmentation, and discuss techniques for making inference tractable in such models. Disclaimer: Videolectures.Net emphasises that the quality of this video can not be improved,
because of low light quality conditions provided in the lecture auditorium.
| Slides | |
| 0:00 | Probabilistic models for understanding images |
| 0:48 | Roadmap |
| 2:27 | Why understanding images is hard |
| 2:57 | Sources of image variability (1) |
| 3:39 | Sources of image variability (2) |
| 3:57 | Sources of image variability (3) |
| 4:15 | Sources of image variability (4) |
| 5:10 | Sources of image variability (5) |
| 5:23 | Sources of image variability (6) |
| 5:51 | Sources of image variability (7) |
| 5:59 | Sources of image variability (8) |
| 6:13 | Sources of image variability (9) |
| 7:04 | Roadmap |
| 7:13 | Modelling image variability |
| 9:22 | Cues for object recognition (1) |
| 10:57 | Cues for object recognition (2) |
| 11:42 | Cues for object recognition (3) |
| 12:03 | Cues for object recognition (4) |
| 12:30 | Cues for object recognition (5) |
| 12:31 | Generative model of object shape (1) |
| 12:56 | Generative model of object shape (2) |
| 14:02 | LOCUS model |
| 16:31 | Inference using Variational EM (1) |
| 17:40 | Inference using Variational EM (2) |
| 18:16 | Inference using Variational EM (3) |
| 19:04 | Inference using Variational EM (4) |
| 19:57 | Inference using Variational EM (5) |
| 19:58 | Inference using Variational EM (6) |
| 20:00 | Inference using Variational EM (7) |
| 20:02 | Inference using Variational EM (8) |
| 21:00 | Results: Weizmann horse dataset (1) |
| 21:20 | Results: Weizmann horse dataset (2) |
| 22:26 | Results: Weizmann horse dataset (1) |
| 22:45 | Results: Weizmann horse dataset (2) |
| 22:55 | Unsupervised shape learning |
| 25:40 | Roadmap |
| 25:45 | Modelling image variability II |
| 27:23 | Discriminative model: occlusion (1) |
| 27:38 | Discriminative model: occlusion (2) |
| 28:16 | Images annotated with parts (1) |
| 29:14 | Images annotated with parts (2) |
| 29:18 | Discriminative appearance model (1) |
| 30:32 | Discriminative appearance model (2) |
| 31:36 | Random forest classifier are fast! |
| 33:15 | Applying the model |
| 35:05 | Layout consistency (1) |
| 36:00 | Layout consistency (2) |
| 36:34 | Layout consistency (3) |
| 37:17 | Layout consistency (4) |
| 37:28 | Layout Consistent Random Field |
| 38:42 | Effect of layout consistency |
| 39:31 | Results: UIUC car database |
| 40:33 | Results: occluded faces |
| 41:27 | Results: multiple classes |
| 43:13 | Roadmap |
| 43:25 | Modelling image variability III |
| 45:45 | Hybrid model of appearance |
| 46:14 | The Jigsaw model |
| 48:17 | The Jigsaw generative model |
| 49:25 | Jigsaw example (1) |
| 51:00 | Jigsaw example (2) |
| 52:23 | Sparse Belief Propagation (1) |
| 53:14 | Sparse Belief Propagation (2) |
| 54:07 | Using bottom-up cues |
| 55:00 | Hybrid Belief Propagation |
| 55:34 | Hybrid BP: accuracy vs. efficiency |
| 56:38 | Hybrid BP jigsaws |
| 56:58 | Deformable jigsaw on video |
| 58:25 | Unwrap mosaics (1) |
| 59:11 | Unwrap mosaics (2) |
| 59:43 | Roadmap |
| 59:48 | Open challenges |
| 60:59 | Thank you |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !



