event thumbnail image
2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms
Pascal

Dynamic Bayesian Networks for Multimodal Interaction

author: Tony Jebara, Columbia University

Description

Dynamic Bayesian networks (DBNs) offer a natural upgrade path beyond classical hidden Markov models and become especially relevant when temporal data contains higher order structure, multiple modalities or multi-person interaction. We describe several instantiations of dynamic Bayesian networks that are useful for modeling temporal phenomena spanning audio, video and haptic channels in single, two-person and multi-person activity. These models include input-output hidden Markov models, switched Kalman filters and, most generally, dynamical systems trees (DSTs). These models are used to learn audio-video interaction in social activities, video interaction in multi-person game playing and haptic-video interaction in robotic laparoscopy. Model parameters are estimated from data in an unsupervised setting using generalized expectation maximization methods. Subsequently, these models can predict, synthesize and classify various types of rich multimodal human activity. Experiments in gesture interaction, audio-video conversation, football game playing and surgical drill evaluation are shown.

You might be experiencing some problems with Your Video player.
Slides
0:00 Dynamic Bayesian Networks for Multimodal Interaction
2:00 Outline
4:17 Introduction
6:54 Bayesian Networks
11:01 Bayes Nets to Junction Trees
12:35 Junction Tree Algorithm
14:57 Junction Tree Algorithm
15:58 Maximum Likelihood with EM
19:52 Dynamic Bayes Nets
24:48 Two-Person Interaction
26:31 DBN: Hidden ARMA Model
28:41 DBN: Hidden ARMA Model
29:34 Hidden ARMA Features:
29:48 Conditional EM for hidden ARMA
31:19 Conditional EM
32:31 Conditional EM
33:51 Hidden ARMA on Gesture
34:11 DBN: Input-Output HMM
34:37 DBN: Input-Output HMM
35:04 Input-Output HMM Data
35:46 Video Representation
36:06 Video Representation
36:18 Input-Output HMM
36:36 Input-Output HMM with CEM
37:05 Input-Output HMM with CEM
37:35 Input-Output HMM Results
38:12 Intractable Dynamic Bayes Nets
38:43 Intractable DBNs: Generalized EM
39:51 Intractable DBNs Variational EM
40:44 Dynamical System Trees
41:11 Dynamical System Trees
41:55 DSTs and Generalized EM
42:07 DSTs for American Football
42:18 DSTs for American Football
43:19 DSTs for Gene Networks
43:44 Robotic Surgery, Haptics & Video
44:17 Robotic Surgery, Haptics & Video
44:26 Robotic Surgery, Haptics & Video
45:03 Robotic Surgical Drills Results
46:26 Conclusion

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Reviews and comments:

Comment1 singh, October 24, 2007 at 11:39 p.m.:

Not able to see the entire video, stops around 16 mins or so.


Write your own review or comment: