Dynamic Bayesian Networks for Multimodal Interaction
Description
Dynamic Bayesian networks (DBNs) offer a natural upgrade path beyond classical hidden Markov models and become especially relevant when temporal data contains higher order structure, multiple modalities or multi-person interaction. We describe several instantiations of dynamic Bayesian networks that are useful for modeling temporal phenomena spanning audio, video and haptic channels in single, two-person and multi-person activity. These models include input-output hidden Markov models, switched Kalman filters and, most generally, dynamical systems trees (DSTs). These models are used to learn audio-video interaction in social activities, video interaction in multi-person game playing and haptic-video interaction in robotic laparoscopy. Model parameters are estimated from data in an unsupervised setting using generalized expectation maximization methods. Subsequently, these models can predict, synthesize and classify various types of rich multimodal human activity. Experiments in gesture interaction, audio-video conversation, football game playing and surgical drill evaluation are shown.
| Slides | |
| 0:00 | Dynamic Bayesian Networks for Multimodal Interaction |
| 2:00 | Outline |
| 4:17 | Introduction |
| 6:54 | Bayesian Networks |
| 11:01 | Bayes Nets to Junction Trees |
| 12:35 | Junction Tree Algorithm |
| 14:57 | Junction Tree Algorithm |
| 15:58 | Maximum Likelihood with EM |
| 19:52 | Dynamic Bayes Nets |
| 24:48 | Two-Person Interaction |
| 26:31 | DBN: Hidden ARMA Model |
| 28:41 | DBN: Hidden ARMA Model |
| 29:34 | Hidden ARMA Features: |
| 29:48 | Conditional EM for hidden ARMA |
| 31:19 | Conditional EM |
| 32:31 | Conditional EM |
| 33:51 | Hidden ARMA on Gesture |
| 34:11 | DBN: Input-Output HMM |
| 34:37 | DBN: Input-Output HMM |
| 35:04 | Input-Output HMM Data |
| 35:46 | Video Representation |
| 36:06 | Video Representation |
| 36:18 | Input-Output HMM |
| 36:36 | Input-Output HMM with CEM |
| 37:05 | Input-Output HMM with CEM |
| 37:35 | Input-Output HMM Results |
| 38:12 | Intractable Dynamic Bayes Nets |
| 38:43 | Intractable DBNs: Generalized EM |
| 39:51 | Intractable DBNs Variational EM |
| 40:44 | Dynamical System Trees |
| 41:11 | Dynamical System Trees |
| 41:55 | DSTs and Generalized EM |
| 42:07 | DSTs for American Football |
| 42:18 | DSTs for American Football |
| 43:19 | DSTs for Gene Networks |
| 43:44 | Robotic Surgery, Haptics & Video |
| 44:17 | Robotic Surgery, Haptics & Video |
| 44:26 | Robotic Surgery, Haptics & Video |
| 45:03 | Robotic Surgical Drills Results |
| 46:26 | Conclusion |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





Not able to see the entire video, stops around 16 mins or so.