Training Structural SVMs when Exact Inference is Intractable
Description
While discriminative training (e.g., CRF, structural SVM) holds much promise for machine translation, image segmentation, and clustering, the complex inference these applications require make exact training intractable. This leads to a need for approximate training methods. Unfortunately, knowledge about how to perform efficient and effective approximate training is limited. Focusing on structural SVMs, we provide and explore algorithms for two different classes of approximate training algorithms, which we call undergenerating (e.g., greedy) and overgenerating (e.g., relaxations) algorithms. We provide a theoretical and empirical analysis of both types of approximate trained structural SVMs, focusing on fully connected pairwise Markov random fields. We find that models trained with overgenerating methods have theoretic advantages over undergenerating methods, are empirically robust relative to their undergenerating brethren, and relaxed trained models favor non-fractional predictions from relaxed predictor
| Slides | |
| 0:00 | Training Structural SVMs when Exact Inference is Intractable |
| 0:23 | Talk Outline - Structured Prediction |
| 0:28 | Structured Learning - 1 |
| 0:33 | Structured Learning - 2 |
| 0:36 | Structured Learning - 3 |
| 0:39 | Structured Learning - 4 |
| 0:44 | Structured Learning - 5 |
| 0:51 | Structured Learning - 6 |
| 0:56 | Structured Learning - 7 |
| 1:03 | Structured Learning - 8 |
| 1:08 | Parameters for Structured Predictors - 1 |
| 1:11 | Parameters for Structured Predictors - 2 |
| 1:24 | Parameters for Structured Predictors - 3 |
| 1:38 | Parameters for Structured Predictors - 4 |
| 1:48 | Parameters for Structured Predictors - 5 |
| 2:34 | Some Tasks - 1 |
| 2:38 | Some Tasks - 2 |
| 2:48 | Some Tasks - 3 |
| 2:59 | Some Tasks - 4 |
| 3:06 | Some Tasks - 5 |
| 3:16 | Talk Outline - Structural SVMs |
| 3:18 | Linear Constraint - 1 |
| 3:26 | Linear Constraint - 2 |
| 3:28 | Linear Constraint - 3 |
| 3:32 | Linear Constraint - 4 |
| 3:33 | Linear Constraint - 5 |
| 3:36 | Linear Constraint - 6 |
| 3:44 | Linear Constraint - 7 |
| 3:50 | Linear Constraint - 8 |
| 3:52 | Quadratic Program Formulation - 1 |
| 4:01 | Quadratic Program Formulation - 2 |
| 4:10 | Cutting Plane Example - 1 |
| 4:16 | Cutting Plane Example - 2 |
| 4:39 | Cutting Plane Example - 3 |
| 4:45 | Cutting Plane Example - 4 |
| 4:47 | Cutting Plane Example - 5 |
| 4:54 | Structural SVM Learner - 1 |
| 4:57 | Structural SVM Learner - 2 |
| 4:59 | Structural SVM Learner - 3 |
| 5:03 | Structural SVM Learner - 4 |
| 5:06 | Structural SVM Learner - 5 |
| 5:13 | Structural SVM Learner - 6 |
| 5:17 | Important Theoretical Properties |
| 5:48 | Talk Outline - Approximate Inference in SSVMs: Theoretical Analysis |
| 5:50 | Approximations - 1 |
| 6:04 | Approximations - 2 |
| 6:08 | Approximations - 3 |
| 6:20 | Cutting Plane Example - 1 |
| 6:20 | Cutting Plane Example - 2 |
| 6:23 | Cutting Plane Example - 3 |
| 6:24 | Cutting Plane Example - 4 |
| 6:26 | Cutting Plane Example - 5 |
| 6:30 | Cutting Plane Example - 6 |
| 6:53 | Undergenerating Approximations |
| 7:12 | Undergenerating ρ-Approximations |
| 7:33 | Undergenerating ρ-Approx Theorems - 1 |
| 7:37 | Undergenerating ρ-Approx Theorems - 2 |
| 7:42 | Undergenerating ρ-Approx Theorems - 3 |
| 7:45 | Undergenerating ρ-Approx Theorems - 4 |
| 7:49 | Undergenerating ρ-Approx Theorems - 5 |
| 7:54 | Undergenerating ρ-Approx Theorems - 6 |
| 8:20 | Undergenerating ρ-Approx Theorems - 7 |
| 8:27 | Approximations - 4 |
| 8:44 | Approximations - 5 |
| 9:18 | Overgenerating Approx Theory in a Nutshell |
| 9:56 | Talk Outline - Approximate Inference in SSVMs: Empirical Analysis |
| 10:04 | Our Testbed: Binary Pairwise MRFs - 1 |
| 10:10 | Our Testbed: Binary Pairwise MRFs - 2 |
| 10:24 | Application: Multilabel Classification |
| 12:11 | Training/Predictive Inference |
| 12:41 | Datasets |
| 13:04 | Undergenerating Approximations |
| 13:26 | Overgenerating Approximations |
| 13:42 | Third Algorithm Class. for Comparison Only |
| 14:11 | The Sorry State of LBP - 1 |
| 14:27 | The Sorry State of LBP - 2 |
| 14:42 | The Sorry State of LBP - 3 |
| 14:51 | The Sorry State of LBP - 4 |
| 14:55 | The Sorry State of LBP - 5 |
| 15:04 | The Sorry State of LBP - 6 |
| 15:47 | Relaxation - 1 |
| 16:01 | Relaxation - 2 |
| 16:46 | Relaxation - 3 |
| 17:32 | Known Approximations |
| 18:27 | Summary |
| 19:04 | Software |
| 19:17 | Thank You |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !


