Training Structural SVMs when Exact Inference is Intractable

author:Thomas Finley, Cornell University
published: July 28, 2008,   recorded: July 2008,   views: 124
You might be experiencing some problems with Your Video player.

Slides

Slides
0:00 Training Structural SVMs when Exact Inference is Intractable
0:23 Talk Outline - Structured Prediction
0:28 Structured Learning - 1
0:33 Structured Learning - 2
0:36 Structured Learning - 3
0:39 Structured Learning - 4
0:44 Structured Learning - 5
0:51 Structured Learning - 6
0:56 Structured Learning - 7
1:03 Structured Learning - 8
1:08 Parameters for Structured Predictors - 1
1:11 Parameters for Structured Predictors - 2
1:24 Parameters for Structured Predictors - 3
1:38 Parameters for Structured Predictors - 4
1:48 Parameters for Structured Predictors - 5
2:34 Some Tasks - 1
2:38 Some Tasks - 2
2:48 Some Tasks - 3
2:59 Some Tasks - 4
3:06 Some Tasks - 5
3:16 Talk Outline - Structural SVMs
3:18 Linear Constraint - 1
3:26 Linear Constraint - 2
3:28 Linear Constraint - 3
3:32 Linear Constraint - 4
3:33 Linear Constraint - 5
3:36 Linear Constraint - 6
3:44 Linear Constraint - 7
3:50 Linear Constraint - 8
3:52 Quadratic Program Formulation - 1
4:01 Quadratic Program Formulation - 2
4:10 Cutting Plane Example - 1
4:16 Cutting Plane Example - 2
4:39 Cutting Plane Example - 3
4:45 Cutting Plane Example - 4
4:47 Cutting Plane Example - 5
4:54 Structural SVM Learner - 1
4:57 Structural SVM Learner - 2
4:59 Structural SVM Learner - 3
5:03 Structural SVM Learner - 4
5:06 Structural SVM Learner - 5
5:13 Structural SVM Learner - 6
5:17 Important Theoretical Properties
5:48 Talk Outline - Approximate Inference in SSVMs: Theoretical Analysis
5:50 Approximations - 1
6:04 Approximations - 2
6:08 Approximations - 3
6:20 Cutting Plane Example - 1
6:20 Cutting Plane Example - 2
6:23 Cutting Plane Example - 3
6:24 Cutting Plane Example - 4
6:26 Cutting Plane Example - 5
6:30 Cutting Plane Example - 6
6:53 Undergenerating Approximations
7:12 Undergenerating ρ-Approximations
7:33 Undergenerating ρ-Approx Theorems - 1
7:37 Undergenerating ρ-Approx Theorems - 2
7:42 Undergenerating ρ-Approx Theorems - 3
7:45 Undergenerating ρ-Approx Theorems - 4
7:49 Undergenerating ρ-Approx Theorems - 5
7:54 Undergenerating ρ-Approx Theorems - 6
8:20 Undergenerating ρ-Approx Theorems - 7
8:27 Approximations - 4
8:44 Approximations - 5
9:18 Overgenerating Approx Theory in a Nutshell
9:56 Talk Outline - Approximate Inference in SSVMs: Empirical Analysis
10:04 Our Testbed: Binary Pairwise MRFs - 1
10:10 Our Testbed: Binary Pairwise MRFs - 2
10:24 Application: Multilabel Classification
12:11 Training/Predictive Inference
12:41 Datasets
13:04 Undergenerating Approximations
13:26 Overgenerating Approximations
13:42 Third Algorithm Class. for Comparison Only
14:11 The Sorry State of LBP - 1
14:27 The Sorry State of LBP - 2
14:42 The Sorry State of LBP - 3
14:51 The Sorry State of LBP - 4
14:55 The Sorry State of LBP - 5
15:04 The Sorry State of LBP - 6
15:47 Relaxation - 1
16:01 Relaxation - 2
16:46 Relaxation - 3
17:32 Known Approximations
18:27 Summary
19:04 Software
19:17 Thank You

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.

Description

While discriminative training (e.g., CRF, structural SVM) holds much promise for machine translation, image segmentation, and clustering, the complex inference these applications require make exact training intractable. This leads to a need for approximate training methods. Unfortunately, knowledge about how to perform efficient and effective approximate training is limited. Focusing on structural SVMs, we provide and explore algorithms for two different classes of approximate training algorithms, which we call undergenerating (e.g., greedy) and overgenerating (e.g., relaxations) algorithms. We provide a theoretical and empirical analysis of both types of approximate trained structural SVMs, focusing on fully connected pairwise Markov random fields. We find that models trained with overgenerating methods have theoretic advantages over undergenerating methods, are empirically robust relative to their undergenerating brethren, and relaxed trained models favor non-fractional predictions from relaxed predictor

Link this page  

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: