event thumbnail image
The 25th International Conference on Machine Learning (ICML 2008)

Training Structural SVMs when Exact Inference is Intractable

author: Thomas Finley, Cornell University

Description

While discriminative training (e.g., CRF, structural SVM) holds much promise for machine translation, image segmentation, and clustering, the complex inference these applications require make exact training intractable. This leads to a need for approximate training methods. Unfortunately, knowledge about how to perform efficient and effective approximate training is limited. Focusing on structural SVMs, we provide and explore algorithms for two different classes of approximate training algorithms, which we call undergenerating (e.g., greedy) and overgenerating (e.g., relaxations) algorithms. We provide a theoretical and empirical analysis of both types of approximate trained structural SVMs, focusing on fully connected pairwise Markov random fields. We find that models trained with overgenerating methods have theoretic advantages over undergenerating methods, are empirically robust relative to their undergenerating brethren, and relaxed trained models favor non-fractional predictions from relaxed predictor

You might be experiencing some problems with Your Video player.
Slides
0:00 Training Structural SVMs when Exact Inference is Intractable
0:23 Talk Outline - Structured Prediction
0:28 Structured Learning - 1
0:33 Structured Learning - 2
0:36 Structured Learning - 3
0:39 Structured Learning - 4
0:44 Structured Learning - 5
0:51 Structured Learning - 6
0:56 Structured Learning - 7
1:03 Structured Learning - 8
1:08 Parameters for Structured Predictors - 1
1:11 Parameters for Structured Predictors - 2
1:24 Parameters for Structured Predictors - 3
1:38 Parameters for Structured Predictors - 4
1:48 Parameters for Structured Predictors - 5
2:34 Some Tasks - 1
2:38 Some Tasks - 2
2:48 Some Tasks - 3
2:59 Some Tasks - 4
3:06 Some Tasks - 5
3:16 Talk Outline - Structural SVMs
3:18 Linear Constraint - 1
3:26 Linear Constraint - 2
3:28 Linear Constraint - 3
3:32 Linear Constraint - 4
3:33 Linear Constraint - 5
3:36 Linear Constraint - 6
3:44 Linear Constraint - 7
3:50 Linear Constraint - 8
3:52 Quadratic Program Formulation - 1
4:01 Quadratic Program Formulation - 2
4:10 Cutting Plane Example - 1
4:16 Cutting Plane Example - 2
4:39 Cutting Plane Example - 3
4:45 Cutting Plane Example - 4
4:47 Cutting Plane Example - 5
4:54 Structural SVM Learner - 1
4:57 Structural SVM Learner - 2
4:59 Structural SVM Learner - 3
5:03 Structural SVM Learner - 4
5:06 Structural SVM Learner - 5
5:13 Structural SVM Learner - 6
5:17 Important Theoretical Properties
5:48 Talk Outline - Approximate Inference in SSVMs: Theoretical Analysis
5:50 Approximations - 1
6:04 Approximations - 2
6:08 Approximations - 3
6:20 Cutting Plane Example - 1
6:20 Cutting Plane Example - 2
6:23 Cutting Plane Example - 3
6:24 Cutting Plane Example - 4
6:26 Cutting Plane Example - 5
6:30 Cutting Plane Example - 6
6:53 Undergenerating Approximations
7:12 Undergenerating ρ-Approximations
7:33 Undergenerating ρ-Approx Theorems - 1
7:37 Undergenerating ρ-Approx Theorems - 2
7:42 Undergenerating ρ-Approx Theorems - 3
7:45 Undergenerating ρ-Approx Theorems - 4
7:49 Undergenerating ρ-Approx Theorems - 5
7:54 Undergenerating ρ-Approx Theorems - 6
8:20 Undergenerating ρ-Approx Theorems - 7
8:27 Approximations - 4
8:44 Approximations - 5
9:18 Overgenerating Approx Theory in a Nutshell
9:56 Talk Outline - Approximate Inference in SSVMs: Empirical Analysis
10:04 Our Testbed: Binary Pairwise MRFs - 1
10:10 Our Testbed: Binary Pairwise MRFs - 2
10:24 Application: Multilabel Classification
12:11 Training/Predictive Inference
12:41 Datasets
13:04 Undergenerating Approximations
13:26 Overgenerating Approximations
13:42 Third Algorithm Class. for Comparison Only
14:11 The Sorry State of LBP - 1
14:27 The Sorry State of LBP - 2
14:42 The Sorry State of LBP - 3
14:51 The Sorry State of LBP - 4
14:55 The Sorry State of LBP - 5
15:04 The Sorry State of LBP - 6
15:47 Relaxation - 1
16:01 Relaxation - 2
16:46 Relaxation - 3
17:32 Known Approximations
18:27 Summary
19:04 Software
19:17 Thank You

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: