event thumbnail image
The 25th International Conference on Machine Learning (ICML 2008)

Optimized Cutting Plane Algorithm for Support Vector Machines

author: Vojtech Franc, Fraunhofer Institute Computer Architecture and Software Technology

Description

We have developed a new Linear Support Vector Machine (SVM) training algorithm called OCAS. Its computational effort scales linearly with the sample size. In an extensive empirical evaluation OCAS significantly outperforms current state of the art SVM solvers, like SVMLight, SVMPerf and BMRM, achieving speedups of over 1,000 on some datasets over SVMLight and 20 over SVMPerf, while obtaining the same precise Support Vector solution. OCAS even in the early optimization steps shows often faster convergence than the so far in this domain prevailing approximative methods SGD and Pegasos. Effectively parallelizing OCAS we were able to train on a dataset of size 15 million examples (itself about 32GB in size) in just 671 seconds --- a competing string kernel SVM required 97,484 seconds to train on 10 million examples sub-sampled from this dataset.

You might be experiencing some problems with Your Video player.
Slides
0:00 Optimized Cutting Plane Algorithm for Support Vector Machines
0:00 Motivation
1:21 Problem definition
2:38 Cutting plane algorithm [Joachims2006][Teo2007] (1)
3:24 Cutting plane algorithm [Joachims2006][Teo2007] (2)
4:03 Cutting plane algorithm: illustration (1)
4:13 Cutting plane algorithm: illustration (2)
4:15 Cutting plane algorithm: illustration (3)
4:17 Cutting plane algorithm: illustration (4)
4:27 Cutting plane algorithm: illustration (5)
4:35 Cutting plane algorithm
6:09 Our proposal to accelerate the CP algorithm (1)
7:36 Our proposal to accelerate the CP algorithm (2)
8:15 Our proposal to accelerate the CP algorithm (3)
10:26 Solving the line-search eficiently in mlogm time (1)
10:59 Solving the line-search eficiently in mlogm time (2)
11:31 Solving the line-search eficiently in mlogm time (3)
12:24 Experiments: Learning binary linear SVM classifier (1)
14:22 Experiments: Learning binary linear SVM classifier (2)
14:52 Experiments: Performance versus time (1)
16:08 Experiments: Performance versus time (2)
18:03 Experiments: Time versus objective value
19:39 Experiments: Speedups due to parallelizing (1)
20:53 Experiments: Speedups due to parallelizing (2)
21:11 Conclusions

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: