event thumbnail image
Modelling in Classification and Statistical Learning Workshop

Robustness properties of support vector machines and related methods

author: Andreas Christmann, Department of Mathematics, University of Bayreuth

Description

The talk brings together methods from two disciplines: machine learning theory and robust statistics. We argue that robustness is an important aspect and we show that many existing machine learning methods based on convex risk minimization have - besides other good properties - also the advantage of being robust if the kernel and the loss function are chosen appropriately. Our results cover classification and regression problems. Assumptions are given for the existence of the influence function and for bounds on the influence function. Kernel logistic regression, support vector machines, least squares and the AdaBoost loss function are treated as special cases. We also consider Robust Learning from Bites, a simple method to make some methods from convex risk minimization applicable for huge data sets for which currently available algorithms are much to slow. As an example we use a data set from 15 German insurance companies.

You might be experiencing some problems with Your Video player.
Slides
0:29 Robustness properties of support vector machines
1:30 Convex Risk Minimization
3:55 Kernels
6:04 Convex Risk Minimization (Vapnik ’98)
10:47 Loss functions: classification
11:41 Loss functions: regression
13:48 Robustness of CRM
16:03 Main question
17:37 Robustness concepts
18:40 Robustness concepts
20:18 Comparison
23:14 Robustness: classification (Chr & Steinwart, ’04)
24:32 L0(zy; fP;¸(zx))©(zx) for KLR with RBF-kernel
26:31 Further results (Chr & Steinwart, ’04, ’05)
27:05 Regression examples: n=200 data points, skewness
31:37 Problem: SVM / CRM are ’non-robust posed problems’
35:21 3. Robust Learning from Bites (Chr ’05)
36:57 RLB: Robust Learning from Bites
38:15 RLB: computational aspects
39:15 RLB for kernel methods
39:43 RLB for kernel methods: number of support vectors
44:31 RLB for kernel methods: L¡risk consistency
44:52 Proof of L¡risk consistency
45:20 Consistency of RLB
46:02 Finite-sample breakdown point of RLB
47:42 4. Application: Motor Vehicle Insurance
48:30 Statistical objectives
48:41 Complex dependencies
49:09 Statistical model
50:59 Estimation of pure premium E(YjX=x)
51:45 Estimation of pure premium E(YjX=x)
54:22 RLB based on median: predictions ˆyi for 100 customers
63:40 References

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: