A Support Vector Method for Multivariate Performance Measures

author:Thorsten Joachims, Cornell University
published: April 12, 2007,   recorded: August 2005,   views: 112
Categories
You might be experiencing some problems with Your Video player.

Related content

Visitors who watched this lecture also watched...
45:02
Learning Rankings for Information Retrieval

369 views - Thorsten Joachims, 2005
05:17
KDD-07 Best Paper Awards

702 views - Thorsten Joachims, 2007
03:54:31
Support Vector Machines

12744 views - Chih-Jen Lin, 2006
01:28:05
Introduction to Support Vector Machines

4638 views - Colin Campbell, 2008
04:59:19
Machine Learning, Probability and Graphical Models

18414 views - Sam Roweis, 2006
05:52:18
Support Vector Machines and Kernels

706 views - Bernhard Schölkopf, 2002
26:32
Supervised Clustering with Support Vector Machines

233 views - Thomas Finley, 2005
02:13:31
Information Retrieval and Language Technology

263 views - Thorsten Joachims, 2003
04:16:53
Learning with Kernels

3765 views - Bernhard Schölkopf, 2003
48:35
Tricks of the trade for training SVMs

191 views - Gökhan H. Bakir, 2005

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.

Description

We examine the relationship between the predictions made by different learning algorithms and true posterior probabilities. We show that maximum margin methods such as boosted trees and boosted stumps push probability mass away from 0 and 1 yielding a characteristic sigmoid shaped distortion in the predicted probabilities. Models such as Naive Bayes, which make unrealistic independence assumptions, push probabilities toward 0 and 1. Other models such as neural nets and bagged trees do not have these biases and predict well calibrated probabilities. We experiment with two ways of correcting the biased probabilities predicted by some learning methods: Platt Scaling and Isotonic Regression. We qualitatively examine what kinds of distortions these calibration methods are suitable for and quantitatively examine how much data they need to be effective. The empirical results show that after calibration boosted trees, random forests, and SVMs predict the best probabilities.

Link this page  

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Reviews and comments:

Comment1 Urvang, February 12, 2009 at 2:35 p.m.:

The title of this video and the content seem totally unrelated.

Write your own review or comment:

make sure you have javascript enabled or clear this field: