event thumbnail image
International Conference on Machine Learning - Bonn 2005

Supervised Clustering with Support Vector Machines

author: Thomas Finley, Cornell University

Description

Supervised clustering is the problem of training a clustering algorithm to produce desirable clusterings: given sets of items and complete clusterings over these sets, we learn how to cluster future sets of items. Example applications include noun-phrase coreference clustering, and clustering news articles by whether they refer to the same topic. In this paper we present an SVM algorithm that trains a clustering algorithm by adapting the item-pair similarity measure. The algorithm may optimize a variety of different clustering functions to a variety of clustering performance measures. We empirically evaluate the algorithm for noun-phrase and news article clustering.

You might be experiencing some problems with Your Video player.
Slides
0:01 Supervised Clustering with SVMs
0:21 Supervised Clustering Talk Outline
0:38 Clustering Marbles
0:52 Multiple Possible Criteria for Similarity
1:02 How to Adjust
2:11 Noun Phrase Coreference
2:48 News Story Clustering
3:16 Supervised Clustering Talk Outline
3:31 How Do We Learn?
3:42 Simple Clustering
4:13 Clustering Objective
5:05 Pairwise Features & Similarity
5:54 Naïve Training Example
6:40 Problem 1: Hard Coded Performance Measure
7:27 Problem 2: Clustering Interactions
8:15 Supervised Clustering Talk Outline
8:18 SVMstruct Overview
9:20 Ψ for Clustering
10:05 Δ for Clustering
11:05 Linear Constraint
11:56 Quadratic Program Formulation
12:27 Algorithm to Select Constraints
13:22 Computing the Argmax
15:12 Supervised Clustering Talk Outline
15:21 NP Coreference
16:15 News Story Clustering
16:45 Building the pairwise feature vector ϕ
16:58 SVMcluster vs. PCC
19:34 Optimizing to Right Δ
20:57 Inclusion of Δ in Finding Constraint
22:20 Real Relaxation versus Greedy Clustering
23:30 Conclusions

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: