Supervised Clustering with Support Vector Machines
author:
Thomas Finley,
Cornell University
Description
Supervised clustering is the problem of training a clustering algorithm to produce desirable clusterings: given sets of items and complete clusterings over these sets, we learn how to cluster future sets of items. Example applications include noun-phrase coreference clustering, and clustering news articles by whether they refer to the same topic. In this paper we present an SVM algorithm that trains a clustering algorithm by adapting the item-pair similarity measure. The algorithm may optimize a variety of different clustering functions to a variety of clustering performance measures. We empirically evaluate the algorithm for noun-phrase and news article clustering.
Categories
Top: Computer Science: Machine Learning: ClusteringTop: Computer Science: Machine Learning: Kernel Methods: Support Vector Machines
You might be experiencing some problems with Your Video player.
| Slides | |
| 0:01 | Supervised Clustering with SVMs |
| 0:21 | Supervised Clustering Talk Outline |
| 0:38 | Clustering Marbles |
| 0:52 | Multiple Possible Criteria for Similarity |
| 1:02 | How to Adjust |
| 2:11 | Noun Phrase Coreference |
| 2:48 | News Story Clustering |
| 3:16 | Supervised Clustering Talk Outline |
| 3:31 | How Do We Learn? |
| 3:42 | Simple Clustering |
| 4:13 | Clustering Objective |
| 5:05 | Pairwise Features & Similarity |
| 5:54 | Naïve Training Example |
| 6:40 | Problem 1: Hard Coded Performance Measure |
| 7:27 | Problem 2: Clustering Interactions |
| 8:15 | Supervised Clustering Talk Outline |
| 8:18 | SVMstruct Overview |
| 9:20 | Ψ for Clustering |
| 10:05 | Δ for Clustering |
| 11:05 | Linear Constraint |
| 11:56 | Quadratic Program Formulation |
| 12:27 | Algorithm to Select Constraints |
| 13:22 | Computing the Argmax |
| 15:12 | Supervised Clustering Talk Outline |
| 15:21 | NP Coreference |
| 16:15 | News Story Clustering |
| 16:45 | Building the pairwise feature vector ϕ |
| 16:58 | SVMcluster vs. PCC |
| 19:34 | Optimizing to Right Δ |
| 20:57 | Inclusion of Δ in Finding Constraint |
| 22:20 | Real Relaxation versus Greedy Clustering |
| 23:30 | Conclusions |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
SEE ALSO:
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !





