On Finding Low Error Clusterings

author: Maria Balcan, Computer Science Department, Carnegie Mellon University
published: July 30, 2009,   recorded: June 2009,   views: 640
Categories
You might be experiencing some problems with Your Video player.

Slides

Slides
0:00 Finding Low Error Clusterings
0:27 Approximate clustering without the approximation
0:37 Clustering comes up everywhere
1:02 Formal Clustering Setup (1)
2:45 Formal Clustering Setup (2)
4:53 Standard theoretical approach (1)
6:17 Standard theoretical approach (2)
7:24 Formal Clustering Setup, Our Perspective (1)
7:50 Formal Clustering Setup, Our Perspective (2)
9:36 Formal Clustering Setup, Our Perspective (3)
10:38 Note on the Standard Approx. Algos Approach
11:47 Positive Results (1)
13:24 Positive Results (2)
14:23 How can we use the (c,) k-median property to cluster, without solving k-median?
14:45 Clustering from (c,) k-median prop (1)
18:17 Clustering from (c,) k-median prop (3)
19:13 Clustering from (c,) k-median prop (3)
19:56 Clustering from (c,) k-median prop (4)
20:37 Clustering from (c,) k-median prop (1)
20:53 Clustering from (c,) k-median prop (4)
21:10 Clustering from (c,) k-median prop (5)
21:54 Clustering from (c,) k-median prop (6)
21:54 Clustering from (c,) k-median prop (7)
22:28 Clustering from (c,) k-median prop (8)
23:14 Clustering from (c,) k-median prop (9)
23:31 Clustering from (c,) k-median prop (10)
24:32 O()-close ! -close (1)
26:27 O()-close ! -close (2)
27:38 Technical issues… (1)
29:12 Technical issues… (2)
29:56 Extensions: The inductive setting
31:46 (c,) k-means and min-sum properties
34:05 (c,) property; summary
35:44 Extensions: the (º,c,) k-median prop (1)
37:27 Extensions: the (º,c,) k-median prop (2)
37:45 Extensions: the (º,c,) k-median prop (3)
38:29 Extensions: the (º,c,) k-median prop (4)
38:39 Extensions: the (º,c,) k-median prop (5)
39:46 Extensions: the (º,c,) k-median prop (6)
40:31 (c,) Property: Open problems
41:31 Clustering: More General Picture (1)
42:31 Clustering: More General Picture (2)
43:22 Clustering: More General Picture (3)

Related content

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.
 
    Delicious Bibliography

Description

There has been substantial work on approximation algorithms for clustering data under distance-based objective functions such as k-median, k-means, and min-sum objectives. This work is fueled in part by the hope that approximating these objectives well will indeed yield more accurate solutions. That is, for problems such as clustering proteins by function, or clustering images by subject, there is some unknown correct "target" clustering and the implicit assumption is that clusterings that are approximately optimal in terms of these distance-based measures are also approximately correct in terms of error with respect to the target. In this work we show that if we make this implicit assumption explicit - that is, if we assume that any c-approximation to the given clustering objective Phi is epsilon-close to the target - then we can produce clusterings that are O(epsilon)-close to the target, even for values c for which obtaining a c-approximation is NP-hard. In particular, for the k-median, k-means, and min-sum objectives, we show that we can achieve this guarantee for any constant c > 1. Our results shows how for these clustering objectives one can get much better guarantees on accuracy than those implied by results obtained so far in the approximation literature, by wisely using all the available information for the problem at hand.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: