Correlation Clustering in MapReduce

Published on 2014-10-072645 Views

Flavio Chierichetti

Correlation clustering is a basic primitive in data miner’s toolkit with applications ranging from entity matching to social network analysis. The goal in correlation clustering is, given a graph wi

Research Sessions

Related categories

Presentation

Correlation Clustering in Map-Reduce00:00

Correlation Clustering - 301:14

Large-Scale Computation - 102:10

Large-Scale Computation - 202:57

Large-Scale Computation - 303:08

Correlation Clustering - 103:32

Correlation Clustering - 203:57

Correlation Clustering - 404:15

Correlation Clustering - 504:22

Correlation Clustering - 604:44

Correlation Clustering - 705:13

Correlation Clustering - 805:29

Correlation Clustering - 905:34

Correlation Clustering - 1005:41

Correlation Clustering - 1105:45

Correlation Clustering - 1206:05

How to find the best clustering?06:27

How to minimize the number of mistakes?06:45

The Pivot Algorithm - 106:57

The Pivot Algorithm - 207:19

The Pivot Algorithm - 307:25

The Pivot Algorithm - 407:29

The Pivot Algorithm - 507:38

The Pivot Algorithm - 607:43

The Pivot Algorithm - 707:49

The Pivot Algorithm - 807:51

The Pivot Algorithm - 907:58

The Pivot Algorithm - 1008:03

The Pivot Algorithm - 1108:10

The Pivot Algorithm - 1208:32

The Pivot Algorithm - 1308:46

The Pivot Algorithm - 1408:53

The Pivot Algorithm - 1509:03

The Pivot Algorithm - 1609:06

The Pivot Algorithm - 1709:07

The Pivot Algorithm - 1809:16

How to speed up the computation?09:21

Our Contribution - 109:37

Our Contribution - 209:57

Our Contribution - 310:07

Our Contribution - 410:21

Parallel Pivot - 110:46

Parallel Pivot - 211:07

Parallel Pivot - 311:10

Parallel Pivot - 411:24

Parallel Pivot - 511:45

Parallel Pivot - 611:51

Parallel Pivot - 712:06

Parallel Pivot - 812:35

Parallel Pivot - 912:42

Parallel Pivot - 1012:45

Parallel Pivot - 1112:49

Parallel Pivot - 1212:52

Parallel Pivot - 1312:53

Parallel Pivot - 1412:56

Parallel Pivot - 1513:03

Parallel Pivot - 1613:26

Parallel Pivot - 1713:41

Parallel Pivot - 1813:50

Parallel Pivot - 1913:59

Parallel Pivot - 2014:04

Parallel Pivot - 2114:44

Parallel Pivot - 2214:57

Parallel Pivot - 2315:11

Parallel Pivot - 2415:22

Why do we sample elements with probability epsilon over delta+ ?15:28

Other sampling approaches? - 115:51

Other sampling approaches? - 216:23

Other sampling approaches? - 316:32

Other sampling approaches? - 416:51

Parallel Pivot - 2517:23

Parallel Pivot - 2617:33

Twitter Dataset - 118:00

Twitter Dataset - 218:14

Twitter Dataset - 318:50

Twitter Dataset - 419:11

Thanks!19:56