event thumbnail image
The 13th International Conference on Knowledge Discovery and Data Mining

Correlation Search in Graph Databases

author: Yiping Ke, The Hong Kong University of Science and Technology

Description

Correlation mining has gained great success in many application domains for its ability to capture the underlying dependency between objects. However, the research of correlation mining from graph databases is still lacking despite the fact that graph data, especially in various scientific domains, proliferate in recent years. In this paper, we propose a new problem of correlation mining from graph databases, called Correlated Graph Search (CGS). CGS adopts Pearson’s correlation coefficient as a correlation measure to take into consideration the occurrence distributions of graphs. However, the problem poses significant challenges, since every subgraph of a graph in the database is a candidate but the number of subgraphs is exponential. We derive two necessary conditions which set bounds on the occurrence probability of a candidate in the database. With this result, we design an efficient algorithm that operates on a much smaller projected database and thus we are able to obtain a significantly smaller set of candidates. To further improve the efficiency, we develop three heuristic rules and apply them on the candidate set to further reduce the search space. Our extensive experiments demonstrate the effectiveness of our method on candidate reduction. The results also justify the efficiency of our algorithm in mining correlations from large real and synthetic datasets.

You might be experiencing some problems with Your Video player.
Slides
0:03 Correlation Search in Graph Databases
0:30 Outline
0:45 Introduction pt 1
1:01 Introduction pt 2
1:15 Introduction - Motivation pt 1
1:40 Introduction - Motivation pt 2
1:56 Introduction - Motivation pt 3
2:09 Introduction - Motivation pt 4
2:14 Introduction - Motivation pt 5
2:25 Introduction - Correlation Search in Graph Databases pt 1
2:50 Introduction - Correlation Search in Graph Databases pt 2
3:01 Introduction - Correlation Search in Graph Databases pt 3
3:13 Introduction - Correlation Search in Graph Databases pt 4
3:27 Introduction - Contributions pt 1
3:42 Introduction - Contributions pt 2
4:15 Problem Definition - Correlation Measure pt 1
4:34 Problem Definition - Correlation Measure pt 2
4:56 Problem Definition - Correlation Measure pt 3
5:21 Problem Definition
5:44 Solution - Candidate Generation pt 1
6:01 Solution - Candidate Generation pt 2
6:27 Solution - Candidate Generation pt 3
6:57 Solution - Candidate Generation (cont’) pt 1
7:14 Solution - Candidate Generation (cont’) pt 2
7:27 Solution - Candidate Generation (cont’) pt 3
7:39 Solution - Candidate Generation (cont’) pt 4
7:54 Solution - Heuristic Rules pt 1
8:18 Solution - Heuristic Rules pt 2
8:39 Solution - CGSearch Algorithm
9:06 Performance Evaluation
9:56 Effect of Candidate Generation when Varying Query Support
10:29 Effect of Graph Size
10:52 Conclusions
11:46 Thank You

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment: