Practical Theories for Exploratory Data Mining (PTDM), Brussels 2012

Practical Theories for Exploratory Data Mining (PTDM), Brussels 2012

9 Lectures · Dec 10, 2012

About

The goal of this ICDM 2012 workshop is to help closing the gap between data mining practice and theory. To this end, we intend to explore what is the essence of exploratory data mining and how to formalize it in a useful but theoretically well-founded way.

The workshop is motivated by a widely perceived discrepancy between theoretical data mining prototypes and practitioners’ requirements. A notable example is frequent pattern mining. Despite its attractive theoretical foundations, the practical use of frequent pattern mining methods has been limited. This is due to a difficulty to overcome issues, such as the pattern explosion problem and a discrepancy between usefulness and frequency. These issues have been addressed to some extent in the past 15 years, through heuristic post-processing steps and through rigorously motivated adaptations. The multitude of possible solution strategies has unfortunately to a large extent undermined the original elegance, and made it hard for practitioners to understand how to use these techniques.

The problem is however not restricted to frequent pattern mining alone. The multitude of available methods for typical exploratory data mining problems such as (subspace) clustering and dimensionality reduction is such that practitioners face a daunting task in selecting a suitable method. Additionally to the usability issues, less attention has been given on pattern mining methods for relational databases. Although most real world databases are relational, most pattern mining research has focused on one-table data.

We believe the core reasons for these difficulties are:

  • Different users inevitably have different prior beliefs and goals, whereas most exploratory data mining algorithms have a rigid objective function and do not consider this.
  • Formally comparing the quality of different data mining patterns is hard due to their widely varying nature (e.g. comparing a dimensionality reduction with a frequent itemset), unless their 'interestingness' can be quantified in a comparable manner.
  • The iterative process of data mining is often not considered.
  • Data mining in complex relational data is hard to fit into standard data mining prototypes.
  • More generally, data mining methods tend to be rigid, defined for highly specific tasks, for highly specific and idealized data, and for very specific types of patterns.

The purpose of this workshop will be to serve as a forum of exchanging ideas on how to formalize exploratory data mining in order to make it useful in practice. This workshop will survey (through invited as well as contributed talks and posters) some existing attempts at addressing the problems mentioned above. We particularly encourage papers that present principled theoretical contributions motivated by real world requirements.

For more information please visit the workshop´s website.

Related categories

Uploaded videos:

Opening Remarks

video-img
17:34

Introduction

Tijl De Bie

Jan 16, 2013

 · 

2552 Views

Opening

Keynote Talks

video-img
41:19

Network-based Data Integration for Computational Systems Biology

Kathleen Marchal

Jan 16, 2013

 · 

3208 Views

Keynote
video-img
58:29

From Inductive Querying to Declarative Modeling for Data Mining

Luc De Raedt

Jan 16, 2013

 · 

2774 Views

Keynote
video-img
55:29

The Use of Randomization and Statistical Significance in Data Mining

Kai Puolamäki

Jan 16, 2013

 · 

3157 Views

Keynote
video-img
55:15

Datamining "Looking backward, looking forward"

Pieter Adriaans

Jan 16, 2013

 · 

3218 Views

Keynote

Lectures

video-img
17:08

Thorough analysis of log data with dependency rules: Practical solutions and the...

Wilhelmiina Hämäläinen

Jan 16, 2013

 · 

2362 Views

Lecture
video-img
13:29

Enhancing the Analysis of Large Multimedia Applications Execution Traces with Fr...

Christiane Kamdem Kengne

Jan 16, 2013

 · 

2319 Views

Lecture
video-img
16:33

Generalized Expansion Dimension

Michael Nett

Jan 16, 2013

 · 

2376 Views

Lecture
video-img
18:08

Generating Diverse Realistic Data Sets for Episode Mining

Albrecht Zimmermann

Jan 16, 2013

 · 

2214 Views

Lecture