About
The goal of this ICDM 2012 workshop is to help closing the gap between data mining practice and theory. To this end, we intend to explore what is the essence of exploratory data mining and how to formalize it in a useful but theoretically well-founded way.
The workshop is motivated by a widely perceived discrepancy between theoretical data mining prototypes and practitioners’ requirements. A notable example is frequent pattern mining. Despite its attractive theoretical foundations, the practical use of frequent pattern mining methods has been limited. This is due to a difficulty to overcome issues, such as the pattern explosion problem and a discrepancy between usefulness and frequency. These issues have been addressed to some extent in the past 15 years, through heuristic post-processing steps and through rigorously motivated adaptations. The multitude of possible solution strategies has unfortunately to a large extent undermined the original elegance, and made it hard for practitioners to understand how to use these techniques.
The problem is however not restricted to frequent pattern mining alone. The multitude of available methods for typical exploratory data mining problems such as (subspace) clustering and dimensionality reduction is such that practitioners face a daunting task in selecting a suitable method. Additionally to the usability issues, less attention has been given on pattern mining methods for relational databases. Although most real world databases are relational, most pattern mining research has focused on one-table data.
We believe the core reasons for these difficulties are:
- Different users inevitably have different prior beliefs and goals, whereas most exploratory data mining algorithms have a rigid objective function and do not consider this.
- Formally comparing the quality of different data mining patterns is hard due to their widely varying nature (e.g. comparing a dimensionality reduction with a frequent itemset), unless their 'interestingness' can be quantified in a comparable manner.
- The iterative process of data mining is often not considered.
- Data mining in complex relational data is hard to fit into standard data mining prototypes.
- More generally, data mining methods tend to be rigid, defined for highly specific tasks, for highly specific and idealized data, and for very specific types of patterns.
The purpose of this workshop will be to serve as a forum of exchanging ideas on how to formalize exploratory data mining in order to make it useful in practice. This workshop will survey (through invited as well as contributed talks and posters) some existing attempts at addressing the problems mentioned above. We particularly encourage papers that present principled theoretical contributions motivated by real world requirements.
For more information please visit the workshop´s website.
Related categories
Uploaded videos:
Opening Remarks
Introduction
Jan 16, 2013
·
2552 Views
Keynote Talks
Network-based Data Integration for Computational Systems Biology
Jan 16, 2013
·
3208 Views
From Inductive Querying to Declarative Modeling for Data Mining
Jan 16, 2013
·
2774 Views
The Use of Randomization and Statistical Significance in Data Mining
Jan 16, 2013
·
3157 Views
Datamining "Looking backward, looking forward"
Jan 16, 2013
·
3218 Views
Lectures
Thorough analysis of log data with dependency rules: Practical solutions and the...
Jan 16, 2013
·
2362 Views
Enhancing the Analysis of Large Multimedia Applications Execution Traces with Fr...
Jan 16, 2013
·
2319 Views
Generalized Expansion Dimension
Jan 16, 2013
·
2376 Views
Generating Diverse Realistic Data Sets for Episode Mining
Jan 16, 2013
·
2214 Views