About
This workshop brings together scientists and engineers interested in recent developments in exploiting massive data sets. Emphasis is placed on available techniques and their use in security-critical applications.
Today our world is awash in data: we live in an Information Society where every action leaves a trace, generating massive amounts of data. Recent scientific developments provide technologies to exploit these huge amounts of data and extract critical information from them. Used today in many commercial applications (marketing campaigns, user profiling and recommendations on e-commerce sites, web search, user communities...), these technologies can also be used for security-critical applications (fraud and money laundering detection, intrusion detection, intelligence gathering, terrorist network detection, Web surveillance...).
It is the purpose of this workshop to review the various technologies available (data mining algorithms, social networks, crawling and indexing, text-mining, search engines, data streams) in the context of very large data sets.
The workshop will provide survey presentations and posters to help build a scientific community aware of security issues and of the techniques available to solve them.
Videos
Opening and Closing session

Introduction to the Workshop
Nov 26, 2007
·
3087 views

Conclusion remarks
Dec 4, 2007
·
5002 views
Interviews

Interview with Clive Best
Dec 4, 2007
·
3556 views

Interview with Françoise Fogelman Soulié
Dec 4, 2007
·
4446 views
Lectures

Combining Information Retrieval and Information Extraction for Medical Intellige...
Dec 3, 2007
·
5685 views

Plastic Card Fraud Detection using Peer Group Analysis
Dec 3, 2007
·
10184 views

The "Real World" Web Search Problem
Dec 3, 2007
·
5985 views

Evolving Networks
Sep 21, 2008
·
3255 views

Security Applications of Web mining
Dec 4, 2007
·
6446 views

Modeling rare events: online advertisement targeting using machine learning and ...
Dec 3, 2007
·
8618 views

Diffusion and Cascading Behaviour in Networks
Nov 28, 2007
·
15261 views

Detecting Money Laundering Actions Using Data Mining and Expert Systems
Dec 3, 2007
·
10249 views

Approximation algorithms for k-anonymity and privacy preservation in query logs
Dec 3, 2007
·
7870 views

Learning with structured data - structured outputs
Nov 26, 2007
·
4581 views

Information Theoretic and Algebraic Methods for Network Anomaly Detection
Nov 26, 2007
·
4871 views

Automatic detection and aggregation of name variants from large multi-lingual do...
Nov 26, 2007
·
4023 views

User logs processing using machine learning techniques
Nov 26, 2007
·
5947 views

Web Spam Detection
Nov 28, 2007
·
10284 views

Open Source Intelligence
Dec 3, 2007
·
32819 views

Statistical techniques for fraud detection, prevention, and evaluation
Dec 3, 2007
·
42056 views

Website Privacy Preservation for Query Log Publishing
Nov 29, 2007
·
6095 views

Ontologies and Machine Learning
Nov 26, 2007
·
9758 views

CiteSeerX & ChemXSeer: Lessons for Cyber-infrastructure and Web
Dec 4, 2007
·
4254 views

Learning to Extract Security-related Event Information from Large News Collectio...
Dec 3, 2007
·
3538 views

Machine Learning for Intrusion Detection
Nov 26, 2007
·
17725 views

Large-Scale Semi-Supervised Learning
Nov 26, 2007
·
5828 views

Ontogen Software Demo
Nov 26, 2007
·
9112 views

Filtering Multi-Lingual Terrorist Content with Graph-Theoretic Classification T...
Nov 26, 2007
·
4091 views

Foundations of Statistical Learning Theory : Empirical Inference in high-diment...
Dec 3, 2007
·
10273 views

Recognition and Disambiguation of geographical references in text
Nov 26, 2007
·
3334 views

Using linguistic information as features for text categorization
Nov 26, 2007
·
4852 views

Link Analysis and Text Mining : Current State of the Art and Applications for Co...
Dec 3, 2007
·
10666 views

Mining Networks through Visual Analytics:
Nov 26, 2007
·
4339 views

The Security of Mobile Agent Systems
Jan 7, 2008
·
4249 views

Feature selection, fundamentals and applications
Dec 3, 2007
·
17085 views

Learning using Many Examples
Nov 26, 2007
·
5101 views

Inference and Learning with Networked Data
Jan 15, 2008
·
4901 views

Data stream management and mining
Nov 26, 2007
·
12091 views

Summarizing Data Stream's History
Nov 26, 2007
·
4390 views

Geolocalisation in Cellular Telephone Networks
Dec 3, 2007
·
6394 views

Fitting mixtures of regression lines with the forward search: application to clu...
Dec 3, 2007
·
4048 views

Emergent patterns in large social systems
Nov 26, 2007
·
4015 views

Mining Massive Data Sets
Nov 26, 2007
·
10051 views