About
This workshop brings together scientists and engineers interested in recent developments in exploiting massive data sets. Emphasis is placed on available techniques and their use in security-critical applications.
Today our world is awash in data: we live in an Information Society where every action leaves a trace, generating massive amounts of data. Recent scientific developments provide technologies to exploit these huge amounts of data and extract critical information from them. Used today in many commercial applications (marketing campaigns, user profiling and recommendations on e-commerce sites, web search, user communities...), these technologies can also be used for security-critical applications (fraud and money laundering detection, intrusion detection, intelligence gathering, terrorist network detection, Web surveillance...).
It is the purpose of this workshop to review the various technologies available (data mining algorithms, social networks, crawling and indexing, text-mining, search engines, data streams) in the context of very large data sets.
The workshop will provide survey presentations and posters to help build a scientific community aware of security issues and of the techniques for addressing them.
Videos
Opening and Closing session

Introduction to the Workshop
Nov 26, 2007
·
3085 views

Conclusion remarks
Dec 4, 2007
·
5001 views
Interviews

Interview with Clive Best
Dec 4, 2007
·
3555 views

Interview with Françoise Fogelman Soulié
Dec 4, 2007
·
4445 views
Lectures

Combining Information Retrieval and Information Extraction for Medical Intellige...
Dec 3, 2007
·
5683 views

Plastic Card Fraud Detection using Peer Group Analysis
Dec 3, 2007
·
10178 views

The "Real World" Web Search Problem
Dec 3, 2007
·
5982 views

Evolving Networks
Sep 21, 2008
·
3253 views

Security Applications of Web mining
Dec 4, 2007
·
6444 views

Modeling rare events: online advertisement targeting using machine learning and ...
Dec 3, 2007
·
8616 views

Diffusion and Cascading Behaviour in Networks
Nov 28, 2007
·
15254 views

Detecting Money Laundering Actions Using Data Mining and Expert Systems
Dec 3, 2007
·
10247 views

Approximation algorithms for k-anonymity and privacy preservation in query logs
Dec 3, 2007
·
7869 views

Learning with structured data - structured outputs
Nov 26, 2007
·
4580 views

Information Theoretic and Algebraic Methods for Network Anomaly Detection
Nov 26, 2007
·
4870 views

Automatic detection and aggregation of name variants from large multi-lingual do...
Nov 26, 2007
·
4021 views

User logs processing using machine learning techniques
Nov 26, 2007
·
5945 views

Web Spam Detection
Nov 28, 2007
·
10279 views

Open Source Intelligence
Dec 3, 2007
·
32785 views

Statistical techniques for fraud detection, prevention, and evaluation
Dec 3, 2007
·
42054 views

Website Privacy Preservation for Query Log Publishing
Nov 29, 2007
·
6089 views

Ontologies and Machine Learning
Nov 26, 2007
·
9757 views

CiteSeerX & ChemXSeer: Lessons for Cyber-infrastructure and Web
Dec 4, 2007
·
4251 views

Learning to Extract Security-related Event Information from Large News Collectio...
Dec 3, 2007
·
3536 views

Machine Learning for Intrusion Detection
Nov 26, 2007
·
17723 views

Large-Scale Semi-Supervised Learning
Nov 26, 2007
·
5825 views

Ontogen Software Demo
Nov 26, 2007
·
9110 views

Filtering Multi-Lingual Terrorist Content with Graph-Theoretic Classification T...
Nov 26, 2007
·
4089 views

Foundations of Statistical Learning Theory : Empirical Inference in high-diment...
Dec 3, 2007
·
10268 views

Recognition and Disambiguation of geographical references in text
Nov 26, 2007
·
3334 views

Using linguistic information as features for text categorization
Nov 26, 2007
·
4850 views

Link Analysis and Text Mining : Current State of the Art and Applications for Co...
Dec 3, 2007
·
10665 views

Mining Networks through Visual Analytics:
Nov 26, 2007
·
4336 views

The Security of Mobile Agent Systems
Jan 7, 2008
·
4248 views

Feature selection, fundamentals and applications
Dec 3, 2007
·
17082 views

Learning using Many Examples
Nov 26, 2007
·
5099 views

Inference and Learning with Networked Data
Jan 15, 2008
·
4899 views

Data stream management and mining
Nov 26, 2007
·
12090 views

Summarizing Data Stream's History
Nov 26, 2007
·
4389 views

Geolocalisation in Cellular Telephone Networks
Dec 3, 2007
·
6393 views

Fitting mixtures of regression lines with the forward search: application to clu...
Dec 3, 2007
·
4046 views

Emergent patterns in large social systems
Nov 26, 2007
·
4012 views

Mining Massive Data Sets
Nov 26, 2007
·
10044 views