About
This workshop brings together scientists and engineers interested in recent developments in exploiting massive data sets. Emphasis is placed on available techniques and their use in security-critical applications.
Today our world is awash in data: we live in an Information Society where every action leaves a trace, generating massive amounts of data. Recent scientific developments provide technologies to exploit these huge amounts of data and extract critical information from them. Already used in many commercial applications (marketing campaigns, user profiling and recommendations on e-commerce sites, web search, user communities...), these technologies can also be used for security-critical applications (fraud and money laundering detection, intrusion detection, intelligence gathering, terrorist network detection, Web surveillance...).
It is the purpose of this workshop to review the various technologies available (data mining algorithms, social networks, crawling and indexing, text-mining, search engines, data streams) in the context of very large data sets.
The workshop will provide survey presentations and posters to help build a scientific community that is aware of security issues and of the techniques available to solve them.
Videos
Opening and Closing session

Introduction to the Workshop
Nov 26, 2007
·
3085 views

Conclusion remarks
Dec 4, 2007
·
5000 views
Interviews

Interview with Clive Best
Dec 4, 2007
·
3554 views

Interview with Françoise Fogelman Soulié
Dec 4, 2007
·
4443 views
Lectures

Combining Information Retrieval and Information Extraction for Medical Intellige...
Dec 3, 2007
·
5682 views

Plastic Card Fraud Detection using Peer Group Analysis
Dec 3, 2007
·
10175 views

The "Real World" Web Search Problem
Dec 3, 2007
·
5982 views

Evolving Networks
Sep 21, 2008
·
3253 views

Security Applications of Web mining
Dec 4, 2007
·
6443 views

Modeling rare events: online advertisement targeting using machine learning and ...
Dec 3, 2007
·
8616 views

Diffusion and Cascading Behaviour in Networks
Nov 28, 2007
·
15250 views

Detecting Money Laundering Actions Using Data Mining and Expert Systems
Dec 3, 2007
·
10246 views

Approximation algorithms for k-anonymity and privacy preservation in query logs
Dec 3, 2007
·
7868 views

Learning with structured data - structured outputs
Nov 26, 2007
·
4579 views

Information Theoretic and Algebraic Methods for Network Anomaly Detection
Nov 26, 2007
·
4868 views

Automatic detection and aggregation of name variants from large multi-lingual do...
Nov 26, 2007
·
4020 views

User logs processing using machine learning techniques
Nov 26, 2007
·
5944 views

Web Spam Detection
Nov 28, 2007
·
10269 views

Open Source Intelligence
Dec 3, 2007
·
32782 views

Statistical techniques for fraud detection, prevention, and evaluation
Dec 3, 2007
·
42050 views

Website Privacy Preservation for Query Log Publishing
Nov 29, 2007
·
6089 views

Ontologies and Machine Learning
Nov 26, 2007
·
9756 views

CiteSeerX & ChemXSeer: Lessons for Cyber-infrastructure and Web
Dec 4, 2007
·
4246 views

Learning to Extract Security-related Event Information from Large News Collectio...
Dec 3, 2007
·
3536 views

Machine Learning for Intrusion Detection
Nov 26, 2007
·
17719 views

Large-Scale Semi-Supervised Learning
Nov 26, 2007
·
5819 views

Ontogen Software Demo
Nov 26, 2007
·
9109 views

Filtering Multi-Lingual Terrorist Content with Graph-Theoretic Classification T...
Nov 26, 2007
·
4089 views

Foundations of Statistical Learning Theory : Empirical Inference in high-diment...
Dec 3, 2007
·
10266 views

Recognition and Disambiguation of geographical references in text
Nov 26, 2007
·
3332 views

Using linguistic information as features for text categorization
Nov 26, 2007
·
4850 views

Link Analysis and Text Mining : Current State of the Art and Applications for Co...
Dec 3, 2007
·
10664 views

Mining Networks through Visual Analytics:
Nov 26, 2007
·
4335 views

The Security of Mobile Agent Systems
Jan 7, 2008
·
4247 views

Feature selection, fundamentals and applications
Dec 3, 2007
·
17078 views

Learning using Many Examples
Nov 26, 2007
·
5098 views

Inference and Learning with Networked Data
Jan 15, 2008
·
4898 views

Data stream management and mining
Nov 26, 2007
·
12087 views

Summarizing Data Stream's History
Nov 26, 2007
·
4388 views

Geolocalisation in Cellular Telephone Networks
Dec 3, 2007
·
6391 views

Fitting mixtures of regression lines with the forward search: application to clu...
Dec 3, 2007
·
4046 views

Emergent patterns in large social systems
Nov 26, 2007
·
4012 views

Mining Massive Data Sets
Nov 26, 2007
·
10042 views