About
This Workshop brings together scientists and engineers interested in recent developments in exploiting Massive Data Sets. Emphasis is placed on available techniques and their use in security-critical applications.
Today our world is awash in data: we live in an Information Society where every action leaves a trace, generating massive amounts of data. Recent scientific developments provide technologies to exploit these huge amounts of data and extract critical information from them. Used today in many commercial applications (marketing campaigns, user profiling and recommendations on e-commerce sites, web search, user communities...), these technologies can also be used for security-critical applications (fraud and money laundering detection, intrusion detection, intelligence gathering, terrorist network detection, Web surveillance...).
The purpose of this workshop is to review the various technologies available (data mining algorithms, social networks, crawling and indexing, text mining, search engines, data streams) in the context of very large data sets.
The workshop will provide survey presentations and posters to help build a scientific community aware of security issues and of the techniques for solving them.
Videos
Opening and Closing session

Introduction to the Workshop
Nov 26, 2007
·
3087 views

Conclusion remarks
Dec 4, 2007
·
5006 views
Interviews

Interview with Clive Best
Dec 4, 2007
·
3558 views

Interview with Françoise Fogelman Soulié
Dec 4, 2007
·
4447 views
Lectures

Combining Information Retrieval and Information Extraction for Medical Intellige...
Dec 3, 2007
·
5686 views

Plastic Card Fraud Detection using Peer Group Analysis
Dec 3, 2007
·
10190 views

The "Real World" Web Search Problem
Dec 3, 2007
·
5991 views

Evolving Networks
Sep 21, 2008
·
3266 views

Security Applications of Web mining
Dec 4, 2007
·
6448 views

Modeling rare events: online advertisement targeting using machine learning and ...
Dec 3, 2007
·
8623 views

Diffusion and Cascading Behaviour in Networks
Nov 28, 2007
·
15274 views

Detecting Money Laundering Actions Using Data Mining and Expert Systems
Dec 3, 2007
·
10255 views

Approximation algorithms for k-anonymity and privacy preservation in query logs
Dec 3, 2007
·
7870 views

Learning with structured data - structured outputs
Nov 26, 2007
·
4588 views

Information Theoretic and Algebraic Methods for Network Anomaly Detection
Nov 26, 2007
·
4880 views

Automatic detection and aggregation of name variants from large multi-lingual do...
Nov 26, 2007
·
4025 views

User logs processing using machine learning techniques
Nov 26, 2007
·
5949 views

Web Spam Detection
Nov 28, 2007
·
10290 views

Open Source Intelligence
Dec 3, 2007
·
32854 views

Statistical techniques for fraud detection, prevention, and evaluation
Dec 3, 2007
·
42063 views

Website Privacy Preservation for Query Log Publishing
Nov 29, 2007
·
6105 views

Ontologies and Machine Learning
Nov 26, 2007
·
9759 views

CiteSeerX & ChemXSeer: Lessons for Cyber-infrastructure and Web
Dec 4, 2007
·
4261 views

Learning to Extract Security-related Event Information from Large News Collectio...
Dec 3, 2007
·
3539 views

Machine Learning for Intrusion Detection
Nov 26, 2007
·
17732 views

Large-Scale Semi-Supervised Learning
Nov 26, 2007
·
5835 views

Ontogen Software Demo
Nov 26, 2007
·
9118 views

Filtering Multi-Lingual Terrorist Content with Graph-Theoretic Classification T...
Nov 26, 2007
·
4094 views

Foundations of Statistical Learning Theory: Empirical Inference in high-diment...
Dec 3, 2007
·
10276 views

Recognition and Disambiguation of geographical references in text
Nov 26, 2007
·
3337 views

Using linguistic information as features for text categorization
Nov 26, 2007
·
4863 views

Link Analysis and Text Mining : Current State of the Art and Applications for Co...
Dec 3, 2007
·
10670 views

Mining Networks through Visual Analytics:
Nov 26, 2007
·
4342 views

The Security of Mobile Agent Systems
Jan 7, 2008
·
4253 views

Feature selection, fundamentals and applications
Dec 3, 2007
·
17099 views

Learning using Many Examples
Nov 26, 2007
·
5102 views

Inference and Learning with Networked Data
Jan 15, 2008
·
4907 views

Data stream management and mining
Nov 26, 2007
·
12091 views

Summarizing Data Stream's History
Nov 26, 2007
·
4400 views

Geolocalisation in Cellular Telephone Networks
Dec 3, 2007
·
6395 views

Fitting mixtures of regression lines with the forward search: application to clu...
Dec 3, 2007
·
4049 views

Emergent patterns in large social systems
Nov 26, 2007
·
4021 views

Mining Massive Data Sets
Nov 26, 2007
·
10056 views