About
This Workshop brings together scientists and engineers interested in recent developments in exploiting Massive Data Sets. Emphasis is placed on available techniques and their application to security-critical applications. \ Today our world is awash in data and we live in an Information Society where every action leaves a trace, generating massive amounts of data. Recent scientific developments provide technologies to exploit these huge amounts of data and extract from it critical information. Used today in many commercial applications (marketing campaigns, user profiling and recommendations on e-commerce sites, web search,users communitiee...), these technologies can also be used for security-critical applications (fraud detection and money laundering, intrusions detection, intelligence gathering, terrorist networks detection, Web surveillance...).
It is the purpose of this workshop to review the various technologies available (data mining algorithms, social networks, crawling and indexing, text-mining, search engines, data streams) in the context of very large data sets.
The workshop will provide survey presentations & posters to help building a scientific community aware of security issues and techniques to solve them.
Videos
Opening and Closing session

Introduction to the Workshop
Nov 26, 2007
·
3089 views

Conclusion remarks
Dec 4, 2007
·
5007 views
Interviews

Interview with Clive Best
Dec 4, 2007
·
3560 views

Interview with Françoise Fogelman Soulié
Dec 4, 2007
·
4447 views
Lectures

Combining Information Retrieval and Information Extraction for Medical Intellige...
Dec 3, 2007
·
5686 views

Plastic Card Fraud Detection using Peer Group Analysis
Dec 3, 2007
·
10190 views

The "Real World" Web Search Problem
Dec 3, 2007
·
5992 views

Evolving Networks
Sep 21, 2008
·
3267 views

Security Applications of Web mining
Dec 4, 2007
·
6450 views

Modeling rare events: online advertisement targeting using machine learning and ...
Dec 3, 2007
·
8628 views

Diffusion and Cascading Behaviour in Networks
Nov 28, 2007
·
15278 views

Detecting Money Laundering Actions Using Data Mining and Expert Systems
Dec 3, 2007
·
10257 views

Approximation algorithms for k-anonymity and privacy preservation in query logs
Dec 3, 2007
·
7871 views

Learning with structured data - structured outputs
Nov 26, 2007
·
4590 views

Information Theo-retic and Alge-braic Methods for Network Anomaly Detection
Nov 26, 2007
·
4882 views

Automatic detection and aggregation of name variants from large multi-lingual do...
Nov 26, 2007
·
4027 views

User logs processing using machine learning techniques
Nov 26, 2007
·
5950 views

Web Spam Detection
Nov 28, 2007
·
10294 views

Open Source Intelligence
Dec 3, 2007
·
32864 views

Statistical techniques for fraud detection, prevention, and evaluation
Dec 3, 2007
·
42068 views

Website Privacy Preservation for Query Log Publishing
Nov 29, 2007
·
6108 views

Ontologies and Machine Learning
Nov 26, 2007
·
9760 views

CiteSeerX & ChemXSeer: Lessons for Cyber-infrastructure and Web
Dec 4, 2007
·
4263 views

Learning to Extract Security-related Event Information from Large News Collectio...
Dec 3, 2007
·
3539 views

Machine Learning for Intrusion Detection
Nov 26, 2007
·
17732 views

Large-Scale Semi-Supervised Learning
Nov 26, 2007
·
5838 views

Ontogen Software Demo
Nov 26, 2007
·
9120 views

Filtering Multi-Lingual Terrorist Content with Graph-Theoretic Classifi-cation T...
Nov 26, 2007
·
4096 views

Foundations of Statistical Learning Theory : Empirical Infe-rence in high-diment...
Dec 3, 2007
·
10283 views

Recognition and Disambiguation of geographical references in text
Nov 26, 2007
·
3337 views

Using linguistic information as features for text categorization
Nov 26, 2007
·
4865 views

Link Analysis and Text Mining : Current State of the Art and Applications for Co...
Dec 3, 2007
·
10670 views

Mining Networks through Visual Analytics:
Nov 26, 2007
·
4342 views

The Security of Mobile Agent Systems
Jan 7, 2008
·
4255 views

Feature selection, fundamentals and applications
Dec 3, 2007
·
17107 views

Learning using Many Examples
Nov 26, 2007
·
5102 views

Inference and Learning with Networked Data
Jan 15, 2008
·
4910 views

Data stream management and mining
Nov 26, 2007
·
12092 views

Summarizing Data Stream's History
Nov 26, 2007
·
4402 views

Geolocalisation in Cellular Telephone Networks
Dec 3, 2007
·
6395 views

Fitting mixtures of regression lines with the forward search: application to clu...
Dec 3, 2007
·
4049 views

Emergent patterns in large social systems
Nov 26, 2007
·
4021 views

Mining Massive Data Sets
Nov 26, 2007
·
10061 views