Mining Uncertain and Probabilistic Data: problems, Challenges, Methods, and Applications

author: Jian Pei, School of Computing Science, Simon Fraser University
author: Ming Hua, Simon Fraser University
published: Sept. 26, 2008,   recorded: August 2008,   views: 11171

Related Open Educational Resources

Related content

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.

 Watch videos:   (click on thumbnail to launch)

Watch Part 1
Part 1 27:03
Watch Part 2
Part 2 28:51
Watch Part 3
Part 3 46:45
Watch Part 4
Part 4 39:13


Uncertain data are inherent in some important applications, such as environmental surveillance, market analysis, and quantitative economics research. Uncertain data in those applications are generally caused by factors like data randomness and incompleteness, limitations of measuring equipment, delayed data updates, etc. Due to the importance of those applications and the rapidly increasing amount of uncertain data collected and accumulated, analyzing and mining large collections of uncertain data have become an important task and attracted more and more interest from the data mining community. In this tutorial, we will give a systematic survey on the motivations/applications, the problems, the challenges, the fundamental principles and the state-of-the-art methods of mining uncertain and probabilistic data. We will motivate the survey with several interesting practical applications of uncertain data analysis. To set the stage, we will discuss two major models for uncertain and probabilistic data briefly. We will cover several important data mining tasks on uncertain data, including clustering, classification, frequent pattern mining and online analytical processing (OLAP). For each task, we will analyze the challenges posed by uncertain and probabilistic data and the state-of-the-art solutions.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Reviews and comments:

Comment1 xiangrui, October 22, 2008 at 5:16 p.m.:

Very good talk and PPT file for researching! :)

Comment2 rocky, February 17, 2009 at 5:15 p.m.:

Good and helpful

Comment3 VIJAY, June 16, 2009 at 12:39 p.m.:


Comment4 salman shaikh, August 3, 2010 at 12:44 p.m.:

nice talk but where is PPT???

Comment5 Mohammed Alsheky, May 12, 2012 at 11:47 p.m.:

Very Good Job !! thumb up

Comment6 H.A, October 16, 2013 at 11:16 p.m.:

please help me
I have C# Code of algorithm Frequent pattern Growth


Comment7 aamir, February 26, 2014 at 9:44 a.m.:

your video are superb but how to download video
pls tell me how to download this videos

Comment8 hobart t4m, March 5, 2021 at 2:31 p.m.:

Very good and hot girls for the chat is waiting for you if you visit

Write your own review or comment:

make sure you have javascript enabled or clear this field: