The 13th International Conference on Knowledge Discovery and Data Mining

Truth Discovery with Multiple Conflicting Information Providers on the Web

author: Xiaoxin Yin, Microsoft Research

Description

The World Wide Web has become the most important information source for most of us. Unfortunately, there is no guarantee that information on the web is correct. Moreover, different web sites often provide conflicting information on a subject, such as different specifications for the same product. In this paper we propose a new problem called Veracity, i.e., conformity to truth, which studies how to find true facts from a large amount of conflicting information on many subjects provided by various web sites. We design a general framework for the Veracity problem and invent an algorithm called TruthFinder, which utilizes the relationships between web sites and their information: a web site is trustworthy if it provides many pieces of true information, and a piece of information is likely to be true if it is provided by many trustworthy web sites. Our experiments show that TruthFinder successfully finds true facts among conflicting information and identifies trustworthy web sites better than popular search engines.
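The mutual dependence described above (sites are trustworthy when their facts are true, and facts are likely true when trustworthy sites provide them) can be illustrated with a small fixed-point iteration. The following is only a minimal sketch under stated assumptions, not the TruthFinder algorithm itself: the function name `truth_sketch`, the initial trust of 0.9, the iteration count, and both update rules (a site's trust as the mean confidence of its facts; a fact's confidence as the probability that not all of its providers are wrong, treating providers as independent) are illustrative choices. The names t(w) and s(f) are borrowed from the slide titles; the influence between facts mentioned there is not modeled.

```python
from collections import defaultdict


def truth_sketch(claims, initial_trust=0.9, iterations=10):
    """Toy mutual-reinforcement loop (hypothetical helper, not TruthFinder).

    claims: iterable of (site, subject, value) triples.
    Returns (trust per site, most confident value per subject).
    """
    providers = defaultdict(set)      # (subject, value) -> sites asserting it
    facts_by_site = defaultdict(set)  # site -> facts it asserts
    for site, subject, value in claims:
        fact = (subject, value)
        providers[fact].add(site)
        facts_by_site[site].add(fact)

    # Assumption: every site starts with the same trustworthiness t(w).
    trust = {site: initial_trust for site in facts_by_site}
    confidence = {}

    for _ in range(iterations):
        # s(f): probability that at least one provider is right,
        # assuming providers err independently.
        for fact, sites in providers.items():
            prob_all_wrong = 1.0
            for site in sites:
                prob_all_wrong *= 1.0 - trust[site]
            confidence[fact] = 1.0 - prob_all_wrong

        # t(w): mean confidence of the facts the site provides.
        for site, facts in facts_by_site.items():
            trust[site] = sum(confidence[f] for f in facts) / len(facts)

    # For each subject, keep the value with the highest confidence.
    best = {}
    for (subject, value), score in confidence.items():
        if subject not in best or score > best[subject][1]:
            best[subject] = (value, score)
    truths = {subject: value for subject, (value, _) in best.items()}
    return trust, truths


# Made-up example data: three sites disagree about two subjects.
claims = [
    ("site_a", "book1_author", "X"),
    ("site_b", "book1_author", "X"),
    ("site_c", "book1_author", "Y"),
    ("site_a", "book2_author", "P"),
    ("site_c", "book2_author", "Q"),
]
trust, truths = truth_sketch(claims)
print(truths)
print(trust)
```

In this toy data, site_a gains trust because site_b corroborates it on the first subject, and that extra trust then tips the otherwise tied second subject in its favor. Without some form of dampening, the confidence scores in this simplified scheme drift toward 1, so it should be read as an illustration of the idea rather than a usable ranking method.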

Slides
0:00 Truth Discovery with Multiple Conflicting Information Providers
0:35 Trustworthiness of the Web
1:16 Conflicting Information on the Web
2:15 Our Problem Setting
2:47 Basic Heuristics for Problem Solving
3:47 Overview of Our Method
4:28 Analogy to Authority-Hub Analysis
5:13 An Example
5:41 Computation Model (1): t(w) and s(f)
5:54 Computation Model (2): Fact Influence
6:11 Computation Model (3): Influence Function
6:37 Experiments: Finding Truth of Facts
7:31 Experiments: Trustable Info Providers
8:14 Conclusions
8:47 Thank you!
