Mining the Web 2.0 for Better Search

author:Ricardo Baeza-Yates, Yahoo! Research
published: May 20, 2009,   recorded: April 2009,   views: 103
You might be experiencing some problems with Your Video player.

Slides

Slides
0:00 Mining the Web 2.0 to improve Search
0:27 Agenda
1:19 Content and Metadata trends
2:22 Examples
5:28 The Wisdom of Crowds (1)
6:48 The Wisdom of Crowds (2)
7:38 The Wisdom of Crowds (3)
8:26 The Wisdom of Crowds (4)
10:02 Tag Mining - Collective Knowledge
10:33 Improving Image Search
10:48 TagExplorer
11:37 Dynamic Tag Clouds
12:04 Semantic Breakup of Tag Clouds
12:34 Tag Mining - Classification (1)
12:59 Tag Mining - Classification (2)
13:19 TagExplorer - Example
14:22 Could suggest tags: nice but ...
15:22 Dimensions of Diversity
15:48 Topical Diversity
16:09 Retrieval Performance
16:28 Use Visual Annotations
17:02 Content-based Image Retrieval
17:28 High-level search outline (1)
17:50 High-level search outline (2)
18:05 High-level search outline (3)
18:22 Evaluation
18:41 Results: Systems comparison
19:36 Bridging implicit and explicit metadata
20:21 Language, Text, Search & 'Semantics' ...
21:26 Extending metadata
21:53 Entity Containment Graph
22:28 Example: Picasso
22:39 Correlator
23:41 Correlator - Examples
24:49 Overview page
25:42 Step 1: Definitions of query concepts
25:52 Step 2: Realtions between query concepts (1/2)
26:11 Step 2: Realtions between query concepts (2/2)
26:21 Synthetic Page - example
27:01 Queries as implicit tags
28:07 Click Graph
28:32 Session (Query-Flow) Graph
28:55 Query-reformulation types
29:27 SearchPad
30:06 Research Session
30:41 Research Sessions
31:00 Implicit Folksonomy?
32:22 Implicit Knowledge? Web slang!
34:06 Experimental Evaluation
35:06 Open Issues
37:57 The Virtuous Cycle
39:14 Questions?

Related content

Visitors who watched this lecture also watched...
05:21:05
Query Log Mining

264 views - Ricardo Baeza-Yates, Fabrizio Silvestri, 2009
41:17
Reflecting on the last 20 years and looking forward to the next 20

220 views - Tim Berners Lee, 2009
47:17
DBpedia - A Linked Data Hub and Data Source for Web Applications and Enterprises

258 views - Sören Auer, Georgi Kobilarov, Christian Bizer, Jens Lehmann, 2009
29:26
Rated Aspect Summarization of Short Comments

210 views - Neel Sundaresan, Yue Lu, ChengXiang Zhai, 2009
32:15
Web infrastructure for the 21st Century

204 views - Pablo Rodriguez, 2009
27:37
Extracting Key Terms From Noisy and Multitheme Documents

144 views - Maria Grineva, Maxim Grinev, Dmitry Lizorkin, 2009
01:05:00
Challenges in Building Large-Scale Information Retrieval Systems

4100 views - Jeffrey Dean, 2009
25:48
Mining the Web to Facilitate Fast and Accurate Approximate Match

65 views - Surajit Chaudhuri, Venkatesh Ganti, Dong Xin, 2009
01:17:17
Web 20th Anniversary Panel

59 views - Tim Berners Lee, Robert Cailliau, Vinton G. Cerf, Dale Dougherty, Mike Shaver, Wendy Hall, 2009
01:20:58
The Emergence of Web Science

74 views - Ricardo Baeza-Yates, Tim Berners Lee, Michael L. Brodie, Nigel Shadbolt, 2009

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.

Description

There are several semantic sources that can be found in the Web that are either explicit, e.g. Wikipedia, or implicit, e.g. derived from Web usage data. Most of them are related to user generated content (UGC) or what is called today the Web 2.0. In this talk we show several applications of mining the wisdom of crowds behind UGC to improve search. We will show live demos to find relations in the Wikipedia or to improve image search as well as our current research in the topic. Our final goal is to produce a virtuous data feedback circuit to leverage the Web itself.

Link this page  

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: