event thumbnail image
PASCAL Challenges Workshop 2
Pascal

Mining XML documents - Bridging the gap between Machine Learning and Information Retrieval

author: Ludovic Denoyer, University of Paris 6
You might be experiencing some problems with Your Video player.
Slides
0:01 XML Document Mining Challenge
0:28 Outline
1:12 What is XML DM Challenge ?
1:57 What is the XML DM Challenge ?
2:29 Context
4:15 How to learn with structures ?
5:13 Tasks with structured data
6:39 Context: ML and IR
8:08 ML and IR
9:28 ML and IR
10:12 Idea of the challenge
11:20 Description of the challenge
11:28 Tasks
11:49 Categorization/Clustering
12:54 Example
14:25 Example
15:10 Example
16:37 Difficulties
17:59 Structure Mapping
19:39 Difficulties
21:13 First part of the challenge
21:33 Description
22:57 Description
23:14 Example of Results (structure only)
23:56 INEX Structure+Content Categorization
24:46 Conclusion about the results
25:34 Conclusion about the first part of the challenge
26:18 For the next part
28:10 To convince Machine Learning Researchers
29:14 What is the second part ?
30:14 Wikipedia XML Corpus
31:49 Wikipedia XML Corpus for XML DM
32:33 INEX Corpus for XML DM
33:46 Structure Mapping Corpus
34:21 Currently
34:58 Conclusion

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment: