event thumbnail image
RuSSIR - Russian Summer School in Information Retrieval

Text Mining, Information and Fact Extraction (TMIFE)

author: Marie-Francine Moens, Catholic University of Leuven

Description

communities (medical informatics, security, blog and news analysis, business information analysis, legal informatics, etc.). ?Still, today it is a somewhat fragmented subfield of human language technologies and information retrieval where the themes of (often forgotten) old-style pattern-based IE and more recent machine learning techniques, as applied in medical informatics, opinion mining and blog extraction, are scattered in various conferences and sessions (computational linguistics, artificial intelligence, machine learning, Web technologies, semantic computing). The aim of this tutorial is to explain important technologies from handcrafted patterns to learning, and especially focus on how they blend together in order to suit the needs of current information systems that retrieve or mine information, or that make decisions and solve problems based on the extracted information. This unified perspective also entails valuable insights into the role of traditional pipelined system architectures and more recent probabilistic inference techniques. Probabilistic extraction, by which text is translated into a variety of semantic labels, pe"../slides/rfectly integrates with probabilistic retrieval models that naturally combine surface text features and semantic labels in ranking computations, among which are the popular language retrieval models. Finally, information extraction alleviates the knowledge acquisition bottleneck in expert and question answering systems technology that operate in more restricted subject domains. We conclude with some pointers to new challenges among which are the recognition of complex semantic concepts (e.g., narrative scripts, or issues such as medical malpractice or competitiveness) in texts. Because of the reconciling aspects of the many techniques and application domains, the tutorial will attract students and researchers with different backgrounds.

You might be experiencing some problems with Your Video player.
Slides
0:00 Text Mining, Information and Fact Extraction Part 1: Introduction and Symbolic Techniques
1:31 Text
2:11 Text ... about
2:33 Sharapova beats Ivanovic to win Australian Open
4:07 Mining
4:10 To mine
4:50 Information extraction
5:12 “Information extraction is the identification..."
6:35 Fact extraction
6:38 What is fact?
6:42 Aim of the course
7:36 Overview of the course
8:53 Overview of part 1
9:52 Why do we need IE?
10:37 How good is the machine already?
10:54 Named entity recognition
13:21 Noun phrase coreference resolution
14:34 Relation recognition
14:39 Event detection and linking
15:27 Location, job offer, sports, non-spam...
17:05 Information extraction: more examples
17:45 Birthday party, happyness,...
18:53 Information extraction from text
21:14 Role of natural language processing
22:22 SWARM INTELLIGENCE
22:40 Lexical features part1
24:14 Lexical features part2
25:07 Lexical features part3
25:34 SWARM INTELLIGENCE
27:30 Morphological transformations part1
27:34 Morphological transformations part2
27:59 Morphological transformations part1
28:21 Morphological transformations part2
28:23 POS tagging and sentence parsing part1
28:37 POS tagging and sentence parsing part2
29:03 POS tagging and sentence parsing part1
30:37 POS tagging and sentence parsing part2
33:10 POS tagging and sentence parsing part3
34:36 Other natural language features
35:30 The role of machine learning
36:16 Supervised learning
36:43 Examples...
36:50 Examples...
38:47 Supervised learning
40:58 Unsupervised learning
41:19 Similar objects are grouped... example
41:55 Weakly supervised learning
42:49 Similar objects are grouped... example
42:54 Weakly supervised learning
45:08 In this course
47:16 Information extraction
50:17 Evaluation: confusion matrix part1
51:07 Evaluation: confusion matrix part2
53:06 Evaluation: confusion matrix part3
54:48 Evaluation: F-measure
56:59 ROC curve
57:20 The symbolic approaches part1
58:09 The symbolic approaches part2
58:17 The symbolic approaches part3
61:00 Early origin part1
62:03 Script: human (X) taking the bus to go from LOC1 to LOC3
62:19 Early origin part1
63:57 Early origin part2
65:28 Early origin part1
65:41 Script: human (X) taking the bus to go from LOC1 to LOC3
67:57 Frame-based approaches part1
69:30 Frame-based approaches part2
70:33 Frame-based approaches part3
71:03 Script: human (X) taking the bus to go from LOC1 to LOC3
71:45 Frame-based approaches part4
72:41 FASTUS
74:38 Cascade of finite state transducers
74:45 Example sentence
75:23 Step 2
75:28 Cascade of finite state transducers
76:03 Step 2
76:40 Cascade of finite state transducers
77:26 Step 4
78:25 Step 2
78:29 Cascade of finite state transducers
79:05 Step 4
79:17 Cascade of finite state transducers
80:36 Symbolic techniques: results
80:50 Table 2: Maximum Results...
83:00 What to learn from the symbolic techniques?
84:41 Today
85:10 References

Lecture rating

People found this lecture:
Worth seeing
because it is:
 Valuable and informative
Well presented
Easily understandable
Acceptably recorded
You need to login to cast your vote.

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.

 Watch videos:   (click on thumbnail to launch)

Watch Part 1
Part 1 1:25:34
Flash video Slides Windows Media video

!NOW PLAYING
Watch Part 2
Part 2 1:21:47
Flash video Slides Windows Media video
Watch Part 3
Part 3 1:32:56
Flash video Slides Windows Media video
Watch Part 4
Part 4 1:46:16
Flash video Slides Windows Media video
Watch Part 5
Part 5 1:38:13
Flash video Slides Windows Media video

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: