Constructing Information Networks from Text Documents
Description
A major challenge for next generation data mining systems is
creative knowledge discovery from diverse and distributed
data/knowledge sources. In this task, an important challenge is
information fusion of diverse representations into a unique
data/knowledge format. This paper focuses on the graph representation
of data/knowledge generated from text documents available on the web.
The problem addressed is how to efficiently and effectively create an
information network, named a BisoNet, from large text corpora. Several
options concerning node and arc representation are discussed, and a
case study information network is created from articles concerning
autism, downloaded from the PubMed repository of medical
publications. Open issues and lessons learned concerning representation
choices are discussed.
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !


