Industry 3: How Co-Occurrence can Complement Semantics?
coauthor:Borislav Popov, Ontotext Semantic Technology Lab, Sirma Group
published: Feb. 25, 2007, recorded: November 2006, views: 220
Related content
30:27
168 views - Atanas Kiryakov, 2006
01:34:18
5989 views - York Sure, 1970
01:01:41
2326 views - Kamal Nigam, 2006
03:54:31
12584 views - Chih-Jen Lin, 2006
01:00:29
3561 views - Tom Gruber, 2006
01:13:49
34 views - Kiril Ivanov Simov, 1970
01:19:12
334 views - Kalina Bontcheva, 2004
04:59:19
18249 views - Sam Roweis, 2006
01:05:19
603 views - Marko Grobelnik, 2005
20:15
120 views - Mikalai Yatskevich, 2006
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Description
Analysis of texts is an obvious way for semantic annotation and extraction of structured knowledge. A basic task is the recognition of references to entities (people, locations, organizations, etc). A next step is relation extraction, e.g. identifying that an organization is located in a particular city. Automatic extraction of such relations is a tough linguistic problem - the solutions are either very partial, expensive to implement, or slow. On the other hand, relationships are crucial for the usability of the extracted knowledge for navigation and search purposes. We demonstrate how efficient co-occurrence analysis, performed on top of semantic annotation, can be used for several purposes: relation extraction, faceted search, and popularity timelines. The faceted search interface allows an easy way for augmenting full-text search by means of entity references, derived through co-occurrence profiling and semantic relationships. Although this sort of analytics can be used in virtually any domain, their development within the KIM platform was driven by the requirements for news analysis and research. We demonstrate the usage of these interfaces on top of 1 million news articles - a corpus of the major international news for the last five years. This sort of co-occurrence analysis has the potential of aiding identity resolution, which is recognized to be a crucial problem for several tasks: cross-document co-reference resolution, record linkage, object linking, and data integration.
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !




Write your own review or comment: