XML Information Retrieval
published: March 18, 2011, recorded: September 2010, views: 6844
Report a problem or upload filesIf you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Documents usually have content and structure. The content refers to the text of the document, whereas the structure refers to how a document is logically organized. An increasingly common way to encode the structure is through the use of a mark up language. Nowadays, the most widely used mark up language for representing structure is the eXtensible Mark up Language (XML). XML can be used to provide a focused access to documents, i.e. returning XML elements, such as sections and paragraphs, instead of whole documents in response to a query. Such focused strategies are of particular benefit for information repositories containing long documents, or documents covering a wide variety of topics, where users are directed to the most relevant content within a document. The increased adoption of XML to represent a document structure requires the development of tools to effectively access documents marked up in XML. This course provides a detailed description of query languages, indexing strategies, ranking algorithms, presentation scenarios developed to access XML documents. Major advances in XML information retrieval were seen from 2002 as a result of INEX, the Initiative for Evaluation of XML Retrieval. INEX, also described in this course, provided test sets for evaluating XML information retrieval effectiveness. Many of the developments and results described in this course were investigated within INEX.
Link this pageWould you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !