M-ATOLL: A Framework for the lexicalization of ontologies in multiple languages
published: Dec. 19, 2014, recorded: October 2014, views: 2087
Many tasks in which a system needs to mediate between natural language expressions and elements of a vocabulary in an ontology or dataset require knowledge about how the elements of the vocabulary (i.e. classes, properties, and individuals) are expressed in natural language. In a multilingual setting, such knowledge is needed for each of the supported languages. In this paper we present M-ATOLL, a framework for automatically inducing ontology lexica in multiple languages on the basis of a multilingual corpus. The framework exploits a set of language-specific dependency patterns which are formalized as SPARQL queries and run over a parsed corpus. We have instantiated the system for two languages: German and English. We evaluate it in terms of precision, recall and F-measure for English and German by comparing an automatically induced lexicon to manually constructed ontology lexica for DBpedia. In particular, we investigate the contribution of each single dependency pattern and perform an analysis of the impact of different parameters.
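To make the evaluation setup concrete, the following is a minimal sketch of how precision, recall and F-measure could be computed when an induced lexicon is compared against a manually constructed one. It is not the authors' evaluation code: the function name `evaluate_lexicon`, the representation of a lexical entry as a `(lemma, part of speech, property)` tuple, and the sample entries are all assumptions made for illustration, and the paper may match entries at a different granularity.

```python
def evaluate_lexicon(induced, gold):
    """Return (precision, recall, f_measure) for two collections of entries.

    Hypothetical helper: an entry counts as correct only if it appears
    verbatim in the gold lexicon (exact match is an assumption).
    """
    induced, gold = set(induced), set(gold)
    correct = len(induced & gold)
    precision = correct / len(induced) if induced else 0.0
    recall = correct / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Invented sample data; entries are (lemma, part of speech, property).
gold = {
    ("spouse", "noun", "dbo:spouse"),
    ("marry", "verb", "dbo:spouse"),
    ("wife", "noun", "dbo:spouse"),
    ("author", "noun", "dbo:author"),
}
induced = {
    ("marry", "verb", "dbo:spouse"),
    ("wife", "noun", "dbo:spouse"),
    ("write", "verb", "dbo:author"),
}

precision, recall, f_measure = evaluate_lexicon(induced, gold)
print(f"P={precision:.3f} R={recall:.3f} F={f_measure:.3f}")
```

In this toy case two of the three induced entries are in the gold lexicon, and two of the four gold entries are recovered. Restricting `induced` to the entries produced by one dependency pattern at a time would be one simple way to look at the contribution of each pattern.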