Europe Media Monitor (EMM) System
published: Sept. 20, 2010, recorded: September 2010, views: 6261
Report a problem or upload filesIf you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
The Europe Media Monitor (EMM) is the text gathering and analysis engine underlying a number of European media monitoring and other information analysis applications (e.g. EMM (http://emm.newsbrief.eu), MediSys (http://medisys.newsbrief.eu) ) that are serving EU policies especially those concerned with crisis management. The EMM engine consists of a growing number of text analysis and information processing modules currently performing the following tasks: language detection, known entity extraction, geo-tagging, sentiment analysis, categorization, duplicate detection, clustering, event detection and indexing. The system has a number of information aggregation modules to present the analysis results per category, per country, as a story etc. In the case of the EMM NewsBrief, the system harvests and analyses around 100.000 news articles per day in 40 languages from around 5500 RSS feeds and HTML pages, and categorizes these in approximately 1000 different categories defined by 35000 different keywords and keyword combinations. The system is developed and operated by the European Commission's Joint Research Centre (JRC). The presentation will focus on the history, development and architecture of the system.
Link this pageWould you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !