SEARCH RESULTS:

Search: retrieval - Matches: 328


Event sections:


 event section Choice Models and Preference Learning
The workshop is motivated by the two following lines of research: 1. Large scale preference learning with sparse data: There has been a great interest and take-up of machine learning techniques for preference learning in learning to rank, information retrieval ...
 event section Discrete Optimization in Machine Learning
Fortunately, most discrete optimization problems that arise in machine learning have specific structure, which can be leveraged in order to develop tractable exact or approximate optimization procedures. For example, consider the case of a discrete graphical model over a set ...
 event section Large Scale Graphical Models
Approximate Learning of Large Scale Graphical Models: Theory and Applications Undirected graphical models provide a powerful framework for representing dependency structure between random variables. Learning the parameters of undirected models plays a crucial role in solving key problems in many ...
 event section Semantics in digital content
Overall objective and topics to be addressed: The aim of the session was to show recent developments in semantic technologies for digital content, with a specific focus on 3D. It stems from the activities of 2 initiatives: the SMaRT society, ...
 event section xLiTe: Cross-Lingual Technologies
Automatic text understanding has been an unsolved research problem for many years. This partially results from the dynamic and diverging nature of human languages, which ultimately results in many different varieties of natural language. These variations range from the individual ...

Events:


 event 10th Extended Semantic Web Conference (ESWC), Montpellier 2013
The ESWC is a major venue for discussing the latest scientific results and technology innovations around semantic technologies. Building on its past success, ESWC is seeking to broaden its focus to span other relevant research areas in which Web semantics ...
 event 1st Russian Summer School in Information Retrieval (RuSSIR), Ekaterinburg 2007
The 1st Russian Summer School in Information Retrieval (RuSSIR) is initiated in order to make students familiar with a wide range of modern problems and methods in Information Retrieval; to stimulate scientific research in the field of Information Retrieval and ...
 event 1st Spring School on Social Media Retrieval (S3MR), Interlaken 2010
Introduction Multimedia content has become ubiquitous on the web, creating new challenges for indexing, access, search and retrieval. At the same time, much of this content is made available on content sharing websites like YouTube or Flickr, or shared on ...
 event 1st Workshop on Automated Knowledge Based Construction (AKBC), Grenoble 2010
Good decision-making is dependent on comprehensive, accurate knowledge. But the information relevant to many important decisions in areas such as business, government, medicine and scientific research is massive, and growing at an accelerating pace. Relevant raw data is widely available ...
 event 26th Annual Conference on Learning Theory (COLT), Princeton 2013
The conference is a single track meeting that includes invited talks as well as oral presentations of all refereed papers. We invited submissions of papers addressing theoretical aspects of machine learning and related topics. We strongly support a broad definition ...
 event 27th Annual Conference on Learning Theory (COLT), Barcelona 2014
The conference strongly supports a broad definition of learning theory, including, but not limited to: • Design and analysis of learning algorithms and their generalization ability • Computational complexity of learning • Optimization procedures for learning • Unsupervised, semi-supervised learning, ...
 event 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, Edinburgh 2005
The topics covered by the workshop are the following: * human-human communication modeling * speech and visual processing * multi-modal processing, fusion and fission * multi-modal dialog modeling * human-human interaction modeling * multi-modal data structuring and presentation * multimedia ...
 event 2nd Russian Summer School in Information Retrieval (RuSSIR), Taganrog 2008
The 2nd Russian Summer School in Information Retrieval (RuSSIR) is initiated in order to make students familiar with a wide range of modern problems and methods in Information Retrieval; to stimulate scientific research in the field of Information Retrieval and ...
 event 2nd Summer School on Social Media Retrieval (S3MR), Antalya 2011
Multimedia content has become ubiquitous on the web, creating new challenges for indexing, access, search and retrieval. At the same time, much of this content is made available on content sharing websites such as YouTube or Flickr, or shared on ...
 event 3rd Russian Summer School in Information Retrieval (RuSSIR), Petrozavodsk 2009
The 3rd Russian Summer School in Information Retrieval will be held September 11-16, 2009 in Petrozavodsk, Russia. The school is co-organized by the Russian Information Retrieval Evaluation Seminar (ROMIP), Petrozavodsk State University, and Karelian Research Center, RAS. The first and ...
 event 4th International Conference on Educational Data Mining
The 4th International Conference on Educational Data Mining (EDM 2011) brings together researchers from computer science, education, psychology, psychometrics, and statistics to analyze large datasets to answer educational research questions. Recently, the increase in dissemination of interactive learning environments, learning ...
 event 4th Russian Summer School in Information Retrieval (RuSSIR), Voronezh 2010
The 4th Russian Summer School in Information Retrieval was held on September 13-18, 2010 in Voronezh, Russia. The school was co-organized by the Russian Information Retrieval Evaluation Seminar (ROMIP) and Voronezh State University. The previous RuSSIRs took place in Ekaterinburg, ...
 event 5th Annual European Semantic Web Conference (ESWC), Tenerife 2008
The vision of the Semantic Web is to enhance today's Web by exploiting machine-processable metadata. The explicit representation of the semantics of data, enriched with domain theories (ontologies), will enable a web that provides a qualitatively new level of service. ...
 event 6th Annual European Semantic Web Conference (ESWC), Hersonissos 2009
The vision of the Semantic Web is to enhance today's Web by exploiting machine-processable metadata. The explicit representation of the semantics of data, enriched with domain theories (ontologies), will enable a web that provides a qualitatively new level of service. ...
 event 6th Russian Summer School in Information Retrieval (RuSSIR), Yaroslavl 2012
The 6th Russian Summer School in Information Retrieval (RuSSIR 2012) was held on August 6-10, 2012 in Yaroslavl, Russia. The school was co-organized by Yaroslavl Demidov State University and Russian Information Retrieval Evaluation Seminar (ROMIP) with support from the MUMIA ...
 event 7th Extended Semantic Web Conference (ESWC), Heraklion 2010
The mission of the Extended Semantic Web Conference (ESWC 2010) is to bring together researchers and practitioners dealing with different aspects of semantics on the Web. ESWC 2010 builds on the success of the former European Semantic Web Conference series, but ...
 event 8th Extended Semantic Web Conference (ESWC) 2011
The mission of the Extended Semantic Web Conference is to bring together researchers and practitioners dealing with different aspects of semantic technologies. Following a successful re-launch in 2010 as a multi-track conference, ESWC 2011 builds on the success of the ...
 event 8th International Summer School on Information Retrieval
The 8th International Summer School on Information Retrieval offers high quality teaching on Information Retrieval and related research topics to a broad audience of computer scientists and research students working on topics related to management of information on the Internet, ...
 event 9th Extended Semantic Web Conference (ESWC), Heraklion 2012
The Extended Semantic Web Conference (ESWC) is a major venue for discussing the latest scientific results and technology innovations around semantic technologies. Building on its past success, ESWC is seeking to broaden its focus to span other relevant research areas ...
 event ACM 17th Conference on Information and Knowledge Management (CIKM), Napa Valley 2008
Since 1992, the ACM Conference on Information and Knowledge Management (CIKM) has successfully brought together leading researchers and developers from the database, information retrieval, and knowledge management communities. The purpose of the conference is to identify challenging problems facing the ...
 event Autumn School 2006: Machine Learning over Text and Images - Pittsburgh
Machine learning approaches to natural language processing problems such as information retrieval, document classification, and information extraction have developed rapidly over recent years. Even more recently, the joint analysis of text and images has become a significant focus for machine ...
 event Fourth ACM International Conference on Web Search and Data Mining - WSDM 2011
WSDM (pronounced “wisdom”) is the premier international ACM conference covering research in the areas of search and data mining on the Web. The 4th ACM WSDM Conference took place in Hong Kong, during February 9-12, 2011. WSDM publishes original, high ...
 event Joint DELOS-MUSCLE Summer School on Multimedia digital libraries, Machine learning and cross-modal technologies for access and retrieval
The aim of the school is to promote advanced experiences and implementations and, more in general, to improve knowledge in these fields. The school will be open to researchers and practitioners who are interested in audio-visual and multimedia digital libraries ...
 event Joint AMI/PASCAL/IM2/M4 Workshop on Multimodal Interaction and Related Machine Learning Algorithms, Martigny 2004
AMI (Augmented Multiparty Interaction, http://www.amiproject.org) is a newly launched (January 2004) European Integrated Project (IP) funded under Framework FP6 as part of its IST program. AMI targets computer enhanced multi-modal interaction in the context of meetings. The project aims at ...
 event National Initiatives on Multimedia Content Description and Retrieval
This highly successful Workshop was organised in cooperation with the European Commission. The event brought together the technical, administrative and financial representatives of the various national initiatives, which have been established recently in some European countries to support research and ...
 event NIPS Workshop on Learning from Multiple Sources, Whistler 2008
While the machine learning community has primarily focused on analysing the output of a single data source, there have been relatively few attempts to develop a general framework, or heuristics, for analysing several data sources in terms of a shared ...
 event NIPS Workshop on Learning to Compare Examples, Whistler 2006
The identification of an effective function to compare examples is essential to several machine learning problems. For instance, retrieval systems entirely depend on such a function to rank the documents with respect to their estimated similarity to the submitted query. ...
 event NIPS Workshop on Music, Brain and Cognition, Whistler 2007
Music is one of the most widespread of human cultural activities, existing in some form in all cultures throughout the world. The definition of music as organised sound is widely accepted today but a naïve interpretation of this definition may ...
 event PetaMedia Industry Workshop, Saint Malo 2009
The NEM European Technology Platform and the PetaMedia Network of Excellence are very pleased to announce the PetaMedia Industry Workshop, to be held within the NEM Summit on September 30, 2009. PetaMedia is an EU network of excellence dedicated to ...
 event Query Log Mining Tutorial, Madrid 2009
Web search engines have stored information about users in their logs since they started to operate. This information often serves many purposes. The primary focus of this tutorial is to introduce the discipline of query mining by showing its ...
 event Second ACM International Conference on Web Search and Data Mining - WSDM 2009
WSDM (pronounced "wisdom") is a young ACM conference intended to be the publication venue for research in the areas of search and data mining. Indeed, the pace of innovation in these areas prevents proper coverage by conferences of broader scope. ...
 event SMART Workshop, Grenoble 2007
More than half of the EU citizens are not able to hold a conversation in a language other than their mother tongue, let alone to conduct a negotiation, or interpret a law. In a time of wide availability of communication ...
 event Statistical Multilingual Analysis for Retrieval and Translation (SMART) Dissemination Workshop, Barcelona 2009
The aim of this workshop is to disseminate scientific results produced by the SMART project to the larger technical and scientific community working on Statistical Machine Translation. To facilitate this inter-exchange, it will be co-located with EAMT 2009 - 13th ...
 event The 6th Workshop on Algorithms and Models for the Web Graph (WAW2009)
The World Wide Web has become part of our everyday life, and information retrieval and data mining on the Web are now of enormous practical interest. The algorithms supporting these activities combine the view of the Web as a text ...
 event Third ACM International Conference on Web Search and Data Mining - WSDM 2010
WSDM (pronounced "wisdom") is a young ACM conference intended to be the publication venue for research in the areas of search and data mining. Indeed, the pace of innovation in these areas prevents proper coverage by conferences of broader scope. ...
 event xLiTe Workshop: Cross-Lingual Technologies, Lake Tahoe 2012
Automatic text understanding has been an unsolved research problem for many years. This partially results from the dynamic and diverging nature of human languages, which ultimately results in many different varieties of natural language. These variations range from the individual ...

Projects:


 project SMART - Statistical Multilingual Analysis for Retrieval and Translation
Check the SMART web page (http://www.smart-project.eu/node/1). What is the SMART project? More than half of the EU citizens are not able to hold a conversation in a language other than their mother tongue, let alone to conduct a negotiation, or ...
 project The ACM Conference Series - ACM International Conference on Web Search and Data Mining
WSDM (pronounced "wisdom") is a young ACM conference intended to be the publication venue for research in the areas of search and data mining. Indeed, the pace of innovation in these areas prevents proper coverage by conferences of broader scope. ...

Best paper:


 best paper Anja Pilz: Entity Disambiguation using Relations extracted from Wikipedia
We present an approach for the disambiguation of textual mentions of ambiguous names: disambiguation means here the identification of the true entity denoted by a name phrase appearing in a query context through its assignment to the corresponding Wikipedia article. ...

Demonstration videos:


 demonstration video Joshua T. Abbott: Human memory search as a random walk in a semantic network
The human mind has a remarkable ability to store a vast amount of information in memory, and an even more remarkable ability to retrieve these experiences when needed. Understanding the representations and algorithms that underlie human memory search could potentially ...
 demonstration video Samuel Kaski: Interactive NeRV: Interactive Visualization Optimized for Information Retrieval
 demonstration video LarKC Promotional video
The aim of the EU FP 7 Large-Scale Integrating Project LarKC is to develop the Large Knowledge Collider (LarKC, for short, pronounced “lark”), a platform for massive distributed incomplete reasoning that will remove the scalability barriers of currently existing reasoning ...
 demonstration video Lorand Dali: Learning to Rank for User Personalized News Article Retrieval
 demonstration video Rosa Meo: MetaData Retrieval: A Software Prototype for the Annotation of Maps with Social Metadata
MetaData Retrieval (MDR) is a software module for the enrichment of geo-referenced maps with metadata. Metadata are annotations on spatial locations that are taken from Volunteered Geographic Information projects like OpenStreetMap and GeoNames. The MDR user acts with a ...
 demonstration video Ksenia Konyushkova: Relevance Feedback Content-Based Image Retrieval with Hierarchical Gaussian Process Bandits
A content-based image retrieval system based on relevance feedback is proposed. The system relies on an interactive search paradigm where at each round a user is presented with k images and selects the one closest to her target. The approach ...
 demonstration video Dejan Pangerčič: Shopping for Groceries and Putting Them Away
The video shows TUM-James (a PR2 robot) simulating the shopping task by bringing the groceries home and placing them in accordance with a priori learnt organizational patterns in households. The robot uses the 3D perception algorithms from point cloud library ...

Invited talks:


 invited talk David Ferrucci: Beyond Jeopardy! The Future of Watson
Computer systems that directly and accurately understand and answer people’s questions over a broad domain of human knowledge have been envisioned by scientists and writers since the advent of computers themselves. Toy solutions are easy to create when the knowledge ...
 invited talk David Ferrucci: Building Watson: An Overview of DeepQA for the Jeopardy! Challenge
Computer systems that can directly and accurately answer people’s questions over a broad domain of human knowledge have been envisioned by scientists and writers since the advent of computers themselves. Open domain question answering holds tremendous promise for facilitating informed ...
 invited talk Bettina Hoser: Collective Semantics: Collective Intelligence & the Semantic Web - Information Retrieval vs. Knowledge Retrieval: A social network perspective
Web 2.0 has introduced a new style of information sharing featuring mass user participation, social networking, heterogeneity of data sources, and a huge scale of information and knowledge, posing difficulties in discovering relevant information. The Semantic Web may contribute by providing ...
 invited talk Geoffrey E. Hinton: Deep Learning with Multiplicative Interactions
Deep networks can be learned efficiently from unlabeled data. The layers of representation are learned one at a time using a simple learning module that has only one layer of latent variables. The values of the latent variables of one ...
 invited talk Jörg Wurzer: Driving Business Value with Linked Enterprise Data
Linked Enterprise Data goes beyond the Semantic Web by including any data sources inside and outside of business processes. The presentation shows how Linked Enterprise Data not only streamlines IT infrastructure but also enables automatic information logistics to combine process management ...
 invited talk Hugo Zaragoza: Improving Search with Semantic Technologies: Current Research Directions
Search engines play a major role in the success and growth of the WWW. In doing so they in turn help shape the web: they create new business models, modify content creation and consumption practices, support new forms of user ...
 invited talk Ivan Titov: Inducing Cross-Lingual Semantic Representations of Words, Phrases, Sentences and Events
Cross-lingual representations of linguistic units (e.g., words or phrases) can facilitate transfer of annotation from resource-rich to resource-poor languages and have many potential multilingual applications (e.g., machine translation and crosslingual information retrieval). In this talk, I will discuss our ongoing ...
 invited talk Samuel Kaski: Learning and retrieval from multiple sources
 invited talk Fernando C. N. Pereira: Low-Pass Semantics
Advances in statistical and machine learning approaches to natural language processing have yielded a wealth of methods and applications in information retrieval, speech recognition, machine translation, and information extraction. Yet, even as we enjoy these advances, we recognize that our ...
 invited talk Marko Grobelnik: Many Faces of Text Processing
Why do people process text with computers? It all started many years ago, with the main goal in the minds of researchers being to understand the text. In the meantime, the area of text processing developed in many different directions whereby the original ...
 invited talk Ramon López de Mántaras: Playing with Cases: Rendering Expressive Music Performance with Case-Based Reasoning
Rendering expressive music performances involves complex processes that constitute a challenging research area for computer music research. Besides, it is a rich field for investigating aspects of human intelligence, emotion, and creativity. Case-based reasoning is one of the AI techniques ...
 invited talk Victor Lesser: Scaling AI Through Multi-Agent Organizations
Scaling remains one of the grand challenges for AI. Lesser has been using organizational control to build multiagent systems with hundreds to thousands of intelligent agents. This approach can also be used to structure complex AI systems with extensive and ...
 invited talk Kia Teymourian: Self-organization in Distributed Semantic Repositories
Principles from nature-inspired self-organization can help to attack the massive scalability challenges in future internet infrastructures. We investigated ant-like mechanisms for clustering semantic information. We outline algorithms to store related information within clusters to facilitate efficient and scalable retrieval. ...
 invited talk Jane Malin: Semantic Annotation of Complex Text Structures in Problem Reports
Text analysis is important for effective information retrieval from databases where the critical information is embedded in text fields. Aerospace safety depends on effective retrieval of relevant and related problem reports for the purpose of trend analysis. The complex text ...
 invited talk Alan F. Smeaton: Video Semantics and the Sensor Web
The most widespread way in which content-based access to video information is supported is through using a combination of video metadata (date, time, format, etc.) and user-generated description (user tags, ratings, reviews, etc.). This has had widespread usage and is ...
 invited talk Barney Pell: Why users need semantic search
While users' dependence on search continues to increase, user satisfaction is not improving. This is partly because search is hard, and partly because users are becoming more demanding and pushing search beyond the traditional scope of information retrieval. Our research ...

Keynotes:


 keynote Jeffrey Dean: Challenges in Building Large-Scale Information Retrieval Systems
Building and operating large-scale information retrieval systems used by hundreds of millions of people around the world provides a number of interesting challenges. Designing such systems requires making complex design tradeoffs in a number of dimensions, including (a) the number ...
 keynote Hugo Zaragoza: Correlator: things we did, things we should do, and things we don't know how to
Correlator (http://sandbox.yahoo.com/Correlator) is a demo showcasing work developed at Yahoo! Research Barcelona in the areas of information extraction, retrieval and visualization. I will use this and other Yahoo! demos during my talk to discuss some of the technologies used, to ...
 keynote Ariadna Quattoni: Latent Variable Models for Content-Based Image Retrieval and Structure Prediction
In the first part of the talk I will present recent work on learning latent variable models for content-based image retrieval. To learn a function that predicts the relevance of a database image to an image query all that we ...
 keynote Fernando C. N. Pereira: Low-Pass Semantics
Advances in statistical and machine learning approaches to natural language processing have yielded a wealth of methods and applications in information retrieval, speech recognition, machine translation, and information extraction. Yet, even as we enjoy these advances, we recognize that our ...
 keynote Ramesh Jain: Multimedia Semantic Web
The Capture, Storage, Sharing, Organizing, Retrieval, and Use of knowledge dominate most socio-economic activities in our society. Most of the knowledge in the world is initially captured and stays in the form of experiences in different sensing modalities. Current technology ...
 keynote Stan Sclaroff: People In Motion: Pose, Action and Communication
This talk will give an overview of some of the research in the Image and Video Computing Group at Boston University related to tracking, analysis, recognition and retrieval of images and video based on humans and their actions. First, efficient ...
 keynote Masataka Goto: PodCastle and Songle: Web Services for Retrieval and Browsing of Speech and Music Content on the Basis of Automatic Content Analysis and Crowdsourcing
 keynote Alfred Spector: The Continuing Metamorphosis of the Web
The invention of HTML and HTTP catalyzed a path of enormous innovation that was hard to foresee in the early 1990’s. The Web’s continuing metamorphosis has led to fantastically increased capabilities and economic value. It has catalyzed the creation of ...

Lectures:


 lecture Oliver Schreer: "3D Scene Structure Analysis for Semantic Annotation and Retrieval of Unedited Video"
 lecture Martin Buss, Kolja Kühnlenz, Dirk Wollherr, Florian Rohrmüller, Tianguang Zhang, Quirin Mühlbauer, Georgios Lidoris, Stefan Sosnowski, Tingting Xu, Klaas Klasing, Andrea Bauer: 4. The Autonomous City Explorer
This video presents the Autonomous City Explorer (ACE) project. Its goal was to create a robot capable of navigating in unknown urban environments without the use of prior map knowledge or GPS data. The robot had to find its ...
 lecture Yu-Feng Li: A Convex Method for Locating Regions of Interest with Multi-Instance Learning
In content-based image retrieval (CBIR) and image screening, it is often desirable to automatically locate the regions of interest (ROI) in the images. This can be accomplished with multi-instance learning techniques by treating each image as a bag of instances ...
 lecture Aleksandrs Slivkins: Adaptive Crowdsourcing Algorithms for the Bandit Survey Problem
Very recently crowdsourcing has become the de facto platform for distributing and collecting human computation for a wide range of tasks and applications such as information retrieval, natural language processing and machine learning. Current crowdsourcing platforms have some limitations in ...
 lecture Kazuhiro Seki, Kuniaki Uehara: Adaptive Subjective Triggers for Opinionated Document Retrieval
 lecture Alan Hanjalic: Advancements in Social Media Retrieval: How did PetaMedia help?
This presentation provides a brief overview of the FP7 Network of Excellence (NoE) Peer-to-Peer Tagged Media (PetaMedia). This NoE has been established and conducted with the goal to explore the possibilities for developing advanced solutions for multimedia information retrieval (MIR) ...
 lecture Jisheng Liang: A Large-Scale System for Annotating and Querying Quotations in News Feeds
In this paper, we describe a system that automatically extracts quotations from news feeds, and allows efficient retrieval of the semantically annotated quotes. APIs for real-time querying of over 10 million quotes extracted from recent news feeds are publicly available. ...
 lecture Nicolas Moënne-Loccoz: An Integrated framework for the management of video collection
Video document retrieval is now an active part of the domain of multimedia retrieval. However, unlike for other media, the management of a collection of video documents adds the problem of efficiently handling an overwhelming volume of temporal data. Challenges include ...
 lecture Ricardo Baeza-Yates: An Introduction to Web Retrieval
 lecture Olivier Aubert: Annotations, a key asset for video-based e-learning
Video documents provide a great support for e-learning, and in some cases even constitute the core of the proposed contents. Yet video remains a particular medium that lacks properties that can be found more easily in texts, such as structure ...
 lecture Juan F. Huete: Applications of Influence Diagrams to Information Retrieval
 lecture Timo Honkela: Applying unsupervised learning in creating language models for information retrieval and machine translation
 lecture Jose M. Iñesta: A Probabilistic Approach to Melodic Similarity
Melodic similarity is an important research topic in music information retrieval. The representation of symbolic music by means of trees has proven to be suitable in melodic similarity computation, because they are able to code rhythm in their structure leaving ...
 lecture Róbert Busa-Fekete: A Robust Ranking Methodology based on Diverse Calibration of AdaBoost
In subset ranking, the goal is to learn a ranking function that approximates a gold standard partial ordering of a set of objects (in our case, relevance labels of a set of documents retrieved for the same query). In this ...
 lecture Dunja Mladenić, Marko Grobelnik: Artificial Intelligence Handling Text Data
Text is one of the traditional ways of communication between people. With the growing availability of text data in electronic form, handling and analysis of text by means of computers gained popularity. Handling text data with machine learning methods brought ...
 lecture Liwei Wang: A Theoretical Analysis of NDCG Type Ranking Measures
Ranking has been extensively studied in information retrieval, machine learning and statistics. A central problem in ranking is to design a ranking measure for evaluation of ranking functions. State-of-the-art learning-to-rank methods often train a ranking ...
 lecture Nasir Naveed: ATT: Analyzing Temporal Dynamics of Topics and Authors in Social Media
Understanding topical trends and user roles in topic evolution is an important challenge in the field of information retrieval. In this contribution, we present a novel model for analyzing the evolution of users’ interests with respect to produced content over time. ...
 lecture Geoffrey E. Hinton: A tutorial on Deep Learning
Complex probabilistic models of unlabeled data can be created by combining simpler models. Mixture models are obtained by averaging the densities of simpler models and "products of experts" are obtained by multiplying the densities together and renormalizing. A far more ...
 lecture Joachim Köhler: Audio Content Search
Dr. Joachim Köhler showed several techniques and applications in the area of audio content search. Robust segmentation algorithms were applied to detect speech and non-speech events and to perform a speaker segmentation and clustering task. Further speech and music alignment ...
 lecture Qiang Zhu: Augmenting the Generalized Hough Transform to Enable the Mining of Petroglyphs
Rock art is an archaeological term for human-made markings on stone. It is believed that there are millions of petroglyphs in North America alone, and the study of this valued cultural resource has implications even beyond anthropology and history. Surprisingly, ...
 lecture Yun Fu: A Videography Analysis Framework for Video Retrieval and Summarization
In this work, we focus on developing features and approaches to represent and analyze videography styles in unconstrained videos. By unconstrained videos, we mean typical consumer videos with significant content complexity and diverse editing artifacts, mostly with long duration. Our ...
 lecture Vassilios Stathopoulos: Bayesian Probabilistic Models for Image Retrieval
In this paper we present new probabilistic ranking functions for content based image retrieval. Our methodology generalises previous approaches and is based on the predictive densities of generative probabilistic models modelling the density of image features. We evaluate the proposed ...
 lecture Roelof Van Zwol: Beyond Relevance
Finding similar and relevant media content given a user query or sample image has been at the core of the multimedia retrieval community for a long time. In this talk, I will identify and address multimedia challenges that play a ...
 lecture Hugo Zaragoza: Beyond String Search: Fast and Accurate Retrieval of Entities and Dependencies
 lecture Anil K. Jain: Big Data Clustering
The goal of data clustering is to organize a set of n objects into k clusters such that objects in the same cluster are more similar to each other than objects in different clusters. Clustering is one of the most ...
 lecture Jouni Tuominen: Biological Names and Taxonomies on the Semantic Web -- Managing the Change in Scientific Conception
Biodiversity management requires the use of heterogeneous biological information from multiple sources. Indexing, aggregating, and finding such information is based on names and taxonomic knowledge of organisms. However, taxonomies change over time due to evolution, new scientific findings, opinions of ...
 lecture Christian Kohlschütter: Boilerplate Detection Using Shallow Text Features
In addition to the actual content, Web pages consist of navigational elements, templates, and advertisements. This boilerplate text is typically not related to the main content, may deteriorate search precision and thus needs to be detected properly. In this paper, ...
 lecture Maksims Volkovs: BoltzRank: Learning to Maximize Expected Ranking Gain
Ranking a set of retrieved documents according to their relevance to a query is a popular problem in information retrieval. Methods that learn ranking functions are difficult to optimize, as ranking performance is typically judged by metrics that are not ...
 lecture Katja Hofmann: Challenges in online learning to rank for information retrieval
Online learning to rank for information retrieval (IR) aims to enable search systems to learn directly from interactions with their users. In our recent work, we explore formulations based on reinforcement learning to allow systems to continuously adapt to changes ...
 lecture Daniel Gatica-Perez: Classifying Visual Scenes with Affine Invariant Regions and Text Retrieval Methods
 lecture Kinga Schumacher: Combining Fact and Document Retrieval with Spreading Activation for Semantic Desktop Search
 lecture Roman Yangarber: Combining Information Retrieval and Information Extraction for Medical Intelligence
Global epidemic and medical surveillance is an essential function of Public Health agencies, whose primary aim is to protect the public from major health threats. To perform this function effectively one requires timely and accurate medical information from a wide ...
 lecture Rachanee Ungrangsi: combiSQORE: An Ontology Combination Algorithm
Automatic knowledge reuse for Semantic Web applications imposes several challenges on ontology search. Existing ontology retrieval systems merely return a lengthy list of relevant single ontologies, which may not completely cover the specified user requirements. Therefore, there arises an increasing ...
 lecture Raphaël Troncy: COMM: Designing a Well-Founded Multimedia Ontology for the Web
Semantic descriptions of non-textual media available on the web can be used to facilitate retrieval and presentation of media assets and documents containing them. While technologies for multimedia semantic descriptions already exist, there is as yet no formal description of ...
 lecture Anna Ritchie: Comparing Citation Contexts for Information Retrieval
 lecture Jasminka Dobsa: Comparison of information retrieval techniques: Latent semantic indexing (LSI) and Concept indexing (CI)
Information retrieval in the vector space model is based on literal matching of terms in the documents and the queries. The model is implemented by creating the term-document matrix, which is formed on the basis of frequencies of terms in ...
 lecture James G. Shanahan: Computational advertising: business models, technologies and issues (CoAd)
Internet advertising revenues in the United States totaled $21 billion for 2007, up 25 percent versus 2006 revenues of $16.9 billion (according to the Interactive Advertising Bureau); this represents approximately half the worldwide revenue from online advertising. Fueled by these ...
 lecture Gaëlle Loosli: Context changes detection by one-class svms
For a system that aims at taking into account the user, we need to consider that there are many different behaviors as well as many different users. Hence we need adaptive, unsupervised (or semi-supervised) learning methods. Our idea is to ...
 lecture Sam Chapman: Creating and Using Organisational Semantic Webs in Large Networked Organisations
Modern knowledge management is based on the orchestration of dynamic communities that acquire and share knowledge according to customized schemas. However, while independence of ontological views is favoured, these communities must also be able to share their knowledge with the ...
 lecture Gabriela Csurka: Crossing textual and visual content in different application scenarios
In this presentation, we present a method based on Trans-media Pseudo-Relevance Feedback that allows crossing visual and textual content through a multimodal knowledge base. The main idea is to use one of the modalities to retrieve multimodal documents from the knowledge ...
 lecture Jian-Yun Nie: Cross-Language Information Retrieval and Beyond
Cross-language information retrieval (CLIR) aims to find relevant documents that are written in a different language than the query. When there is not enough relevant information in the language of the user, the user may be interested in using CLIR ...
 lecture Matteo Palmonari: Cross-language Semantic Retrieval and Linking of E-gov Services
Public administrations are aware of the advantages of sharing Open Government Data in terms of transparency, development of improved services, collaboration between stakeholders, and spurring new economic activities. Initiatives for the publication and interlinking of government service catalogs as Linked ...
 lecture Jan Rupnik: Cross-Lingual Document Retrieval through Hub Languages
We address the problem of learning similarities between documents written in different languages for language pairs where little or no direct supervision (in the form of a comparable or parallel corpus) is available. To make up for the lack of ...
 lecture Tadej Štajner: Cross-lingual named entity disambiguation for concept translation
The talk will focus on our experience in developing an integrated natural language processing pipeline, consisting of several distinct components, operating across multiple languages. We will demonstrate a cross-language information retrieval method that enables reuse of the same language resources ...
 lecture Mor Naaman: Data by the people, for the people
What can we learn from social media and community-contributed collections of information on the web? The most salient attribute of social media is the creation of an environment that promotes user contributions in the form of authoring, curation, discussion and ...
 lecture Maxim Gubin: Data Structures in IR (DSIR)
The course presents an overview of theoretical and practical approaches to the implementation of information retrieval systems. It is mainly focused on classic big and large-scale search problems but also includes a brief description of structures applicable to other IR tasks. The ...
 lecture Pavel Serdyukov: Dealing with Heterogeneity in Profiles for Personalized Information Retrieval
 lecture Frank Reichartz: Dependency Tree Kernels for Relation Extraction from Natural Language Text
The automatic extraction of relations from unstructured natural text is challenging but offers practical solutions for many problems like automatic text understanding and semantic retrieval. Relation extraction can be formulated as a classification problem using support vector machines and kernels ...
 lecture Leonidas J. Guibas: Detection of Symmetries and Repeated Patterns in 3D Point Cloud Data
Digital models of physical shapes are becoming ubiquitous in our economy and life. Such models are sometimes designed ab initio using CAD tools, but more and more often they are based on existing real objects whose shape is acquired using ...
 lecture Hua-Yan Wang: Dirichlet Aggregation: Unsupervised Learning towards an Optimal Metric for Proportional Data
Proportional data (normalized histograms) occur frequently in various areas, and they can be mathematically abstracted as points residing in a geometric simplex. A proper distance metric on this simplex is of importance in many applications including classification and ...
 lecture Michael I. Jordan: Dirichlet Processes, Chinese Restaurant Processes, and all that
Bayesian approaches to learning problems have many virtues, including their ability to make use of prior knowledge and their ability to link related sources of information, but they also have many vices, notably the strong parametric assumptions that are often ...
 lecture Jerónimo Arenas-García: Discovering Music Structure via Similarity Fusion
Automatic methods for music navigation and music recommendation exploit the structure in the music to carry out a meaningful exploration of the “song space”. To get a satisfactory performance from such systems, one should incorporate as much information about songs ...
 lecture Peter Mika: Distributed Indexing for Semantic Search
In this paper we describe the process of building indices for semantic search using MapReduce. We compare the two most straightforward representations of RDF data, the horizontal index structure using parallel indices and the vertical index structure using fields. ...
 lecture Fabio Crestani, Ilya Markov: Distributed Information Retrieval
The research area of Distributed Information Retrieval (DIR) provides techniques that help to integrate multiple searchable resources into a single federated resource and provide direct access to them through a single system. A DIR system can access Deep Web resources ...
 lecture Qiaozhu Mei: DivRank: the Interplay of Prestige and Diversity in Information Networks
Information networks are widely used to characterize the relationships between data items such as text documents. Many important retrieval and mining tasks rely on ranking the data items based on their centrality or prestige in the network. Beyond prestige, diversity ...
 lecture Allan Hanbury, Mihai Lupu: Domain Specific Information Retrieval
Domain-specific search engines only index documents relevant to a specific domain, such as health information or intellectual property information. As prior knowledge is available about the domain of interest, such search engines can be adapted to take advantage of this ...
 lecture Frizo Janssens: Dynamic hybrid clustering of bioinformatics by incorporating text mining and citation analysis
To unravel the concept structure and dynamics of the bioinformatics field, we analyze a set of 7401 publications from the Web of Science and MEDLINE databases, publication years 1981–2004. For delineating this complex, interdisciplinary field, a novel bibliometric retrieval strategy ...
 lecture Thorsten Joachims: Dynamic Ranked Retrieval
We present a theoretically well-founded retrieval model for dynamically generating rankings based on interactive user feedback. Unlike conventional rankings that remain static after the query was issued, dynamic rankings allow and anticipate user activity, thus providing a way to combine ...
 lecture Ismail Sengor Altingovde, Nattiya Kanhabua: Dynamics of Web: Analysis and Implications from Search Perspective
The dynamicity of the Web and its implications for various components of search systems have attracted considerable attention in the last decade. This course, in the first place, aims to introduce students to the general and wide topic of Web evolution, ...
 lecture Peter Mika: Effective and Efficient Entity Search in RDF data
Triple stores have long provided RDF storage as well as data access using expressive, formal query languages such as SPARQL. The new end users of the Semantic Web, however, are mostly unaware of SPARQL and overwhelmingly prefer imprecise, informal keyword ...
 lecture Ioannis Alexiou: Efficient Kernels Couple Visual Words Through Categorical Opponency
Recent progress has been made on sparse dictionaries for the Bag-of-Visual-Words (BOVW) approach to object recognition and scene categorization. In particular, jointly encoded words have been shown to greatly enhance retrieval and categorization performance by both improving dictionary sparsity, which ...
 lecture Gerhard Weikum: Efficient Top-k Queries for XML Information Retrieval
 lecture Jasmin Franz, Andreas Gaag: empolis/tu munich │ Semantic Enterprise: Unleashing Solution Knowledge in the Area of Mechanical Engineering
PROCESSUS, a project within the nationally funded German high-tech initiative THESEUS, has the objective of creating an IT-based corporate system that will allow companies to compare products, solutions and details of business associates, as well as locating the complex ...
 lecture Yusuf Aytar: Enhancing Exemplar SVMs using Part Level Transfer Regularization
Exemplar SVMs (E-SVMs, Malisiewicz et al., ICCV 2011), where an SVM is trained with only a single positive sample, have found applications in the areas of object detection and Content-Based Image Retrieval (CBIR), amongst others. In this paper we introduce ...
 lecture Véronique Malaisé: Enriching a Thesaurus to Improve Retrieval of Audiovisual Documents
In many archives of audiovisual documents, annotation and retrieval are done using metadata from a structured vocabulary or a thesaurus. In practice, many of these thesauri have limited or no structure. The objective of this paper is to find out ...
 lecture Pavel Dmitriev, Pavel Serdyukov, Sergey Chernov: Enterprise and Desktop search (EDS)
The Enterprise and Desktop Search problems have recently received a considerable amount of attention from academia, mainly due to the increasing demand for industrial solutions supporting various search tasks in intranets. While challenges arising in intranet search are not entirely new ...
 lecture Krisztian Balog: Entity Search: Building Bridges between Two Worlds
We consider the task of entity search and examine to what extent state-of-the-art information retrieval (IR) and semantic web (SW) technologies are capable of answering information needs that focus on entities. We also explore the potential of combining IR with ...
 lecture Diane Lambert: Evaluating Online Ad Campaigns in a Pipeline: Causal Models At Scale
Display ads proliferate on the web, but are they effective? Or are they irrelevant in light of all the other advertising that people see? We describe a way to answer these questions, quickly and accurately, without randomized experiments, surveys, focus ...
 lecture Samy Bengio: Exploiting Hyperlinks to Learn a Retrieval Model
 lecture Guangyu Zhu: Extracting Relevant Named Entities for Automated Expense Reimbursement
Expense reimbursement is a time-consuming and labor-intensive process across organizations. In this talk, we present an automated expense reimbursement system developed at IBM Almaden Research Center. Our complete solution involves (1) an electronic document management infrastructure that provides multi-channel image ...
 lecture Ting Wang: Extracting Semantic Constraint from Description Text for Semantic Web Service Discovery
Various semantic web service discovery techniques have been proposed, many of which perform profile-based service signature (I/O) matching. However, the service I/O concepts are not sufficient to discover web services accurately. This paper presents a new method to ...
 lecture Lawrence Cayton: Fast Nearest Neighbor Retrieval for Bregman Divergences
We present a data structure enabling efficient NN retrieval for Bregman divergences. The family of Bregman divergences includes many popular dissimilarity measures including KL-divergence (relative entropy), Mahalanobis distance, and Itakura-Saito divergence. These divergences present a challenge for efficient NN retrieval ...
 lecture Ni Lao: Fast Query Execution for Retrieval Models Based on Path-Constrained Random Walks
Many recommendation and retrieval tasks can be represented as proximity queries on a labeled directed graph, with typed nodes representing documents, terms, and metadata, and labeled edges representing the relationships between them. Recent work has shown that the accuracy of ...
 lecture Bharath K. Sriperumbudur: Finding Musically Meaningful Words by Sparse CCA
A musically meaningful vocabulary is one of the keystones in building a computer audition system that can model the semantics of audio content. If a word in the vocabulary is not clearly represented by the underlying acoustic representation, the word ...
 lecture Mariano Consens: Flexible XML Retrieval using Summaries
 lecture Andrei Broder: From query based Information Retrieval to context driven Information Supply
 lecture Alan Hanjalic: From technology-Inspired towards Utility-Centered Multimedia Information Retrieval
 lecture Suvrit Sra: Generalized Dictionary Learning for Symmetric Positive Definite Matrices with Application to Nearest Neighbor Retrieval
We introduce Generalized Dictionary Learning (GDL), a simple but practical framework for learning dictionaries over the manifold of positive definite matrices. We illustrate GDL by applying it to Nearest Neighbor (NN) retrieval, a task of fundamental importance in disciplines such ...
 lecture Sergej Sizov: GeoFolk: Latent Spatial Semantics in Web 2.0 Social Media
We describe an approach for multi-modal characterization of social media by combining text features (e.g. tags as a prominent example of short, unstructured text labels) with spatial knowledge (e.g. geotags and coordinates of images and videos). Our model-based framework GeoFolk ...
 lecture Zoubin Ghahramani: Graphical models
An introduction to directed and undirected probabilistic graphical models, including inference (belief propagation and the junction tree algorithm), parameter learning and structure learning, variational approximations, and approximate inference. - Introduction to graphical models: (directed, undirected and factor graphs; conditional independence; ...
 lecture Gabriella Kazai, Nataša Milić-Frayling: Greetings and Introduction and Summary of Topics and Themes
This workshop aims to bring together researchers and industry practitioners, in Information Retrieval, Digital Libraries, eBooks, Human Computer Interaction, Publishing industry and on-line book services, to map out the agenda for digital book research, producing a draft research agenda for ...
 lecture Jason Weston: Half transductive ranking
We study the standard retrieval task of ranking a fixed set of items given a previously unseen query and pose it as the half transductive ranking problem. The task is transductive as the set of items is fixed. Transductive representations ...
 lecture Ivica Dimitrovski: Hierarchical Annotation of Medical Images
In this paper, we describe an approach for the automatic medical annotation task of the 2008 CLEF cross-language image retrieval campaign (ImageCLEF). The data comprise 12076 fully annotated images according to the IRMA code. This work is focused on the ...
 lecture A. Taylan Cemgil: Hierarchical Bayesian Models for Audio and Music Processing
In recent years, there has been an increasing interest in statistical approaches and tools from machine learning for the analysis of audio and music signals, driven partially by applications in music information retrieval, computer aided music education and interactive music ...
 lecture Thorsten Joachims: How does Clickthrough Data Reflect Retrieval Quality?
 lecture Esther Kaufmann: How Useful are Natural Language Interfaces to the Semantic Web for Casual End-users?
Natural language interfaces offer end-users a familiar and convenient option for querying ontology-based knowledge bases. Several studies have shown that they can achieve high retrieval performance as well as domain independence. This paper focuses on usability and investigates if NLIs ...
 lecture Anton Nijholt: Human Sensing, Implicit Tagging, and Implicit Interaction
Annotation of multimedia data is usually done manually by human annotators or automatically, using algorithms that directly analyze the multimedia data. An alternative to these methods is the use of implicit tagging, where the aim is to observe the user ...
 lecture Mitja Trampuš, Jan Berčič: (i)DiversiNews a stream-based, on-line service for diversified news
With the ever-increasing ease and speed of opinion exchange, the internet often displays the echo chamber effect. This is exacerbated by a free market: search engines and other data aggregators are monetarily incentivized to primarily show the most popular opinions. ...
 lecture Torsten Sattler: Image Retrieval for Image-Based Localization Revisited
To reliably determine the camera pose of an image relative to a 3D point cloud of a scene, correspondences between 2D features and 3D points are needed. Recent work has demonstrated that directly matching the features against the points outperforms ...
 lecture Michel Barlaud: Image Retrieval via Kullback Divergence of Patches of Wavelets Coefficients in the k-NN Framework
This talk presents a framework to define an objective measure of the similarity (or dissimilarity) between two images for image processing. The problem is twofold: define a set of features that capture the information contained in the image relevant for ...
 lecture Gérard Dupont: Implicit feedback learning in semantic and collaborative information retrieval systems
Information retrieval is a very wide domain that can involve various types of activities and tasks. Many complex factors play a part in a search for information, and many systems have been experimented with. Nowadays a general consensus has been established around ...
 lecture Hang Li: Improving Quality of Training Data for Learning to Rank Using Click-Through Data
In information retrieval, the relevance of documents with respect to queries is usually judged by humans, and used in evaluation and/or learning of ranking functions. Previous work has shown that a certain level of noise in relevance judgments has little effect on ...
 lecture David Hawking: Information Retrieval
 lecture Thorsten Joachims: Information Retrieval and Language Technology
The course will give an overview of how statistical learning can help organize and access information that is represented in textual form. In particular, it will cover tasks like text classification, information retrieval, information extraction, topic detection, and topic tracking. ...
 lecture Thomas Hofmann: Information Retrieval and Text Mining
This four hour course will provide an overview of applications of machine learning and statistics to problems in information retrieval and text mining. More specifically, it will cover tasks like document categorization, concept-based information retrieval, question-answering, topic detection and document ...
 lecture Thomas Hofmann: Information Retrieval and Text Mining
This four hour course will provide an overview of applications of machine learning and statistics to problems in information retrieval and text mining. More specifically, it will cover tasks like document categorization, concept-based information retrieval, question-answering, topic detection and document ...
 lecture Julien Nioche: Information Retrieval in GATE
 lecture Djoerd Hiemstra: Information Retrieval Modeling (IRM)
There is no such thing as a dominating model or theory of information retrieval, unlike the situation in, for instance, the area of databases, where the relational model is the dominating database model. In information retrieval, some models work for ...
 lecture Francesc J. Ferri: Interactive image retrieval using smoothed nearest neighbor estimates
 lecture Yisong Yue: Interactively Optimizing Information Retrieval Systems as a Dueling Bandits Problem
We present an online learning framework tailored towards real-time learning from observed user behavior in search engines and other information retrieval systems. In particular, we only require pairwise comparisons, which were shown to be reliably inferred from implicit feedback. We ...
 lecture Gang Hua: Interest Seam Image
We propose the interest seam image, an efficient visual synopsis for video. To extract an interest seam image, a spatiotemporal energy map is constructed for the target video shot. Then an optimal seam which encompasses the highest energy is identified by ...
 lecture Taesup Moon: IntervalRank - Isotonic Regression with Listwise and Pairwise Constraints
Ranking a set of retrieved documents according to their relevance to a given query has become a popular problem at the intersection of web search, machine learning, and information retrieval. Recent work on ranking focused on a number of different ...
 lecture Andrew Aksyonoff: Introduction to Information Retrieval
An introductory course that covers all the basic concepts one needs to know to build keyword search engines from scratch. We will give an overview of all the engine pipeline stages (indexing, searching, ranking, etc.) in reasonable detail, aided by practical examples from ...
 lecture James Mayfield: Intro to Information Retrieval
 lecture Carlo Semenza: "I remember you but not your name" - The brain and proper name retrieval / "Spomnim se te, a ne vem, kako se kličeš" - Možgani in priklic lastnih imen
Proper names are important in everyday life. Forgetting them is a very common source of embarrassment, increasingly frequent with aging. Their difference from common names has been a matter of philosophical speculation and linguistic theories. Until two decades ago, ...
 lecture Peter Bednar: Java library for support of text mining and retrieval
 lecture José A. Rodriguez: Label Embedding for Text Recognition
The standard approach to recognizing text in images consists in first classifying local image regions into candidate characters and then combining them with high-level word models such as conditional random fields (CRF). This paper explores a new paradigm that departs ...
 lecture Hugo Zaragoza: Language Models for Information Retrieval
 lecture Diana Maynard: Language Technologies
This tutorial covers the use of Human Language Technologies for the Semantic Web and Web Services. It includes sections on HLT and Text Mining for the Semantic Web, various forms of Information Extraction, Ontology Population and Semantic Metadata Creation, and ...
 lecture Thomas Hofmann: Latent Semantic Variable Models
In the context of information retrieval and natural language processing, latent variable models are quite useful in modeling and discovering hidden structure that often leads to "semantic" data representations. This talk will provide an overview of the most popular approaches ...
 lecture Michael Bendersky: Learning Concept Importance Using a Weighted Dependence Model
Modeling query concepts through term dependencies has been shown to have a significant positive effect on retrieval performance, especially for tasks such as web search, where relevance at high ranks is particularly critical. Most previous work, however, treats all concepts ...
 lecture Paul N. Bennett, David Maxwell Chickering, Anton Mityagin: Learning Consensus Opinion: Mining Data from a Labeling Game
In this paper, we consider the challenge of how to identify the consensus opinion of a set of users as to how the results for a query should be ranked. Once consensus rankings are identified for a set of queries, ...
 lecture Yoshua Bengio, Samy Bengio: Learning Deep Hierarchies of Representations
Whereas theoretical work suggests that deep architectures might be computationally and statistically more efficient at representing highly-varying functions, training deep architectures was unsuccessful until the recent advent of algorithms based on unsupervised pre-training of each level of a hierarchically structured ...
 lecture Rioe Kliper: Learning Distance Function by Coding Similarity
We consider the problem of learning a similarity function from a set of positive equivalence constraints, i.e. "similar" point pairs. We define the similarity in information theoretic terms, as the gain in coding length when shifting from independent encoding of ...
 lecture Charles Parker: Learning for Efficient Retrieval of Structured Data with Noisy Queries
Increasingly large collections of structured data necessitate the development of efficient, noise-tolerant retrieval tools. In this work, we consider this issue and describe an approach to learn a similarity function that is not only accurate, but that also increases the ...
 lecture De-Chuan Zhan: Learning Instance Specific Distances Using Metric Propagation
In many real-world applications, such as image retrieval, it would be natural to measure the distances from one instance to others using instance specific distance which captures the distinctions from the perspective of the concerned instance. However, there is no ...
 lecture Andreas Dengel: Learning Mental Associations as a means to build Organizational Memories
Office workspace reveals collections of documents structured along directories, bookmarks and email folders. The respective taxonomies represent conceptual implicit knowledge generated by the user about his/her role, tasks, and interests. Starting from that, learning methods can be applied to generate ...
 lecture Thorsten Joachims: Learning Rankings for Information Retrieval
 lecture Jose Oncina: Learning Stochastic Edit Distances from Structured Data: Application in Music Retrieval
 lecture Chun-Nam Yu: Learning Structural SVMs with Latent Variables
We present a large-margin formulation and algorithm for structured output prediction that allows the use of latent variables. The paper identifies a particular formulation that covers a large range of application problems, while showing that the resulting optimization problem can ...
 lecture Kai Puolamäki: Learning to Learn Implicit Queries from Gaze Patterns
In the absence of explicit queries, an alternative is to try to infer users' interests from implicit feedback signals, such as clickstreams or eye tracking. The interests, formulated as an implicit query, can then be used in further searches. We ...
 lecture Frederic Jurie: Learning Visual Distance Function for Object Identification from one Example
Comparing images is essential to several computer vision problems, like image retrieval or object identification. The comparison of two images heavily relies on the definition of a good distance function. Standard functions (e.g. the Euclidean distance in the original feature ...
 lecture Rong Jin: Learn to Weight Term in Information Retrieval Using Category Information
 lectureLecture 24: Compare Map Implementations
What are we doing today? We’re gonna talk about hashing. Hashing’s one of the coolest things you’re ever gonna learn in 106b, so it’s a good day to be here and learn something really neat, kinda clever, inventive idea for ...
 lecture Tom Diethe: Linear Programming Boosting for Classification of Musical Genre
Classification of musical genre from raw audio files is a fairly well researched area of music research, and as such provides a good starting point for testing a new algorithm. The Music Information Retrieval Evaluation eXchange (MIREX) is a yearly ...
 lecture Danica Damljanović: Linked Data-based Concept Recommendation: Comparison of Different Methods in Open Innovation Scenario
Concept recommendation is a widely used technique that aims to help users choose the right tags, improve their Web search experience and carry out a multitude of other tasks. In finding potential problem solvers in Open Innovation (OI) scenarios, the concept recommendation ...
 lecture Isabelle Augenstein: LODifier: Generating Linked Data from Unstructured Text
The automated extraction of information from text and its transformation into a formal description is an important goal in both Semantic Web research and computational linguistics. The extracted information can be used for a variety of tasks such as ...
 lecture Padraig Cunningham: Machine learning for access and retrieval I
 lecture Jia Li: Machine learning for access and retrieval II
 lecture Mark Hasegawa-Johnson: Machine Learning in Acoustic Signal Processing
This tutorial presents a framework for understanding and comparing applications of pattern recognition in acoustic signal processing. Representative applications will be delimited by two binary features: (1) regression vs. (2) classification (inferred variables are continuous vs. discrete), (A) instantaneous vs. ...
 lecture Chris Manning: Machine Learning of Language from Distributional Evidence
Christopher Manning thinks linguistics went astray in the 20th century when it searched “for homogeneity in language, under the misguided assumption that only homogeneous systems can be structured.” In the face of human creativity with language, rigid categories of linguistic ...
 lecture Chang Wang: Manifold Alignment using Procrustes Analysis
In this paper we introduce a novel approach to manifold alignment, based on Procrustes analysis. Our approach differs from "semi-supervised alignment" in that it results in a mapping that is defined everywhere - when used with a suitable dimensionality reduction ...
 lecture Brandon Paulson, Tracy Hammond: MARQS: Media album retrieval by query sketch
An advertisement-like short demo of a tool for retrieving photos from an album by sketching.
 lecture Kai Ming Ting: Mass Estimation and Its Applications
This paper introduces mass estimation—a base modelling mechanism in data mining. It provides the theoretical basis of mass and an efficient method to estimate mass. We show that it solves problems very effectively in tasks such as information retrieval, regression ...
 lecture Chintan Patel: Matching Patient Records to Clinical Trials Using Ontologies
This talk describes a large case study that explores the applicability of ontology reasoning to problems in the medical domain. We investigate whether it is possible to use such reasoning to automate common clinical tasks that are currently labor ...
 lecture Massimo Ferri: Mathematics, Shape, Computer Vision
This will be an excursus of many aspects of mathematics – in particular of geometry and topology – which are applied in the fields of shape analysis and computer vision. Mathematical subjects to be touched: gradients, critical points, transforms, distances, ...
 lecture Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuka: Measuring the Similarity between Implicit Semantic Relations from the Web
Measuring the similarity between semantic relations that hold among entities is an important and necessary step in various Web related tasks such as relation extraction, information retrieval and analogy detection. For example, consider the case in which a person knows ...
 lecture Martha Larson: MediaEval Multimedia Benchmark Initiative: Looking back and looking forward to year five
This talk presents MediaEval (http://www.multimediaeval.org), an international multimedia benchmark initiative that offers innovative new tasks to the multimedia community. It is open to any research group who signs up to participate. MediaEval sets its focus on social and human aspects ...
 lecture Jure Dimec: Medjezično iskanje (Cross-language information retrieval)
Cross-language retrieval is a relatively new branch of information storage and retrieval that, in multilingual text collections, allows a query in one of the collection's languages to retrieve documents in two or more of the collection's languages. In this seminar we will try to describe some ...
 lecture Kai Puolamäki: Methods for Fusing Eye Movements and Text Content for Information Retrieval
 lecture Guangyu Zhu: Mining Rich Session Context to Improve Web Search
User browsing information, particularly their non-search related activity, reveals important contextual information on the preferences and the intent of web users. In this paper, we expand the use of browsing information for web search ranking and other applications, with an ...
 lecture Ludovic Denoyer: Mining XML documents - Bridging the gap between Machine Learning and Information Retrieval
 lecture Primož Škraba: Multilingual Document Retrieval Through Hub Languages
In this paper we extend previous work on document retrieval across multilingual corpora. In this setting it is often assumed that we have a certain alignment given, based on which we can learn a mapping between spaces. In true multilingual corpora ...
 lecture Alan F. Smeaton: Multimedia Information Indexing and Retrieval
 lecture Stefan Rüger: Multimedia Information Retrieval
 lecture Stefan Rueger: Multimedia Information Retrieval
At its very core multimedia information retrieval means the process of searching for and finding multimedia documents; the corresponding research field is concerned with building the best possible multimedia search engines. The intriguing bit here is that the query itself ...
 lecture Agnes Lisowska: Multimodal Input for Meeting Browsing and Retrieval Interfaces: Preliminary Findings
 lecture Ruslan Salakhutdinov: Multimodal Learning with Deep Boltzmann Machines
We propose a Deep Boltzmann Machine for learning a generative model of multimodal data. We show how to use the model to extract a meaningful representation of multimodal data. We find that the learned representation is useful for classification and ...
 lecture Fred Stentiford: Multimodal Processing and Multimedia Understanding: Image Retrieval Using Eye Movements
This presentation describes experiments that explored eye behaviour when carrying out purely visual tasks on a Corel database of 1000 images. Results are reported that indicate that image identification can be carried out significantly faster with an eye tracker than ...
 lecture Marc Sloan: Multi Period Information Retrieval and Optimal Relevance Feedback using Dynamic Programming
In Multi Period Information Retrieval we consider retrieval as a stochastic yet controllable process, the ranking action during the process continuously controls the retrieval system's dynamics, and an optimal ranking policy is found in order to maximise the overall users' ...
 lecture Roberto Basili: Natural Languages and Ontology Learning
This is a one-hour video recording of the presentation of Roberto Basili at the KnowledgeWeb summer school 2005. It comprises either the video synchronized with the slides (but requires Quicktime, hence Windows or MacOS, otherwise the slides have to be ...
 lecture Miloš Radovanović: Nearest Neighbors in High-Dimensional Data: The Emergence and Influence of Hubs
High dimensionality can pose severe difficulties, widely recognized as different aspects of the curse of dimensionality. In this paper we study a new aspect of the curse pertaining to the distribution of k-occurrences, i.e., the number of times a point ...
 lecture Sunil Kumar Gupta: Nonnegative Shared Subspace Learning and Its Application to Social Media Retrieval
Although tagging has become increasingly popular in online image and video sharing systems, tags are known to be noisy, ambiguous, incomplete and subjective. These factors can seriously affect the precision of a social tag-based web retrieval system. Therefore improving the ...
 lecture David Losada: Ongoing research on sentence retrieval and novelty detection
 lecture Lillian Lee: Only Connect! Two Minor Explorations in Using Graphs for IR and NLP
Can we create a system that can learn to understand political speeches well enough to determine the speakers' viewpoints? Can we improve information retrieval by using link analysis, as is famously done in Web search, if we are dealing with ...
 lecture Thanh Tran: Ontology-based Interpretation of Keywords for Semantic Search
Current information retrieval (IR) approaches do not formally capture the explicit meaning of a keyword query but provide a comfortable way for the user to specify information needs on the basis of keywords. Ontology-based approaches allow for sophisticated semantic search ...
 lecture Sean Bechhofer: Ontology Languages for the Semantic Web
This is a one-hour video recording of the presentation of Sean Bechhofer at the KnowledgeWeb summer school 2007. It comprises the video synchronized with the slides (requires Flash) or the video alone (Flash format). Table of Contents: Ontology Languages for ...
 lecture José María Martínez: On Video Abstraction Systems' Architectures and Modelling
Nowadays the huge amount of video material stored in multimedia repositories makes the search and retrieval of such content a very slow and usually difficult task. The existing video abstraction systems aim to ease the multimedia access problem by ...
 lecture Clive Best: Open Source Intelligence
Open Source Intelligence can be defined as the retrieval, extraction and analysis of information from publicly available sources. Each of these three processes is the subject of ongoing research resulting in specialised techniques. Today the largest source of open source ...
 lecture Daniel Schneider: Open Vocabulary Speech Analysis in VITALAS
Automatic indexing of TV and radio speech data requires robust components for both speech recognition and spoken document retrieval. Due to the high topic variability and the resulting large vocabularies, classic word-based approaches have to cope with a high number ...
 lecture Sean Bechhofer: OWL: An Ontology Language for the Semantic Web
This is a one-hour video recording of the presentation of Sean Bechhofer at the KnowledgeWeb summer school 2005. It comprises either the video synchronized with the slides (but requires Quicktime, hence Windows or MacOS, otherwise the slides have to be ...
 lecture Sean Bechhofer: OWL: an Ontology Language for the Web
This is a one-hour video recording of the presentation of Sean Bechhofer at the KnowledgeWeb summer school 2006. It comprises either the video synchronized with the slides (requires Flash) or the video alone. Table of Contents: OWL: An Ontology Language ...
 lecture Sina Samangooei: Performing Content-based Retrieval of Humans using Gait Biometrics
In order to analyse surveillance video, we need to efficiently explore large datasets containing videos of walking humans. At this resolution, the human walk (their gait) can be determined automatically more readily than other features, such as the face. Analysis ...
 lecture Zakria Hussain: Pinview: Implicit Feedback in Content-Based Image Retrieval
 lecture Marine Campedel: PLATO for Information Mining in Satellite Imagery
Satellite images are numerous and weakly exploited: it is urgent to develop an efficient and fast indexing/retrieval system to ease their access. Content-based image retrieval systems (CBIR) are known to provide an efficient framework. We thus propose to associate a ...
 lecture Emmanuel Eckard: PLSI: The True Fisher Kernel and Beyond
The Probabilistic Latent Semantic Indexing model, introduced by T. Hofmann (1999), has engendered applications in numerous fields, notably document classification and information retrieval. In this context, the Fisher kernel was found to be an appropriate document similarity measure. However, the ...
 lecture Yisong Yue: Predicting Diverse Subsets Using Structural SVMs
In many retrieval tasks, one important goal involves retrieving a diverse set of results (e.g., documents covering a wide range of topics for a search query). First of all, this reduces redundancy, effectively presenting more information with the presented results. ...
 lecture Jarkko Salojärvi: Proactive Information Retrieval by User Modeling from Eye Tracking
 lecture Vikas Raykar: Probabilistic Dyadic Data Analysis with Local and Global Consistency
Dyadic data arises in many real world applications such as social network analysis and information retrieval. In order to discover the underlying or hidden structure in the dyadic data, many topic modeling techniques were proposed. The typical algorithms include Probabilistic ...
 lecture Guillaume Obozinski: Probabilistic graphical models for Information Retrieval
This course provides an introduction to probabilistic graphical modeling in the context of information retrieval. Starting with a review of basic concepts from statistics including notions of conditional independence and the maximum likelihood principle, the course will introduce the concepts ...
 lecture Yiming Yang: Protein Identification from Tandem Mass Spectra with Probabilistic Language Modeling
This paper presents an interdisciplinary investigation of statistical information retrieval (IR) techniques for protein identification from tandem mass spectra, a challenging problem in proteomic data analysis. We formulate the task as an IR problem, by constructing a “query vector” whose ...
 lecture Michael Bendersky: Quality-Biased Ranking of Web Documents
Many existing retrieval approaches do not take into account the content quality of the retrieved documents, although link-based measures such as PageRank are commonly used as a form of document prior. In this paper, we present the quality-biased ranking method ...
 lecture Tie-Yan Liu: Query-Level Stability and Generalization in Learning to Rank
This paper is concerned with the generalization ability of learning to rank algorithms for information retrieval (IR). We point out that the key for addressing the learning problem is to look at it from the viewpoint of query, and we ...
 lecture Van Dang: Query Reformulation Using Anchor Text
Query reformulation techniques based on query logs have been studied as a method of capturing user intent and improving retrieval effectiveness. The evaluation of these techniques has primarily, however, focused on proprietary query logs and selected samples of queries. In ...
 lecture Kira Radinsky: Ranking From Pairs and Triplets: Information Quality, Evaluation Methods and Query Complexity
Obtaining judgments from human raters is a vital part in the design of search engines’ evaluation. Today, there exists a discrepancy between judgment acquisition from raters (training phase) and use of the responses for retrieval evaluation (evaluation phase). This discrepancy ...
 lecture Nicolas Usunier: Ranking with Ordered Weighted Pairwise Classification
In ranking with the pairwise classification approach, the loss associated with a predicted ranked list is the mean of the pairwise classification losses. This loss is inadequate for tasks such as information retrieval where we prefer ranked lists with ...
 lecture Jiang Bian: Ranking with Query-Dependent Loss for Web Search
Queries describe the users’ search intent and therefore they play an essential role in the context of ranking for information retrieval and Web search. However, most existing approaches for ranking do not explicitly take into consideration the fact that ...
 lecture Axel-Cyrille Ngonga Ngomo: Real-time RDF extraction from unstructured data streams
The vision behind the Web of Data is to extend the current document-oriented Web with machine-readable facts and structured data, thus creating a representation of general knowledge. However, most of the Web of Data is limited to being a large ...
 lecture Nicola Fanizzi: ReduCE: A Reduced Coulomb Energy Network Method for Approximate Classification
In order to overcome the limitations of the purely deductive approaches to query answering from ontologies, inductive (instance-based) methods have been proposed as efficient and noise-tolerant tools. In this paper we propose an original approach based on non-parametric learning: the ...
 lecture Jan Rupnik: Regression Canonical Correlation Analysis
In this paper we present Regression Canonical Correlation Analysis, an extension of Canonical Correlation Analysis, where one of the dimensions is fixed and demonstrate how it can be solved efficiently. We applied the extension to the task of query translation ...
 lecture Harry Reeves Halpin, Victor Lavrenko: Relevance Feedback Between Hypertext and Semantic Search
Relevance feedback is one method for creating a ‘virtuous cycle’ - as put by Baeza-Yates - between semantics and search. Previous approaches to search have generally considered the Semantic Web and hypertext Web search to be entirely disparate, indexing and ...
 lecture Rayid Ghani: Research Challenges in Enterprise Information Retrieval
Information Retrieval is a major component of Knowledge Management systems in every business but most of the research that is being done in IR today focuses on the Web and not on the needs and challenges of businesses. This is ...
 lecture Chad Cumby, Katharina Probst, Rayid Ghani: Retrieval and Ranking of Semantic Entities for Enterprise Knowledge Management Tasks
We describe a task-sensitive approach to retrieval and ranking of semantic entities, using the domain information available in an enterprise. Our approach utilizes noisy named-entity tagging and document classification, on top of an enterprise search engine, to provide input to ...
 lecture Hao Yan: Revisiting Globally Sorted Indexes for Efficient Document Retrieval
There has been a large amount of research on efficient document retrieval in both IR and web search areas. One important technique to improve retrieval efficiency is early termination, which speeds up query processing by avoiding scanning the entire inverted ...
 lecture Francesco Denatale: Scenarios and Technologies for Event-based Media Search
Media retrieval is still an open problem, due to many problems: the semantic gap, the difficulty in scaling according to dimension and diversity of media archives, the computational burden. The use of contextual information is considered among the best options ...
 lecture Ansgar Scherp: SchemEX -- Web-Scale Indexed Schema Extraction of Linked Open Data
We present SchemEX, an approach and tool for web-scale, real-time indexing and schema extraction of Linked Open Data (LOD) at linear runtime complexity. As we cannot assume that a complete retrieval of the LOD cloud on a local machine is ...
 lecture Yiannis Kompatsiaris: Search Computing: Business Areas, Research and Socio-Economic Challenges
Search has become an important and necessary component of many diverse ICT applications. A large number of business and application areas depend on the efficiency and availability of search techniques that are capable of processing and retrieving heterogeneous and dispersed ...
 lecture Gaël Dias: Searching the Web by Discovering and Clustering Related Terms
The amount of information on the web is growing so fast that it is becoming more and more difficult for classical search engines to find relevant information. Indeed, due to the frenetic increase of webpages written in different languages and ...
 lecture Jonathan Hurlock: Searching Twitter: Separating the Tweet from the Chaff
Within the millions of digital communications posted in online social networks, there is undoubtedly some valuable and useful information. Although a large portion of social media content is considered to be babble, research shows that people share useful links, provide ...
 lecture Ruben Tous: Semantic-driven multimedia retrieval with the MPEG Query Format
The MPEG Query Format (MPQF) is a new standard from the MPEG standardization committee which provides a standardized interface to multimedia document repositories. The purpose of this paper is to describe the necessary modifications which will allow MPQF to manage metadata ...
 lecture Nadine Steinmetz, Jens Lehmann: Semantic Multimedia Information Retrieval Based on Contextual Descriptions
Semantic analysis and annotation of textual information with appropriate semantic entities is an essential task to enable content based search on the annotated data. For video resources textual information is rare at first sight. But in recent years the development ...
 lecture Michela Spagnuolo: Semantics in digital content: From multimedia to emerging 3D Media
This session aims to solicit discussions, exchange ideas and establish new partnerships in the area of semantic media technologies and semantic based modelling and processing in general but will focus on 3D content in particular, according to the missions of ...
 lecture Roelof Van Zwol: (Semantic) Structure in Structured Document Retrieval
 lecture Martin Szummer: Semi-supervised Learning of Compact Document Representations with Deep Networks
Finding a good representation of text documents is crucial in document retrieval and classification systems. Nowadays, the most popular representation is simply based on a vector of counts storing the number of occurrences of each word in the document. This ...
 lecture Lei Zhang: Semplore: An IR Approach to Scalable Hybrid Query of Semantic Web Data
As an extension to the current Web, Semantic Web will not only contain structured data with machine understandable semantics but also textual information. While structured queries can be used to find information more precisely on the Semantic Web, keyword searches ...
 lecture Roberto Bilbao: Services associated to digitalised contents of tissues in biobanks across Europe – BIOPOOL
Nowadays it has become common practice to take digital images of thin slices of biopsies that are obtained for studying the composition of cells, glands, tissues and organs, and the possible pathologies that may affect them. These images are of ...
 lecture Raphaël Troncy: ShareIt: Mining SocialMedia Activities for Detecting Events
The list of social networking websites is diverse across the globe but the popularity of social media is indisputable. The 640M+ Facebook users, the 480M+ QZone users or the 200M+ Twitter users are used to sharing observations, opinions and media ...
 lecture Pinar Wennerberg: Siemens │ Semantic Technologies for Advanced Medical Image and Information Search
Clinical care and research deal with large volumes of complex information that originates from different sources and that has different structures and semantics. By establishing an explicit, formal specification of the concepts and their relations for a particular domain, e.g. ...
 lecture Krzysztof Janowicz: SIM-DL_A: A Novel Semantic Similarity Measure for Description Logics Reducing Inter-Concept to Inter-Instance Similarity
While semantic similarity plays a crucial role for human categorization and reasoning, computational similarity measures have also been applied to fields such as semantics-based information retrieval or ontology engineering. Several measures have been developed to compare concepts specified in various ...
 lecture Kristen Parton: Simultaneous Multilingual Search for Translingual Information Retrieval
 lecture Eyal Oren: Sindice.com: Weaving the Open Linked Data
Developers of Semantic Web applications face a challenge with respect to the decentralised publication model: where to find statements about encountered resources. The "linked data" approach, which mandates that resource URIs should be de-referenced and yield metadata about the resource, ...
 lecture Maarten de Rijke: Social media analysis and retrieval technologies
 lecture Gorazd Avguštin: Sodobni pogled na drevo življenja: Mikrobiologova zgodba / Contemporary View of a Tree of Life: The Microbiologist’s Tale
As a taxonomic group, microorganisms made their way to a branch of their own on the common tree of life rather late. For a long time the tree was dominated by representatives of large organisms, that is, animals and plants. Today, however, modern methods enable fast and reliable acquisition of information ...
 lecture David R. Hardoon: Sparse Canonical Correlation Analysis
We present a novel method for solving Canonical Correlation Analysis (CCA) in a sparse convex framework using a least squares approach. The presented method focuses on the scenario when one is interested in (or limited to) a primal representation ...
 lecture Tiezheng Ge: Sparse-Coded Features for Image Retrieval
State-of-the-art image retrieval systems typically represent an image with a bag of low-level features. Since different images often exhibit different kinds of low-level characteristics, it is desirable to represent an image with multiple types of complementary features. The systems scalability ...
 lecture Martin Vetterli: Sparse Sampling: Variations on a Theme by Shannon
Sampling is not only a beautiful topic in harmonic analysis, with an interesting history, but also a subject with high practical impact, at the heart of signal processing and communications and their applications. The question is very simple: when is ...
 lecture Amr Ahmed: Structured Correspondence Topic Models for Mining Captioned Figures in Biological Literature
A major source of information (often the most crucial and informative part) in scholarly articles from scientific journals, proceedings and books are the figures that directly provide images and other graphical illustrations of key experimental results and other scientific contents. ...
 lecture Xinhua Zhang: Tailoring Density Estimation via Reproducing Kernel Moment Matching
Moment matching is a popular means of parametric density estimation. We extend this technique to nonparametric estimation of mixture models. Our approach works by embedding distributions into a reproducing kernel Hilbert space, and performing moment matching in that space. This ...
 lecture Elisabeth Georgii: Targeted retrieval of gene expression measurements using regulatory models
Motivation: Large public repositories of gene expression measurements offer the opportunity to position a new experiment into the context of earlier studies. While previous methods rely on experimental annotation or global similarity of expression profiles across genes or gene sets, ...
 lecture Frederic Jurie: TecnoVision-ROBIN: benchmarking object retrieval algorithms
Technovision is a recent program of the French Ministry of Research and Technology that will fund evaluation projects in the area of computer vision. Many vision algorithms have been proposed in the past, but comparing their performance has been difficult ...
 lecture Florian Metze: The “FAME” Interactive Space
This paper describes the “FAME” multi-modal demonstrator, which integrates multiple communication modes – vision, speech and object manipulation – by combining the physical and virtual worlds to provide support for multi-cultural or multi-lingual communication and problem solving. The major challenges ...
 lecture Jitendra Malik: The Future of Image Search
There are billions of images on the Internet. Today, searching for a desired image is largely based on textual data such as filename or associated text on the web page; not much use is made of the image content. There ...
 lecture Laurence A. F. Park: The Sensitivity of Latent Dirichlet Allocation for Information Retrieval
It has been shown that the use of topic models for information retrieval provides an increase in precision when used in the appropriate form. Latent Dirichlet Allocation (LDA) is a generative topic model that allows us to model documents using ...
 lecture Claudia Wagner: The Wisdom in Tweetonomies: Acquiring Latent Conceptual Structures from Social Awareness Streams
Although one might argue that little wisdom can be conveyed in messages of 140 characters or less, this paper sets out to explore whether the aggregation of messages in social awareness streams, such as Twitter, conveys meaningful information about a ...
 lecture Irem Arikan, Srikanta Bedathur, Klaus Berberich: Time Will Tell: Leveraging Temporal Expressions in IR
Temporal expressions, such as between 1992 and 2000, are frequent across many kinds of documents. Text retrieval, though, treats them as common terms, thus ignoring their inherent semantics. For queries with a strong temporal component, such as U.S. president 1997, ...
 lecture Kivanc Ozonat: Towards a Universal Marketplace over the Web: Statistical Multi-label Classification of Service Provider Forms with Simulated Annealing
There is a growing number of service providers that a consumer can interact with over the web to learn their service terms. The service terms, such as price and time to completion of the service, depend on the consumer's particular ...
 lecture Amelie Anglade: Towards Logic-based Representations of Musical Harmony for Classification, Retrieval and Knowledge Discovery
 lecture Anlei Dong: Towards Recency Ranking in Web Search
In web search, recency ranking refers to ranking documents by relevance which takes freshness into account. In this paper, we propose a retrieval system which automatically detects and responds to recency sensitive queries. The system detects recency sensitive queries using ...
 lecture Daqing He: Translation Enhancement: A New Relevance Feedback Method for Cross-Language Information Retrieval
 lecture Shen-Shyang Ho: Tropical Cyclone Event Sequence Similarity Search via Dimensionality Reduction and Metric Learning
The Earth Observing System Data and Information System (EOSDIS) is a comprehensive data and information system which archives, manages, and distributes Earth science data from the EOS spacecrafts. One non-existent capability in the EOSDIS is the retrieval of satellite sensor ...
 lecture Hugo Zaragoza: "Tuning": Error Optimisation in Ad-Hoc Retrieval
 lecture Deng Cai: Unsupervised Feature Selection for Multi-Cluster Data
In many data analysis tasks, one is often confronted with very high dimensional data. Feature selection techniques are designed to find the relevant feature subset of the original features which can facilitate clustering, classification and retrieval. In this paper, we ...
 lecture Samuel Kaski: User models from implicit feedback for proactive information retrieval
Our research consortium develops user modeling methods for proactive applications. In this project we use machine learning methods for predicting users’ preferences from implicit relevance feedback. Our prototype application is information retrieval, where the feedback signal is measured from eye ...
 lecture José R. Pérez Agüera: Using BM25F for Semantic Search
Information Retrieval (IR) approaches for Semantic Web search engines have become very popular in recent years. The popularization of IR libraries such as Lucene, which allow IR implementations almost out of the box, has made IR integration in Semantic Web search engines easier. ...
 lecture Tetsuya Sakai: Using Graded-Relevance Metrics for Evaluating Community QA Answer Selection
Community Question Answering (CQA) sites such as Yahoo! Answers have emerged as rich knowledge resources for information seekers. However, answers posted to CQA sites can be irrelevant, incomplete, redundant, incorrect, biased, ill-formed or even abusive. Hence, automatic selection of "good" ...
 lecture Mikko Karvonen: Using Mathematical Morphology for Geometric Music Retrieval
 lecture Alexandra Moraru: Using Personalized PageRank for Keyword Based Sensor Retrieval
 lecture Bo Fu, Jens Wissmann: Using Pseudo Feedback to Improve Cross-Lingual Ontology Mapping
While ontologies are widely accepted internationally as a knowledge management mechanism across disciplines, the ability to reason over knowledge bases regardless of the natural languages used in them has become a pressing issue in digital content management. To enable knowledge sharing ...
 lecture Aiden Roger Doherty: Validating the Detection of Everyday Concepts in Visual Lifelogs
The Microsoft SenseCam is a small lightweight wearable camera used to passively capture photos and other sensor readings from a user's day-to-day activities. It can capture up to 3,000 images per day, equating to almost 1 million images per year. ...
 lecture Martha Larson, Gareth Jones: VideoCLEF: evaluation of moving image retrieval
 lecture Pierluigi Plebani: Web services retrieval
 lecture Sebastian Nowozin: Weighted Substructure Mining for Image Analysis
In web-related applications of image categorization, it is desirable to derive an interpretable classification rule with high accuracy. Using the bag-of-words representation and the linear support vector machine, one can partly fulfill the goal, but the accuracy of linear classifiers ...
 lecture Nicola Cancedda: Welcome to the Statistical Multilingual Analysis for Retrieval and Translation - SMART Dissemination Workshop
Statistical Machine Learning has many points of contact with Statistical Machine Translation. This workshop presents recent work within this intersection. It is sponsored both by the Pascal Network of Excellence and the EU-funded SMART project. This is a 3 years ...
 lecture Duen Horng Chau: What to Do When Search Fails: Finding Information by Association
Feldspar lets people find personal information on their computer by specifying chains of associated information as queries, emulating the retrieval process of human associative memory.
 lecture Joan Codina: Workshop on semantic search - Integration of semantic, metadata and image search engines with an engine for patent retrieval
In recent years we have witnessed tremendous interest and substantial economic exploitation of search technologies. On the other hand semantic repositories and reasoning engines have advanced to a state where querying and processing of this knowledge can scale to realistic ...
 lecture Véronique Malaisé: Workshop on semantic search - The Interaction Between Automatic Annotation and Query Expansion: a retrieval experiment on a large cultural heritage archive
In recent years we have witnessed tremendous interest and substantial economic exploitation of search technologies. On the other hand semantic repositories and reasoning engines have advanced to a state where querying and processing of this knowledge can scale to realistic ...
 lecture Hubert Stigler: XML based frameworks in managing and archiving (not only) textual data
Text-technological application scenarios have become a fixed component in many fields of the Humanities, which produce increasing quantities of digital data in the research process. This trend induces the need for strategies for intelligent management and archiving of digital resources ...
 lecture Mounia Lalmas: XML Information Retrieval
Documents usually have content and structure. The content refers to the text of the document, whereas the structure refers to how a document is logically organized. An increasingly common way to encode the structure is through the use of a ...

Panels:


 panel Jochen L. Leidner, Allan Hanbury, Leonoor van der Beek, Khalid Choukri: Cross-Lingual Information access, retrieval, extraction, ... needs, requirements and current state of the art
The main topics that were addressed were the following: the needs of large users of multilingual information archives and their requirements and expectations; the current offers for enterprise search and web; what is preventing the take-off of cross-lingual search applications ...
 panel Rolf Fricke: Implementation of Media Fragment Retrieval for TV Archives & Lessons Learned
 panel Susie Stephens, Rudi Studer, Bo Andersson, Francisco José Farfán, Todor Primov, José Manuel Gómez-Pérez: Next Steps in Intelligent Information Management in the Pharmaceutical and eHealth Domains: Facts and Trends
The pharmaceutical industry is one of the business sectors where the data explosion experienced by the global economy in the last years has had a bigger impact. Motivated by the need of safer and more effective medical treatments, means for ...

Poster:


 poster Pavel Vacha: Content-Based Tile Retrieval System

Tutorials:


 tutorial Natalia Vassilieva: Content Based Image Retrieval (CBIR)
This course will give an overview of the main tasks and methods in the content based image retrieval (CBIR) field. First we will address the question of applying image retrieval methods in real life: what are the real-world problems that ...
 tutorial Hsuan-Tien Lin: Cost-sensitive Classification: Algorithms and Advances
Classification is an important problem in machine learning. It can be used in a variety of applications, such as distinguishing apples, oranges, and bananas automatically. Traditionally, the regular classification problem aims at minimizing the number of future mis-prediction errors. Nevertheless, ...
 tutorial Wray Buntine: Document Analysis
We will consider various problems in document analysis (named entity recognition, natural language parsing, information retrieval), and look at various probabilistic graphical models and algorithms for addressing the problem. This will not be an extensive coverage of information extraction or ...
 tutorial Thomas Hofmann: Information Retrieval
 tutorial Hervé Jégou: Large-scale and larger-scale image search
The first part of this tutorial, dedicated to large-scale image retrieval, will first introduce the typical use-cases and the datasets used for evaluation of image search when considering an unsupervised framework. We will present different classes of techniques considering different ...
 tutorial Wray Buntine: Latent Variable Models for Document Analysis
Wray Buntine will consider various problems in document analysis (named entity recognition, natural language parsing, information retrieval), and look at various probabilistic graphical models and algorithms for addressing the problem. This will not be an extensive coverage of information extraction ...
 tutorial Pedro Domingos: Practical Statistical Relational Learning
The tutorial will be composed of three parts: (1) Foundational areas. The first part will consist of a brief introduction to each of the four foundational areas of SRL: logical inference, inductive logic programming, probabilistic inference, and statistical learning. Obviously, ...
 tutorial Johannes Fürnkranz, Eyke Hullermeier: Preference Learning
The topic of "preferences" has recently attracted considerable attention in artificial intelligence in general and machine learning in particular, where the topic of preference learning has emerged as a new, interdisciplinary research field with close connections to related areas such ...
 tutorial Deepayan Chakrabarti, Deepak Agarwal: Statistical Challenges in Computational Advertising
Many organizations now devote significant fractions of their advertising/outreach budgets to online advertising; ad-networks like Yahoo!, Google, MSN have responded by constructing new kinds of economic models and perform the fundamental task of matching the most relevant ads (selected from ...
 tutorial Marie-Francine Moens: Text Mining, Information and Fact Extraction (TMIFE)
communities (medical informatics, security, blog and news analysis, business information analysis, legal informatics, etc.). Still, today it is a somewhat fragmented subfield of human language technologies and information retrieval where the themes of (often forgotten) old-style pattern-based IE and more ...

Advertisement:


 advertisement Raphaël Troncy: Describing Media Assets – media fragment specification and description
Semantic descriptions of non-textual media available on the Web can facilitate retrieval, re-use and presentation of media assets. Semantic Web languages can represent controlled vocabularies and shared annotations of media content on the Web. By identifying concepts to consider, Uniform ...