Learning Consensus Opinion: Mining Data from a Labeling Game
author:David Maxwell Chickering, Microsoft Live Labs, Microsoft
author:Anton Mityagin, Microsoft Live Labs, Microsoft
published: May 20, 2009, recorded: April 2009, views: 52
Related content
27:08
121 views - Thore Graepel, David Stern, Ralf Herbrich, 2009
41:17
221 views - Tim Berners Lee, 2009
22:40
76 views - Sihong Xie, Wei Fan, Jing Peng, Olivier Verscheure, Jiangtao Ren, 2009
01:09:02
264 views - Bruce Croft, 2008
24:21
706 views - Bing Liu, 2008
01:02:31
210 views - Lillian Lee, 2009
28:29
97 views - Danushka Bollegala, Yutaka Matsuo, Mitsuru Ishizuka, 2009
25:48
66 views - Surajit Chaudhuri, Venkatesh Ganti, Dong Xin, 2009
50:38
103 views - Ricardo Baeza-Yates, 2009
01:43:02
16728 views - Michael Berthold, 2005
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Description
In this paper, we consider the challenge of how to identify the consensus opinion of a set of users as to how the results for a query should be ranked. Once consensus rankings are identified for a set of queries, these rankings can serve for both evaluation and training of retrieval and learning systems. We present a novel approach to collecting user preferences over image-search results: we use a collaborative game in which players are rewarded for agreeing on which image result is best for a query. Our approach is distinct from other labeling games because we are able to elicit directly the preferences of interest with respect to image queries extracted from query logs. As a source of relevance judgments, this data provides a useful complement to click data. Furthermore, it is free of positional biases and does not carry the risk of frustrating users with non-relevant results associated with proposed mechanisms for debiasing clicks. We describe data collected over 35 days from a deployed version of this game that amounts to about 19 million expressed preferences between pairs. Finally, we present several approaches to modeling this data in order to extract the consensus rankings from the preferences and better sort the search results for targeted queries.
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !




Write your own review or comment: