TrueSkill and AdPredictor: Large Scale Machine Learning in the Wild
published: Sept. 20, 2010, recorded: September 2010, views: 500
Report a problem or upload filesIf you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Probabilistic Graphical Models play a crucial role in Microsoft's online services. In this talk, I will describe two powerful applications of machine learning in practice. TrueSkill is Xbox Live's Ranking and Matchmaking system and ensures that gamers online have balanced and exciting matches with equally skilled opponents. AdPredictor is the system that estimates click-through rates (CTR) for ad selection and pricing within Microsoft's search engine Bing. The two systems have in common that they are based on factor graph models and approximate Bayesian inference. They operate at a very large scale involving millions of gamers and billions of ad impressions, respectively. However, in this talk, I will put particular emphasis on those aspects of these applications that are not part of the generic machine learning setting: a) The difficulties that arise because these are closed-loop systems in which the predictions determine the future composition of the training sample. b) The consequences of the fact that these systems make decisions that have an impact on more or less rational agents (advertisers, users, gamers) with the ability to influence the training sample. Time permitting, I will show the two systems in action. This is based on joint work with Ralf Herbrich, Thomas Borchert, Tom Minka, and Joaquin Quińonero Candela.
Download slides: wapa2010_graepel_tals_01.pdf (3.3 MB)
Link this pageWould you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !