
Interactively Optimizing Information Systems as a Dueling Bandits Problem
Published on 2008-12-20, 4208 views
We present an online learning framework tailored towards real-time learning from observed user behavior in search engines and other information access systems. In particular, we only require pairwise