Learning from Logged Implicit Exploration Data
Published on Mar 25, 2011
2806 Views
We provide a sound and consistent foundation for the use of nonrandom exploration data in "contextual bandit" or "partially labeled" settings where only the value of a chosen action is learned. The pr