
Learning from Logged Implicit Exploration Data
Published on Feb 4, 2025 · 2810 Views
We provide a sound and consistent foundation for the use of nonrandom exploration data in "contextual bandit" or "partially labeled" settings where only the value of a chosen action is learned. The pr