Efficient Learning in Large-Scale Combinatorial Semi-Bandits
Published on Dec 05, 2015
1532 Views
A stochastic combinatorial semi-bandit is an online learning problem where at each step a learning agent chooses a subset of ground items subject to combinatorial constraints, and then observes stocha
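
The interaction described above can be illustrated with a minimal simulation sketch. It rests on assumptions not stated in the text: the combinatorial constraint is taken to be "any subset of exactly k items", item weights are Bernoulli, the payoff is the sum of the chosen items' weights, and the agent uses a generic optimistic (UCB-style) index. The function name `run_semi_bandit` and its parameters are hypothetical, and this is not presented as the method of the talk.

```python
import math
import random


def run_semi_bandit(means, k, steps, seed=0):
    """Simulate a stochastic combinatorial semi-bandit with a top-k constraint.

    Assumptions (illustrative only): Bernoulli item weights with the given
    means, feasible sets are all subsets of exactly k items, and the agent
    ranks items by a UCB-style optimistic index.
    """
    rng = random.Random(seed)
    n = len(means)
    pulls = [0] * n       # number of times each item's weight was observed
    totals = [0.0] * n    # sum of observed weights per item
    total_reward = 0.0

    for t in range(1, steps + 1):
        def index(i):
            # Unobserved items get +inf so that each is tried at least once.
            if pulls[i] == 0:
                return math.inf
            mean = totals[i] / pulls[i]
            return mean + math.sqrt(1.5 * math.log(t) / pulls[i])

        # Choose a feasible subset: here, the k items with the highest index.
        chosen = sorted(range(n), key=index, reverse=True)[:k]

        # Semi-bandit feedback: the weight of every chosen item is observed
        # individually, and the payoff is the sum of those weights.
        for i in chosen:
            w = 1.0 if rng.random() < means[i] else 0.0
            pulls[i] += 1
            totals[i] += w
            total_reward += w

    return pulls, total_reward
```

For example, `run_semi_bandit([0.9, 0.8, 0.2, 0.1], k=2, steps=200)` returns the per-item observation counts and the cumulative payoff. Richer constraints (paths, matchings, spanning trees) would replace the top-k selection step with the corresponding combinatorial optimization routine.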