Trading Regret Rate for Computational Efficiency in Online Learning with Limited Feedback
Published on Aug 26, 20093125 Views
We study low regret algorithms for online learning with limited feedback, where there is an additional constraint on the computational power of the learner. Focusing on multi-armed bandit with side in