Bandit Algorithms for Online Linear Optimization
Published on Aug 26, 2009
5670 Views
In the online linear optimization problem, a forecaster chooses, at each time instance, a vector x from a given subset S of the D-dimensional Euclidean space and suffers a time-dependent loss t