Piecewise-Stationary Bandit Problems with Side Information
Published on Aug 26, 2009
3150 Views
We consider a sequential decision problem where the rewards are generated by a piecewise-stationary distribution. However, the different reward distributions are unknown and may change at unknown