Piecewise-Stationary Bandit Problems with Side Observations
Published on Aug 26, 20092821 Views
We consider a sequential decision problem where the rewards are generated by a piecewise-stationary distribution. However, the different reward distributions are unknown and may change at unknown inst