
en
0.25
0.5
0.75
1.25
1.5
1.75
2
Piecewise-Stationary Bandit Problems with Side Information
Published on Feb 4, 20253154 Views
We consider a sequential decision problem where the rewards are generated by a piecewise-stationary distribution. However, the different reward distributions are unknown and may change at unknown