Piecewise-Stationary Bandit Problems with Side Observations thumbnail
Pause
Mute
Subtitles
Playback speed
0.25
0.5
0.75
1
1.25
1.5
1.75
2
Full screen

Piecewise-Stationary Bandit Problems with Side Observations

Published on Aug 26, 20092822 Views

We consider a sequential decision problem where the rewards are generated by a piecewise-stationary distribution. However, the different reward distributions are unknown and may change at unknown inst

Related categories