Online Learning in Non-Stationary Markov Decision Processes thumbnail
slide-image
Pause
Mute
Subtitles not available
Playback speed
0.25
0.5
0.75
1
1.25
1.5
1.75
2
Full screen

Online Learning in Non-Stationary Markov Decision Processes

Published on Aug 06, 20133038 Views

We consider online learning in Markov decision processes with adversarial reward functions. Depending on the information available to the decision maker, we analyze two scenarios: in one setup the

Related categories

Chapter list

Online learning in non-stationry Markov decision processes00:00
The learning problem01:27
Results03:54