Online Learning in Non-Stationary Markov Decision Processes

Published on Aug 06, 2013 · 3040 Views

We consider online learning in Markov decision processes with adversarial reward functions. Depending on the information available to the decision maker, we analyze two scenarios: in one setup the

Chapter list

Online learning in non-stationary Markov decision processes (00:00)
The learning problem (01:27)
Results (03:54)