en-es
en
0.25
0.5
0.75
1.25
1.5
1.75
2
Online Learning in Non-Stationary Markov Decision Processes
Published on Aug 06, 20133041 Views
We consider online learning in Markov decision processes with adversarial reward functions. Depending on the information available to the decision maker, we analyze two scenarios: in one setup the
Related categories
Chapter list
Online learning in non-stationry Markov decision processes00:00
The learning problem01:27
Results03:54