video thumbnail
Pause
Mute
Subtitles
Playback speed
0.25
0.5
0.75
1
1.25
1.5
1.75
2
Full screen

Online Learning in Non-Stationary Markov Decision Processes

Published on Feb 4, 20253045 Views

We consider online learning in Markov decision processes with adversarial reward functions. Depending on the information available to the decision maker, we analyze two scenarios: in one setup the

Related categories

Presentation

Online learning in non-stationry Markov decision processes00:00
The learning problem01:27
Results03:54