Online Learning in Non-Stationary Markov Decision Processes

Published on 2013-08-063054 Views

Gergely Neu

We consider online learning in Markov decision processes with adversarial reward functions. Depending on the information available to the decision maker, we analyze two scenarios: in one setup the

Knowledge 4 All Foundation Video Journal Volume 4

Related categories

On-line Learning

Presentation

Online learning in non-stationry Markov decision processes00:00

The learning problem01:27

Results03:54