
Reinforcement Learning with Limited Reinforcement: Using Bayes Risk for Active Learning in POMDPs
Published on 2008-08-06, 5775 Views
Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains because they optimally trade between actions that increase an agent's knowledge and actions that increase an