Reinforcement Learning with Limited Reinforcement: Using Bayes Risk for Active Learning in POMDPs
Published on Aug 06, 20085770 Views
Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains because they optimally trade between actions that increase an agent's knowledge and actions that increase an