Reinforcement Learning with Limited Reinforcement: Using Bayes Risk for Active Learning in POMDPs

Published on 2008-08-06
5775 Views

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains because they optimally trade off between actions that increase an agent's knowledge and actions that increase an
