
Robust partially observable Markov decision process
Published on 2015-09-27, 1543 views
We seek to find the robust policy that maximizes the expected cumulative reward for the worst case when a partially observable Markov decision process (POMDP) has uncertain parameters whose values are