
0.25
0.5
0.75
1.25
1.5
1.75
2
Universal Value Function Approximators
Published on 2015-12-052078 Views
Value functions are a core component of reinforcement learning. The main idea is to to construct a single function approximator V(s; theta) that estimates the long-term reward from any state s, using