Universal Value Function Approximators
Published on Dec 05, 2015
Value functions are a core component of reinforcement learning. The main idea is to construct a single function approximator V(s; theta) that estimates the long-term reward from any state s, using