Apprenticeship Learning Using Linear Programming
Published on Aug 12, 20084316 Views
In apprenticeship learning, the goal is to learn a policy in a Markov decision process that is at least as good as a policy demonstrated by an expert. The difficulty arises in that the MDP's true rewa