Online Reinforcement Learning from Concurrent Customer Interaction Sequences

Published on 2013-05-28, 4192 views

David Silver

This talk explores applications in which a company interacts with many customers. The company has an objective function, such as maximising revenue, customer satisfaction, or customer loyalty, which

LSOLDM 2012 - Cumberland Lodge
