On-line Learning of Wide-domain Spoken Dialogue Systems
published: July 31, 2016, recorded: July 2016, views: 1133
The first part of the talk reviews the general structure of limited-domain statistical spoken dialogue systems (SDS) and then explains how a collection of limited-domain systems can be merged using the framework of Bayesian Committee Machines. The problem of reward estimation in on-line learning is then introduced, and a solution is presented that jointly estimates a Gaussian-process-based reward predictor and the dialogue policy.
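As a rough illustration of the committee idea, the sketch below applies the standard Bayesian Committee Machine rule for combining Gaussian predictions from several estimators that share a prior. It is not taken from the talk: the function name `bcm_combine`, its parameters, and the example numbers are hypothetical, and it assumes each limited-domain system supplies a predictive mean and variance for the same quantity (for example, a value estimate for a candidate action).

```python
from typing import Sequence, Tuple


def bcm_combine(
    means: Sequence[float],
    variances: Sequence[float],
    prior_variance: float,
) -> Tuple[float, float]:
    """Combine Gaussian predictions with the Bayesian Committee Machine rule.

    Hypothetical helper for illustration only.
    means, variances: one predictive mean and variance per committee member.
    prior_variance: variance of the prior shared by all members.
    """
    if len(means) != len(variances) or not means:
        raise ValueError("need one mean and one variance per member")
    m = len(means)
    # Sum the members' precisions, then subtract the prior precision
    # (m - 1) times so the shared prior is counted only once.
    precision = sum(1.0 / v for v in variances) - (m - 1) / prior_variance
    if precision <= 0.0:
        raise ValueError("combined precision must be positive")
    variance = 1.0 / precision
    # The combined mean is the precision-weighted sum of member means.
    mean = variance * sum(mu / v for mu, v in zip(means, variances))
    return mean, variance


# Two members that disagree, each with unit variance, under a broad prior.
combined_mean, combined_variance = bcm_combine([1.0, 3.0], [1.0, 1.0], 4.0)
```

With a single member the rule returns that member's prediction unchanged, and a member reporting a large variance contributes little to the combined mean, which is the behaviour one would want when a domain-specific system is asked about input outside its domain.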