Unsupervised Estimation for Noisy-Channel Models
Description
Shannon’s Noisy-Channel model, which describes how a corrupted message might be reconstructed, has been the corner stone for much work in statistical language and speech processing. The model factors into two components: a language model to characterize the original message and a channel model to describe the channel’s corruptive process. The standard approach for estimating the parameters of the channel model is unsupervised Maximum-Likelihood of the observation data, usually approximated using the Expectation-Maximization (EM) algorithm. In this paper we show that it is better to maximize the joint likelihood of the data at both ends of the noisy-channel. We derive a corresponding bi-directional EM algorithm and show that it gives better performance than standard EM on two tasks: (1) translation using a probabilistic lexicon and (2) adaptation of a part-of-speech tagger between related languages.
Categories
Top: Computer Science: Machine Learning: Statistical LearningTop: Computer Science: Machine Learning: Unsupervised learning
| Slides | |
| 0:00 | Unsupervised Estimation for Noisy-Channel Models |
| 0:15 | The Noisy-Channel Approach |
| 1:53 | Noisy-Channel Parameters |
| 4:16 | (Uni-)EM Algorithm |
| 5:54 | Problem: Parameter Estimation |
| 8:05 | Bi-Directional EM (Bi-EM) |
| 10:43 | Empirical Results 1: Translation - 1 |
| 12:36 | Empirical Results 1: Translation - 2 |
| 13:34 | Empirical Results 1: Translation (Adding Target Data) |
| 14:39 | Empirical Results 2: POS Tagger Adaptation |
| 17:03 | - Questions |
Lecture rating
| People found this lecture: | ||
| Worth seeing | ||
| because it is: | ||
| Valuable and informative | ||
| Well presented | ||
| Easily understandable | ||
| Acceptably recorded | ||
| You need to login to cast your vote. | ||
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Related content
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !



