Tandem Connectionist Feature Extraction for Conversational Speech Recognition

author:Barry Chen, American Electronics Association
published: Feb. 25, 2007,   recorded: June 2004,   views: 79
Categories
You might be experiencing some problems with Your Video player.

Related content

Visitors who watched this lecture also watched...
22:31
Tandem Connectionist Feature Extraction for Conversational Speech Recognition

124 views - Qifeng Zhu, 2004
20:21
IBM Speech Activity System

143 views - Etienne Marcheret, 2006
01:00:47
Gaussian Process Basics

12618 views - David MacKay, 2006
27:50
Facial expression recognition and emotion recognition from speech

1121 views - Dragos Datcu, 2006
34:42
Speech-to-Text Evaluation System

142 views - Andreas Stolcke, 1970
16:08
Hierarchical Multi-Stream Posterior Based Speech Recognition System

73 views - Hamed Ketabdar, 1970
03:54:31
Support Vector Machines

12754 views - Chih-Jen Lin, 2006
04:38
Interview with Fei-Fei Li

3414 views - Davor Orlič, Fei-Fei Li, 2006
30:13
Mixtures of Neural Nets

241 views - Edward Snelson, 2005
33:39
Audio-Visual Speech Analysis & Recognition

298 views - Nassos Katsamanis, 2008

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.

Description

Multi-Layer Perceptrons (MLPs) can be used in automatic speech recognition in many ways. A particular application of this tool over the last few years has been the Tandem approach, as described by Hermansky et al in a number of publications. Here we discuss the characteristics of the MLP-based features used for the Tandem approach, and conclude with a report on their application to conversational speech recognition. The paper shows that MLP transformations yield variables that have regular distributions, which can be further modified by using logarithm to make the distribution easier to model by a Gaussian-HMM. Two or more vectors of these features can easily be combined without increasing the feature dimension. We also report recognition results that show that MLP features can significantly improve recognition performance for the NIST 2001 Hub-5 evaluation set with models trained on the Switchboard Corpus, even for complex systems incorporating MMIE training and other enhancements.

Link this page  

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Write your own review or comment:

make sure you have javascript enabled or clear this field: