Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition

Due to the mismatch between training and operating conditions, speech recognition systems often exhibit dramatic performance degradation when they are practically used in real-world environments. Various techniques based on noise resistance, speech enhancement and model compensation approaches have...

Full description

Saved in:
Bibliographic Details
Main Authors: Abd. Manan, Ahmad, Tan, Sang Sang
Format: Conference or Workshop Item
Published: 2006
Subjects:
Online Access:http://eprints.utm.my/id/eprint/25199/
http://comp.utm.my/pars/files/2013/04/Model-Compensation-Using-Parallel-Model-Combination-and-Adaptation-Technique-for-Improved-Accuracy-in-Noisy-Speech-Recognition.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Due to the mismatch between training and operating conditions, speech recognition systems often exhibit dramatic performance degradation when they are practically used in real-world environments. Various techniques based on noise resistance, speech enhancement and model compensation approaches have been widely used to improve the performance of speech recognition in noise. Parallel Model Combination (PMC) is one of the most popular techniques and has been proved powerful in compensating recognition models, so that they reflect speech in noisy acoustic environments. However, studies have shown that some assumptions and approximations made in the PMC, primarily in the domain transformation and parameter combination processes are not accurate in certain situations, thus restricting the achievement of better performance. This research suggests using Maximum Likelihood Spectral Transformation (MLST) as the adaptation technique to further improve the performance of PMC. MLST is a transformation-based technique. It has some advantages over other adaptation techniques like Maximum Likelihood Linear Regression (MLLR) and Maximum A Posterior (MAP). MLST requires only a small amount of adaptation data and the adaptation process is computationally inexpensive. The proposed method is denoted as Adaptive Parallel Model Combination (APMC).