Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition

Due to the mismatch between training and operating conditions, speech recognition systems often exhibit dramatic performance degradation when they are practically used in real-world environments. Various techniques based on noise resistance, speech enhancement and model compensation approaches have...

Full description

Saved in:
Bibliographic Details
Main Authors: Abd. Manan, Ahmad, Tan, Sang Sang
Format: Conference or Workshop Item
Published: 2006
Subjects:
Online Access:http://eprints.utm.my/id/eprint/25199/
http://comp.utm.my/pars/files/2013/04/Model-Compensation-Using-Parallel-Model-Combination-and-Adaptation-Technique-for-Improved-Accuracy-in-Noisy-Speech-Recognition.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.25199
record_format eprints
spelling my.utm.251992017-02-07T07:37:34Z http://eprints.utm.my/id/eprint/25199/ Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition Abd. Manan, Ahmad Tan, Sang Sang QA75 Electronic computers. Computer science Due to the mismatch between training and operating conditions, speech recognition systems often exhibit dramatic performance degradation when they are practically used in real-world environments. Various techniques based on noise resistance, speech enhancement and model compensation approaches have been widely used to improve the performance of speech recognition in noise. Parallel Model Combination (PMC) is one of the most popular techniques and has been proved powerful in compensating recognition models, so that they reflect speech in noisy acoustic environments. However, studies have shown that some assumptions and approximations made in the PMC, primarily in the domain transformation and parameter combination processes are not accurate in certain situations, thus restricting the achievement of better performance. This research suggests using Maximum Likelihood Spectral Transformation (MLST) as the adaptation technique to further improve the performance of PMC. MLST is a transformation-based technique. It has some advantages over other adaptation techniques like Maximum Likelihood Linear Regression (MLLR) and Maximum A Posterior (MAP). MLST requires only a small amount of adaptation data and the adaptation process is computationally inexpensive. The proposed method is denoted as Adaptive Parallel Model Combination (APMC). 2006 Conference or Workshop Item PeerReviewed Abd. Manan, Ahmad and Tan, Sang Sang (2006) Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition. In: Proc. Postgraduate Annual Research Seminar 2006 (PARS 2006) , 2006, UTM. http://comp.utm.my/pars/files/2013/04/Model-Compensation-Using-Parallel-Model-Combination-and-Adaptation-Technique-for-Improved-Accuracy-in-Noisy-Speech-Recognition.pdf
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Abd. Manan, Ahmad
Tan, Sang Sang
Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition
description Due to the mismatch between training and operating conditions, speech recognition systems often exhibit dramatic performance degradation when they are practically used in real-world environments. Various techniques based on noise resistance, speech enhancement and model compensation approaches have been widely used to improve the performance of speech recognition in noise. Parallel Model Combination (PMC) is one of the most popular techniques and has been proved powerful in compensating recognition models, so that they reflect speech in noisy acoustic environments. However, studies have shown that some assumptions and approximations made in the PMC, primarily in the domain transformation and parameter combination processes are not accurate in certain situations, thus restricting the achievement of better performance. This research suggests using Maximum Likelihood Spectral Transformation (MLST) as the adaptation technique to further improve the performance of PMC. MLST is a transformation-based technique. It has some advantages over other adaptation techniques like Maximum Likelihood Linear Regression (MLLR) and Maximum A Posterior (MAP). MLST requires only a small amount of adaptation data and the adaptation process is computationally inexpensive. The proposed method is denoted as Adaptive Parallel Model Combination (APMC).
format Conference or Workshop Item
author Abd. Manan, Ahmad
Tan, Sang Sang
author_facet Abd. Manan, Ahmad
Tan, Sang Sang
author_sort Abd. Manan, Ahmad
title Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition
title_short Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition
title_full Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition
title_fullStr Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition
title_full_unstemmed Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition
title_sort model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition
publishDate 2006
url http://eprints.utm.my/id/eprint/25199/
http://comp.utm.my/pars/files/2013/04/Model-Compensation-Using-Parallel-Model-Combination-and-Adaptation-Technique-for-Improved-Accuracy-in-Noisy-Speech-Recognition.pdf
_version_ 1643647573390524416
score 13.209306