Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition
Due to the mismatch between training and operating conditions, speech recognition systems often exhibit dramatic performance degradation when they are practically used in real-world environments. Various techniques based on noise resistance, speech enhancement and model compensation approaches have...
Saved in:
Main Authors: | , |
---|---|
Format: | Conference or Workshop Item |
Published: |
2006
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/25199/ http://comp.utm.my/pars/files/2013/04/Model-Compensation-Using-Parallel-Model-Combination-and-Adaptation-Technique-for-Improved-Accuracy-in-Noisy-Speech-Recognition.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utm.25199 |
---|---|
record_format |
eprints |
spelling |
my.utm.251992017-02-07T07:37:34Z http://eprints.utm.my/id/eprint/25199/ Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition Abd. Manan, Ahmad Tan, Sang Sang QA75 Electronic computers. Computer science Due to the mismatch between training and operating conditions, speech recognition systems often exhibit dramatic performance degradation when they are practically used in real-world environments. Various techniques based on noise resistance, speech enhancement and model compensation approaches have been widely used to improve the performance of speech recognition in noise. Parallel Model Combination (PMC) is one of the most popular techniques and has been proved powerful in compensating recognition models, so that they reflect speech in noisy acoustic environments. However, studies have shown that some assumptions and approximations made in the PMC, primarily in the domain transformation and parameter combination processes are not accurate in certain situations, thus restricting the achievement of better performance. This research suggests using Maximum Likelihood Spectral Transformation (MLST) as the adaptation technique to further improve the performance of PMC. MLST is a transformation-based technique. It has some advantages over other adaptation techniques like Maximum Likelihood Linear Regression (MLLR) and Maximum A Posterior (MAP). MLST requires only a small amount of adaptation data and the adaptation process is computationally inexpensive. The proposed method is denoted as Adaptive Parallel Model Combination (APMC). 2006 Conference or Workshop Item PeerReviewed Abd. Manan, Ahmad and Tan, Sang Sang (2006) Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition. In: Proc. Postgraduate Annual Research Seminar 2006 (PARS 2006) , 2006, UTM. http://comp.utm.my/pars/files/2013/04/Model-Compensation-Using-Parallel-Model-Combination-and-Adaptation-Technique-for-Improved-Accuracy-in-Noisy-Speech-Recognition.pdf |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Abd. Manan, Ahmad Tan, Sang Sang Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition |
description |
Due to the mismatch between training and operating conditions, speech recognition systems often exhibit dramatic performance degradation when they are practically used in real-world environments. Various techniques based on noise resistance, speech enhancement and model compensation approaches have been widely used to improve the performance of speech recognition in noise. Parallel Model Combination (PMC) is one of the most popular techniques and has been proved powerful in compensating recognition models, so that they reflect speech in noisy acoustic environments. However, studies have shown that some assumptions and approximations made in the PMC, primarily in the domain transformation and parameter combination processes are not accurate in certain situations, thus restricting the achievement of better performance. This research suggests using Maximum Likelihood Spectral Transformation (MLST) as the adaptation technique to further improve the performance of PMC. MLST is a transformation-based technique. It has some advantages over other adaptation techniques like Maximum Likelihood Linear Regression (MLLR) and Maximum A Posterior (MAP). MLST requires only a small amount of adaptation data and the adaptation process is computationally inexpensive. The proposed method is denoted as Adaptive Parallel Model Combination (APMC). |
format |
Conference or Workshop Item |
author |
Abd. Manan, Ahmad Tan, Sang Sang |
author_facet |
Abd. Manan, Ahmad Tan, Sang Sang |
author_sort |
Abd. Manan, Ahmad |
title |
Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition |
title_short |
Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition |
title_full |
Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition |
title_fullStr |
Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition |
title_full_unstemmed |
Model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition |
title_sort |
model compensation using parallel model combination and adaptation technique for improved accuracy in noisy speech recognition |
publishDate |
2006 |
url |
http://eprints.utm.my/id/eprint/25199/ http://comp.utm.my/pars/files/2013/04/Model-Compensation-Using-Parallel-Model-Combination-and-Adaptation-Technique-for-Improved-Accuracy-in-Noisy-Speech-Recognition.pdf |
_version_ |
1643647573390524416 |
score |
13.209306 |