Analysis of Malay Speech Recognition for Different Speaker Origins

This paper explores speech recognition performance for the Malay language across multiple accents, using speakers of different origins or ethnicities. Accented speech poses an accuracy problem for automatic speech recognition systems. The problem frequently affects non-native speakers of a language due to insufficien...

Bibliographic Details
Main Authors: Juan, Sarah Samson, Besacier, Laurent, Tan, Tien-Ping
Format: Conference or Workshop Item
Published: IEEE 2012
Subjects: T Technology (General)
Online Access: http://ir.unimas.my/id/eprint/8877/
http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6473738
id my.unimas.ir.8877
record_format eprints
spelling my.unimas.ir.8877 2015-10-16T01:19:03Z PeerReviewed Juan, Sarah Samson and Besacier, Laurent and Tan, Tien-Ping (2012) Analysis of Malay Speech Recognition for Different Speaker Origins. In: Proceedings of International Conference on Asian Language Processing (IALP). http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6473738
institution Universiti Malaysia Sarawak
building Centre for Academic Information Services (CAIS)
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sarawak
content_source UNIMAS Institutional Repository
url_provider http://ir.unimas.my/
topic T Technology (General)
description This paper explores speech recognition performance for the Malay language across multiple accents, using speakers of different origins or ethnicities. Accented speech poses an accuracy problem for automatic speech recognition systems. The problem frequently affects non-native speakers of a language because recognizers are trained on insufficient non-native data. In this study, we investigate this problem by building a Malay model in our recognizer and testing its performance on speakers of various ethnicities. Our Malay corpora consist of read speech and texts collected from local newspapers in Malaysia. The speakers who contributed the speech are of different ethnic backgrounds. We employ context-dependent models, applying linear discriminant analysis in our acoustic model, together with a trigram-based language model. Our experiments show improved results when linear discriminant analysis was employed in our model, while our recognizer performed worst for speakers whose accents were not represented in the training data.
format Conference or Workshop Item
author Juan, Sarah Samson
Besacier, Laurent
Tan, Tien-Ping
title Analysis of Malay Speech Recognition for Different Speaker Origins
publisher IEEE
publishDate 2012
url http://ir.unimas.my/id/eprint/8877/
http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6473738