Analysis of Malay Speech Recognition for Different Speaker Origins

Bibliographic Details
Main Authors: Juan, Sarah Samson; Besacier, Laurent; Tan, Tien-Ping
Format: Conference or Workshop Item
Published: IEEE 2012
Online Access: http://ir.unimas.my/id/eprint/8877/
http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6473738
Description
Summary: This paper explores speech recognition performance for the Malay language with multiple accents from speakers of different origins or ethnicities. Accented speech poses an accuracy problem for automatic speech recognition systems. This frequently occurs with non-native speakers of a language because the recognizers lack sufficient non-native data. In this study, we investigate this problem by building a Malay model in our recognizer and testing its performance for speakers of various ethnicities. Our Malay corpora consist of read speech and texts collected from local newspapers in Malaysia. The speakers who contributed the speech are of different ethnic backgrounds. We employ context-dependent models with linear discriminant analysis for our acoustic model, and a trigram-based language model. Our experiments show improved results when linear discriminant analysis was employed in our model, while our recognizer performed worst for speakers whose accents were not represented in the training data.