Minimizing word error rate in a dyslexic reading-oriented ASR engine using phoneme refinement and alternative pronunciation

Little attention has been given to detecting miscues in the text space read by dyslexic children over an automatic speech recognition (ASR) engine. In an ASR system, the miscues are represented by word error rate (WER) and miscue detection rate (MDR). At all time, WER must be kept low, and MDR high...

Full description

Saved in:
Bibliographic Details
Main Authors: Husni, Husniza, Jamaludin, Zulikha
Format: Book Section
Language:English
Published: IATED publication 2010
Subjects:
Online Access:http://repo.uum.edu.my/1339/1/Abrack_view_Zulaikha%2C_J..pdf
http://repo.uum.edu.my/1339/
http://www.iated.org
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Little attention has been given to detecting miscues in the text space read by dyslexic children over an automatic speech recognition (ASR) engine. In an ASR system, the miscues are represented by word error rate (WER) and miscue detection rate (MDR). At all time, WER must be kept low, and MDR high so as to achieve better recognition. This paper focus on minimizing word error rate by formulating a better model for perspicuous representation of input data. Such representation takes into account phoneme refinement and alternative pronunciation for a particular Bahasa Melayu (BM) speech data uttered by dyslexic children. Based on literature, a few other optimal models of input data and their recognition results were compared. It is found that phoneme refinement and alternative pronunciation produced better recognition results as evidenced in the performance metrics --lower WER and higher MDR-- which are 25% and 80.77% respectively.