Corpus design for Malay corpus-based speech synthesis system
Problem statement: Speech corpus is one of the major components in corpus-based synthesis. The quality and coverage in speech corpus will affect the quality of synthesis speech sound. Approach: This study proposes a corpus design for Malay corpus-based speech synthesis system. This includes the stud...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Published: |
Science Publications
2009
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/13281/ http://dx.doi.org/10.3844/ajas.2009.696.702 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
id |
my.utm.13281 |
---|---|
record_format |
eprints |
spelling |
my.utm.132812011-07-29T09:40:16Z http://eprints.utm.my/id/eprint/13281/ Corpus design for Malay corpus-based speech synthesis system Tan, Tian-Swee Sh-Hussain, Sh-Hussain R Medicine (General) Problem statement: Speech corpus is one of the major components in corpus-based synthesis. The quality and coverage in speech corpus will affect the quality of synthesis speech sound. Approach: This study proposes a corpus design for Malay corpus-based speech synthesis system. This includes the study of design criteria in corpus-based speech synthesis, Malay corpus based database design and the concatenation engine in Malay corpus-based synthesis system. A set of 10 millions digital text corpuses for Malay language has been collected from Malay internet news. This text corpus had been analyzed using word frequency count to find out all high frequency words to be used for designing the sentences for speech corpus. Results: Altogether 381 sentences for speech corpus had been designed using 70% of high frequency words from 10 million text corpus. It consists of 16826 phoneme units and the total storage size is 37.6Mb. All the phone units are phonetically transcribed to preserve the phonetic context of its origin that will be used for phonetic context unit. This speech corpus had been labeled at phoneme level and used for variable length continuous phoneme based concatenation. Speech corpus is one of the major components in corpus-based synthesis. The quality and coverage in speech corpus will affect the quality of synthesized speech sound. Conclusion/Recommendation: This study has proposed a platform for designing speech corpus especially for Malay Text to Speech which can be further enhanced to support more coverage and higher naturalness of synthetic speech. Science Publications 2009 Article PeerReviewed Tan, Tian-Swee and Sh-Hussain, Sh-Hussain (2009) Corpus design for Malay corpus-based speech synthesis system. American Journal of Applied Sciences, 6 (4). pp. 696-702. ISSN 15469239 http://dx.doi.org/10.3844/ajas.2009.696.702 doi:10.3844/ajas.2009.696.702 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
R Medicine (General) |
spellingShingle |
R Medicine (General) Tan, Tian-Swee Sh-Hussain, Sh-Hussain Corpus design for Malay corpus-based speech synthesis system |
description |
Problem statement: Speech corpus is one of the major components in corpus-based synthesis. The quality and coverage in speech corpus will affect the quality of synthesis speech sound. Approach: This study proposes a corpus design for Malay corpus-based speech synthesis system. This includes the study of design criteria in corpus-based speech synthesis, Malay corpus based database design and the concatenation engine in Malay corpus-based synthesis system. A set of 10 millions digital text corpuses for Malay language has been collected from Malay internet news. This text corpus had been analyzed using word frequency count to find out all high frequency words to be used for designing the sentences for speech corpus. Results: Altogether 381 sentences for speech corpus had been designed using 70% of high frequency words from 10 million text corpus. It consists of 16826 phoneme units and the total storage size is 37.6Mb. All the phone units are phonetically transcribed to preserve the phonetic context of its origin that will be used for phonetic context unit. This speech corpus had been labeled at phoneme level and used for variable length continuous phoneme based concatenation. Speech corpus is one of the major components in corpus-based synthesis. The quality and coverage in speech corpus will affect the quality of synthesized speech sound. Conclusion/Recommendation: This study has proposed a platform for designing speech corpus especially for Malay Text to Speech which can be further enhanced to support more coverage and higher naturalness of synthetic speech.
|
format |
Article |
author |
Tan, Tian-Swee Sh-Hussain, Sh-Hussain |
author_facet |
Tan, Tian-Swee Sh-Hussain, Sh-Hussain |
author_sort |
Tan, Tian-Swee |
title |
Corpus design for Malay corpus-based speech synthesis system |
title_short |
Corpus design for Malay corpus-based speech synthesis system |
title_full |
Corpus design for Malay corpus-based speech synthesis system |
title_fullStr |
Corpus design for Malay corpus-based speech synthesis system |
title_full_unstemmed |
Corpus design for Malay corpus-based speech synthesis system |
title_sort |
corpus design for malay corpus-based speech synthesis system |
publisher |
Science Publications |
publishDate |
2009 |
url |
http://eprints.utm.my/id/eprint/13281/ http://dx.doi.org/10.3844/ajas.2009.696.702 |
_version_ |
1643646155412733952 |
score |
13.211869 |