Adding emotions to synthesized Malay speech using diphone-based templates / Syaheerah Lebai Lutfi
This study describes the addition of an affective component to the Malay TTS system to make it more expressive. It introduces a new method for generating expressive speech by embedding an ‘emotion layer’ called the eXpressive Text Reader Automation Layer, abbreviated...
Main Author: | Syaheerah, Lebai Lutfi |
---|---|
Format: | Thesis |
Published: | 2007 |
Subjects: | QA75 Electronic computers. Computer science; QA76 Computer software |
Online Access: | http://studentsrepo.um.edu.my/11579/1/Syaheerah.pdf http://studentsrepo.um.edu.my/11579/ |
id |
my.um.stud.11579 |
---|---|
record_format |
eprints |
spelling |
my.um.stud.11579 2020-08-25T19:00:29Z Syaheerah, Lebai Lutfi (2007) Adding emotions to synthesized Malay speech using diphone-based templates / Syaheerah Lebai Lutfi. Masters thesis, University of Malaya. NonPeerReviewed application/pdf http://studentsrepo.um.edu.my/11579/1/Syaheerah.pdf http://studentsrepo.um.edu.my/11579/ |
institution |
Universiti Malaya |
building |
UM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaya |
content_source |
UM Student Repository |
url_provider |
http://studentsrepo.um.edu.my/ |
topic |
QA75 Electronic computers. Computer science QA76 Computer software |
description |
This study describes the addition of an affective component to the Malay TTS system to make it more expressive. It introduces a new method for generating expressive speech by embedding an ‘emotion layer’ called the eXpressive Text Reader Automation Layer, abbreviated as eXTRA. The emotion generation method is template-driven. The templates are diphone-based, and each template carries unique affective data. The two emotions created for the system are anger and sadness. To ensure naturalness, the user's input sentence is matched with a template consisting of a sentence with the same syllable structure as the input sentence, allowing the emotion parameters from the template to be applied to the input at the phoneme level. This syllable-sensitive matching process requires analysis of each syllable's consonant-vowel pattern. The module is an independent component that can serve as an extension to any Malay TTS system that uses the Multiband Resynthesis Overlap Add (MBROLA) engine for diphone concatenation. In a pilot project, the prototype is used with Fasih, the first Malay text-to-speech system developed by MIMOS Berhad, which can read unrestricted Malay text. eXTRA is evaluated through perception tests. The results show a recognition rate of more than sixty percent, which confirms the satisfactory performance of this approach. The solution should improve the output of Malay TTS systems.
|
format |
Thesis |
author |
Syaheerah, Lebai Lutfi |
title |
Adding emotions to synthesized Malay speech using diphone-based templates / Syaheerah Lebai Lutfi
|
publishDate |
2007 |
url |
http://studentsrepo.um.edu.my/11579/1/Syaheerah.pdf http://studentsrepo.um.edu.my/11579/ |
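
The syllable-sensitive matching described in the abstract can be illustrated with a minimal sketch. This is not the thesis's implementation: it assumes that a per-word consonant/vowel (CV) pattern is an adequate stand-in for syllable structure, that the digraphs `ng`, `ny`, `sy`, `kh` and `gh` each count as one consonant, and that a template is a plain dictionary with hypothetical `text` and `emotion` keys. The function names `cv_pattern`, `sentence_signature` and `find_template` are likewise illustrative.

```python
import re

VOWELS = set("aeiou")
# Assumption: these digraphs each spell a single consonant in Malay orthography.
DIGRAPHS = ("ng", "ny", "sy", "kh", "gh")


def cv_pattern(word):
    """Return the consonant/vowel pattern of one word, e.g. 'marah' -> 'CVCVC'."""
    word = word.lower()
    pattern = []
    i = 0
    while i < len(word):
        if word[i:i + 2] in DIGRAPHS:
            pattern.append("C")  # a digraph counts as one consonant
            i += 2
        elif word[i] in VOWELS:
            pattern.append("V")
            i += 1
        elif word[i].isalpha():
            pattern.append("C")
            i += 1
        else:
            i += 1  # skip hyphens and apostrophes
    return "".join(pattern)


def sentence_signature(sentence):
    """Return the tuple of CV patterns for every word in the sentence."""
    words = re.findall(r"[A-Za-z'-]+", sentence)
    patterns = [cv_pattern(w) for w in words]
    return tuple(p for p in patterns if p)  # drop tokens with no letters


def find_template(sentence, templates):
    """Return the first template whose CV signature equals the input's, else None."""
    target = sentence_signature(sentence)
    for template in templates:
        if sentence_signature(template["text"]) == target:
            return template
    return None


# Hypothetical templates; a real one would also carry the affective parameters.
templates = [
    {"text": "kamu jahat", "emotion": "anger"},
    {"text": "dia pergi jauh", "emotion": "sadness"},
]
match = find_template("saya marah", templates)
```

Here `saya marah` and `kamu jahat` share the signature `("CVCV", "CVCVC")`, so the anger template is selected, and its emotion parameters could then be applied phoneme by phoneme because the two sentences align position for position.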