Adding emotions to synthesized Malay speech using diphone-based templates / Syaheerah Lebai Lutfi

This study describes the addition of an affective component to a Malay TTS system in order to produce a system that is more expressive in nature. It introduces a new method for generating expressive speech by embedding an ‘emotion layer’ called eXpressive Text Reader Automation Layer, abbreviated...

Bibliographic Details
Main Author: Syaheerah, Lebai Lutfi
Format: Thesis
Published: 2007
Subjects: QA75 Electronic computers. Computer science; QA76 Computer software
Online Access: http://studentsrepo.um.edu.my/11579/1/Syaheerah.pdf
http://studentsrepo.um.edu.my/11579/
id my.um.stud.11579
record_format eprints
spelling my.um.stud.11579 2020-08-25T19:00:29Z Adding emotions to synthesized Malay speech using diphone-based templates / Syaheerah Lebai Lutfi Syaheerah, Lebai Lutfi QA75 Electronic computers. Computer science QA76 Computer software This study describes the addition of an affective component to a Malay TTS system in order to produce a system that is more expressive in nature. It introduces a new method for generating expressive speech by embedding an ‘emotion layer’ called eXpressive Text Reader Automation Layer, abbreviated as eXTRA. The emotion generation method is template-driven. The templates are diphone-based and each template carries unique affective data. The two types of emotions created for the system are anger and sadness. To ensure naturalness, the input sentence from the user is matched with the template that consists of a sentence with the same syllable structure as the input sentence, allowing the emotion parameters from the template to be applied to the input at the level of phonemes. This syllable-sensitive matching process requires analysis of each syllable's consonant or vowel pattern. The module is an independent component that can serve as an extension to any Malay TTS system that uses the Multiband Resynthesis Overlap Add (MBROLA) engine for diphone concatenation. In a pilot project, the prototype is used with Fasih, the first Malay Text-to-Speech system developed by MIMOS Berhad, which can read unrestricted Malay text. eXTRA is evaluated through perception tests. The results show a recognition rate of more than sixty percent, which confirms the satisfactory performance of this approach. The solution should improve the output of a Malay TTS system. 2007 Thesis NonPeerReviewed application/pdf http://studentsrepo.um.edu.my/11579/1/Syaheerah.pdf Syaheerah, Lebai Lutfi (2007) Adding emotions to synthesized Malay speech using diphone-based templates / Syaheerah Lebai Lutfi. Masters thesis, University of Malaya.
http://studentsrepo.um.edu.my/11579/
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Student Repository
url_provider http://studentsrepo.um.edu.my/
topic QA75 Electronic computers. Computer science
QA76 Computer software
description This study describes the addition of an affective component to a Malay TTS system in order to produce a system that is more expressive in nature. It introduces a new method for generating expressive speech by embedding an ‘emotion layer’ called eXpressive Text Reader Automation Layer, abbreviated as eXTRA. The emotion generation method is template-driven. The templates are diphone-based and each template carries unique affective data. The two types of emotions created for the system are anger and sadness. To ensure naturalness, the input sentence from the user is matched with the template that consists of a sentence with the same syllable structure as the input sentence, allowing the emotion parameters from the template to be applied to the input at the level of phonemes. This syllable-sensitive matching process requires analysis of each syllable's consonant or vowel pattern. The module is an independent component that can serve as an extension to any Malay TTS system that uses the Multiband Resynthesis Overlap Add (MBROLA) engine for diphone concatenation. In a pilot project, the prototype is used with Fasih, the first Malay Text-to-Speech system developed by MIMOS Berhad, which can read unrestricted Malay text. eXTRA is evaluated through perception tests. The results show a recognition rate of more than sixty percent, which confirms the satisfactory performance of this approach. The solution should improve the output of a Malay TTS system.
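The syllable-sensitive matching described in the abstract can be illustrated with a minimal Python sketch. It is not the thesis's implementation: the function names, the template identifiers, and the example sentences are all hypothetical, the input is assumed to be already split into syllables, and consonant/vowel classification is done on plain letters (so Malay digraphs such as "ng" or "ny" would need phoneme-level handling in a real system).

```python
from __future__ import annotations

# Assumption: vowels are classified by orthographic letter, not by phoneme.
VOWELS = set("aeiou")


def cv_pattern(syllable: str) -> str:
    """Map each letter of a syllable to 'C' (consonant) or 'V' (vowel)."""
    return "".join("V" if ch in VOWELS else "C" for ch in syllable.lower())


def sentence_structure(sentence: list[list[str]]) -> tuple[tuple[str, ...], ...]:
    """Return the CV pattern of every syllable, grouped by word.

    `sentence` is a list of words, each word a list of syllables.
    """
    return tuple(tuple(cv_pattern(syl) for syl in word) for word in sentence)


def find_template(sentence: list[list[str]],
                  templates: dict[str, list[list[str]]]) -> str | None:
    """Return the name of the first template whose syllable structure
    is identical to that of the input sentence, or None if none matches."""
    target = sentence_structure(sentence)
    for name, template_sentence in templates.items():
        if sentence_structure(template_sentence) == target:
            return name
    return None


# Hypothetical templates; in the thesis each template also carries affective data.
TEMPLATES = {
    "sad_01": [["di", "a"], ["ma", "rah"]],      # CV V / CV CVC
    "anger_01": [["bu", "ku"], ["pin", "tu"]],   # CV CV / CVC CV
}

# "kamu pergi" syllabified by hand: ka-mu per-gi -> CV CV / CVC CV
user_input = [["ka", "mu"], ["per", "gi"]]
matched = find_template(user_input, TEMPLATES)
print(matched)
```

Because the matched template has the same number of words, syllables, and letters per syllable as the input, per-phoneme emotion parameters stored with the template could then be paired position by position with the input's phonemes.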
format Thesis
author Syaheerah, Lebai Lutfi
title Adding emotions to synthesized Malay speech using diphone-based templates / Syaheerah Lebai Lutfi
publishDate 2007
url http://studentsrepo.um.edu.my/11579/1/Syaheerah.pdf
http://studentsrepo.um.edu.my/11579/