Pronunciation modelling of Penang Hokkien dialect for text-to-speech system

This academic research concerns pronunciation modelling of Penang Hokkien for a text-to-speech (TTS) system, within the field of speech synthesis. It is widely known that the majority of unwritten languages are gradually being forgotten by younger generations due to the domination of written langua...

Bibliographic Details
Main Author: Lim, Kang Jie
Format: Final Year Project / Dissertation / Thesis
Published: 2022
Subjects: T Technology (General)
Online Access: http://eprints.utar.edu.my/4728/1/fyp_IA_2022_LKJ.pdf
http://eprints.utar.edu.my/4728/
id my-utar-eprints.4728
record_format eprints
spelling my-utar-eprints.4728 2023-01-06T15:40:52Z Pronunciation modelling of Penang Hokkien dialect for text-to-speech system Lim, Kang Jie T Technology (General)
2022-05 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/4728/1/fyp_IA_2022_LKJ.pdf Lim, Kang Jie (2022) Pronunciation modelling of Penang Hokkien dialect for text-to-speech system. Final Year Project, UTAR. http://eprints.utar.edu.my/4728/
institution Universiti Tunku Abdul Rahman
building UTAR Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Tunku Abdul Rahman
content_source UTAR Institutional Repository
url_provider http://eprints.utar.edu.my
topic T Technology (General)
description This academic research concerns pronunciation modelling of Penang Hokkien for a text-to-speech (TTS) system, within the field of speech synthesis. It is widely known that the majority of unwritten languages are gradually being forgotten by younger generations due to the domination of written languages in education, and the most significant factor is the lack of documentation of these languages. These hindrances prevent, or increase the effort required for, the revitalization of unwritten languages using current technologies. Penang Hokkien, a largely unwritten language spoken in northern Peninsular Malaysia whose linguistic resources are only partially documented, is selected as the case study of this research. This project is the first step towards developing a TTS system for Penang Hokkien: becoming familiar with this highly complex language. Since the project is part of the effort to revitalize Penang Hokkien, Traditional Chinese characters are adopted as the standard writing system, and the Penang Hokkien Spelling System, created by the Hokkien Association of Penang, is selected as the standard pronunciation orthography. Phonemes are listed and categorized into initials and finals, as Penang Hokkien is a tonal language. Nine tones are marked with diacritics according to the tone marking rules of the Penang Hokkien Spelling System. Tone sandhi rules are also established in the orthography standardization phase. The contributions of this project are (1) finding the possible combinations of initials, finals and tones, (2) collecting possible graphemes, (3) mapping graphemes to morphemes, (4) designing a database to store the processed data and (5) standardizing the tones and tone sandhi rules.
format Final Year Project / Dissertation / Thesis
author Lim, Kang Jie
title Pronunciation modelling of Penang Hokkien dialect for text-to-speech system
publishDate 2022
url http://eprints.utar.edu.my/4728/1/fyp_IA_2022_LKJ.pdf
http://eprints.utar.edu.my/4728/