A DNN-based text-to-speech system for Hausa: An under-resourced language / Abubakar Ahmad Aliero

In recent years, speech technology has gained a tremendous improvement in term of its application and development. Speech technology such as machine translator, automatic speech recognition system and speech synthesis system are the state-of-the-art in today’s technology. TTS system or artificial sp...

Full description

Saved in:
Bibliographic Details
Main Author: Abubakar Ahmad , Aliero
Format: Thesis
Published: 2017
Subjects:
Online Access:http://studentsrepo.um.edu.my/14262/2/Abubakar.pdf
http://studentsrepo.um.edu.my/14262/1/Abubakar_Ahmad.pdf
http://studentsrepo.um.edu.my/14262/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.stud.14262
record_format eprints
spelling my.um.stud.142622023-04-09T22:48:53Z A DNN-based text-to-speech system for Hausa: An under-resourced language / Abubakar Ahmad Aliero Abubakar Ahmad , Aliero QA76 Computer software TA Engineering (General). Civil engineering (General) In recent years, speech technology has gained a tremendous improvement in term of its application and development. Speech technology such as machine translator, automatic speech recognition system and speech synthesis system are the state-of-the-art in today’s technology. TTS system or artificial speech development during the last few decades aims at gradual improvement in the intelligibility and naturalness. A Text-to-Speech system is a system that generates speech output from a given input text. TTS system has many different applications for many different users, but more specifically are the visually impaired and the illiterates. Some of the major application areas of speech synthesis system are document reader, speech translator, mobile read-aloud applications (such as google map reader) and announcement system. Speech synthesis system serves as an assistive tool for disabled, which is used for reading online text/information and as an automatic learning system for children. Despite the potential benefits of TTS system, it is language dependent and has yet to be developed for many of the languages around the world, which is mostly due to the lack in the necessary resources. Languages that is lacking in the necessary resources are referred as under-resourced language. Hausa is one of the under-resourced languages that lacks in the resources for developing a TTS system. The aim of this research is to develop a state-of-the-art TTS system for Hausa, an under-resourced language, using minimal resources. Several techniques have been introduced by researchers for developing TTS system for under-resourced languages, such as speaker adaptation, cross-lingual adaptation, bootstrapping, and etc. Currently, the state-of-the-art TTS technology is the Deep Neural Network (DNN)-based speech synthesis system which is only available for selected well-resourced languages like English, Arabic etc. The DNN-based speech synthesis system is the most advanced system that offers the highest intelligibility and naturalness as compared to the existing systems. Using the English resources as the basis, a DNN-based speech synthesis system is developed for Hausa with minimal resources by adopting the cross-lingual technique. The developed system was tested for intelligibility and naturalness using native Hausa speakers. The result of the developed system is 4.20 out of 5 in terms of naturalness and 4.10 out of 5 in terms in intelligibility, which is better than the existing techniques used for the development of TTS systems for under-resourced languages. 2017-10 Thesis NonPeerReviewed application/pdf http://studentsrepo.um.edu.my/14262/2/Abubakar.pdf application/pdf http://studentsrepo.um.edu.my/14262/1/Abubakar_Ahmad.pdf Abubakar Ahmad , Aliero (2017) A DNN-based text-to-speech system for Hausa: An under-resourced language / Abubakar Ahmad Aliero. Masters thesis, Universiti Malaya. http://studentsrepo.um.edu.my/14262/
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Student Repository
url_provider http://studentsrepo.um.edu.my/
topic QA76 Computer software
TA Engineering (General). Civil engineering (General)
spellingShingle QA76 Computer software
TA Engineering (General). Civil engineering (General)
Abubakar Ahmad , Aliero
A DNN-based text-to-speech system for Hausa: An under-resourced language / Abubakar Ahmad Aliero
description In recent years, speech technology has gained a tremendous improvement in term of its application and development. Speech technology such as machine translator, automatic speech recognition system and speech synthesis system are the state-of-the-art in today’s technology. TTS system or artificial speech development during the last few decades aims at gradual improvement in the intelligibility and naturalness. A Text-to-Speech system is a system that generates speech output from a given input text. TTS system has many different applications for many different users, but more specifically are the visually impaired and the illiterates. Some of the major application areas of speech synthesis system are document reader, speech translator, mobile read-aloud applications (such as google map reader) and announcement system. Speech synthesis system serves as an assistive tool for disabled, which is used for reading online text/information and as an automatic learning system for children. Despite the potential benefits of TTS system, it is language dependent and has yet to be developed for many of the languages around the world, which is mostly due to the lack in the necessary resources. Languages that is lacking in the necessary resources are referred as under-resourced language. Hausa is one of the under-resourced languages that lacks in the resources for developing a TTS system. The aim of this research is to develop a state-of-the-art TTS system for Hausa, an under-resourced language, using minimal resources. Several techniques have been introduced by researchers for developing TTS system for under-resourced languages, such as speaker adaptation, cross-lingual adaptation, bootstrapping, and etc. Currently, the state-of-the-art TTS technology is the Deep Neural Network (DNN)-based speech synthesis system which is only available for selected well-resourced languages like English, Arabic etc. The DNN-based speech synthesis system is the most advanced system that offers the highest intelligibility and naturalness as compared to the existing systems. Using the English resources as the basis, a DNN-based speech synthesis system is developed for Hausa with minimal resources by adopting the cross-lingual technique. The developed system was tested for intelligibility and naturalness using native Hausa speakers. The result of the developed system is 4.20 out of 5 in terms of naturalness and 4.10 out of 5 in terms in intelligibility, which is better than the existing techniques used for the development of TTS systems for under-resourced languages.
format Thesis
author Abubakar Ahmad , Aliero
author_facet Abubakar Ahmad , Aliero
author_sort Abubakar Ahmad , Aliero
title A DNN-based text-to-speech system for Hausa: An under-resourced language / Abubakar Ahmad Aliero
title_short A DNN-based text-to-speech system for Hausa: An under-resourced language / Abubakar Ahmad Aliero
title_full A DNN-based text-to-speech system for Hausa: An under-resourced language / Abubakar Ahmad Aliero
title_fullStr A DNN-based text-to-speech system for Hausa: An under-resourced language / Abubakar Ahmad Aliero
title_full_unstemmed A DNN-based text-to-speech system for Hausa: An under-resourced language / Abubakar Ahmad Aliero
title_sort dnn-based text-to-speech system for hausa: an under-resourced language / abubakar ahmad aliero
publishDate 2017
url http://studentsrepo.um.edu.my/14262/2/Abubakar.pdf
http://studentsrepo.um.edu.my/14262/1/Abubakar_Ahmad.pdf
http://studentsrepo.um.edu.my/14262/
_version_ 1762962033404280832
score 13.160551