Self-organizing map and multilayer perceptron for malay speech recognition

Various studies have been done in this field of speech recognition using various techniques such as Dynamic Time Warping (DTW), Hidden Markov Model (HMM) and Artificial Neural Network (ANN) in order to obtain the best and suitable model for speech recognition system. Every model has its drawbacks an...

Full description

Saved in:
Bibliographic Details
Main Author: Goh, Kia Eng
Format: Thesis
Language:English
Published: 2006
Subjects:
Online Access:http://eprints.utm.my/id/eprint/6385/1/GohKiaEngMFSKSM2006.pdf
http://eprints.utm.my/id/eprint/6385/
http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:62257
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.6385
record_format eprints
spelling my.utm.63852018-09-17T03:03:17Z http://eprints.utm.my/id/eprint/6385/ Self-organizing map and multilayer perceptron for malay speech recognition Goh, Kia Eng QA76 Computer software Various studies have been done in this field of speech recognition using various techniques such as Dynamic Time Warping (DTW), Hidden Markov Model (HMM) and Artificial Neural Network (ANN) in order to obtain the best and suitable model for speech recognition system. Every model has its drawbacks and weaknesses. Multilayer Perceptron (MLP) is a popular ANN for pattern recognition especially in speech recognition because of its non-linearity, ability to learn, robustness and ability to generalize. However, MLP has difficulties when dealing with temporal information as it needs input pattern of fixed length. With that in mind, this research focuses on finding a hybrid model/approach which combines Self-Organizing Map (SOM) and Multilayer Perceptron (MLP) to overcome as well as reduce the drawbacks. A hybrid-based neural network model has been developed to speech recognition in Malay language. In the proposed model, a 2D SOM is used as a sequential mapping function in order to transform the acoustic vector sequences of speech signal into binary matrix which performs dimensionality reduction. The idea of the approach is accumulating the winner nodes of an utterance into a binary matrix where the winner node is scaled as value “1� and others as value “0�. As a result, a binary matrix is formed which represents the content of an utterance. Then, MLP is used to classify the binary matrix to which each word corresponds to. The conventional model (MLP only) and the proposed model (SOM and MLP) were tested for digit recognition (“satu� to “sembilan�) and word recognition (30 selected Malay words) to find out the recognition accuracy using different values of parameters (cepstral order, dimension of SOM, hidden node number and learning rate). Both of the models were also tested using two types of classification: syllable classification and word classification. Finally, comparison and discussion was made between conventional and proposed model based on their recognition accuracy. The experimental results showed that the proposed model achieved higher accuracy. 2006-08 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/id/eprint/6385/1/GohKiaEngMFSKSM2006.pdf Goh, Kia Eng (2006) Self-organizing map and multilayer perceptron for malay speech recognition. Masters thesis, Universiti Teknologi Malaysia, Faculty of Computer Science and Information System. http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:62257
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA76 Computer software
spellingShingle QA76 Computer software
Goh, Kia Eng
Self-organizing map and multilayer perceptron for malay speech recognition
description Various studies have been done in this field of speech recognition using various techniques such as Dynamic Time Warping (DTW), Hidden Markov Model (HMM) and Artificial Neural Network (ANN) in order to obtain the best and suitable model for speech recognition system. Every model has its drawbacks and weaknesses. Multilayer Perceptron (MLP) is a popular ANN for pattern recognition especially in speech recognition because of its non-linearity, ability to learn, robustness and ability to generalize. However, MLP has difficulties when dealing with temporal information as it needs input pattern of fixed length. With that in mind, this research focuses on finding a hybrid model/approach which combines Self-Organizing Map (SOM) and Multilayer Perceptron (MLP) to overcome as well as reduce the drawbacks. A hybrid-based neural network model has been developed to speech recognition in Malay language. In the proposed model, a 2D SOM is used as a sequential mapping function in order to transform the acoustic vector sequences of speech signal into binary matrix which performs dimensionality reduction. The idea of the approach is accumulating the winner nodes of an utterance into a binary matrix where the winner node is scaled as value “1� and others as value “0�. As a result, a binary matrix is formed which represents the content of an utterance. Then, MLP is used to classify the binary matrix to which each word corresponds to. The conventional model (MLP only) and the proposed model (SOM and MLP) were tested for digit recognition (“satu� to “sembilan�) and word recognition (30 selected Malay words) to find out the recognition accuracy using different values of parameters (cepstral order, dimension of SOM, hidden node number and learning rate). Both of the models were also tested using two types of classification: syllable classification and word classification. Finally, comparison and discussion was made between conventional and proposed model based on their recognition accuracy. The experimental results showed that the proposed model achieved higher accuracy.
format Thesis
author Goh, Kia Eng
author_facet Goh, Kia Eng
author_sort Goh, Kia Eng
title Self-organizing map and multilayer perceptron for malay speech recognition
title_short Self-organizing map and multilayer perceptron for malay speech recognition
title_full Self-organizing map and multilayer perceptron for malay speech recognition
title_fullStr Self-organizing map and multilayer perceptron for malay speech recognition
title_full_unstemmed Self-organizing map and multilayer perceptron for malay speech recognition
title_sort self-organizing map and multilayer perceptron for malay speech recognition
publishDate 2006
url http://eprints.utm.my/id/eprint/6385/1/GohKiaEngMFSKSM2006.pdf
http://eprints.utm.my/id/eprint/6385/
http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:62257
_version_ 1643644542375690240
score 13.159267