Baby cry recognition using deep neural networks

Infant cry recognition is a challenging task as it is hard to determine the speech features that can allow researchers to clearly separate between different types of cries. However, baby cry is treated as a different way of communication of speech. The types of baby cry can be differentiated using M...

Full description

Saved in:
Bibliographic Details
Main Authors: Yong, B.F., Ting, H.N., Ng, K.H.
Format: Conference or Workshop Item
Language:English
Published: 2018
Subjects:
Online Access:http://eprints.um.edu.my/18972/1/reconation_using_deep_neural_networks.pdf
http://eprints.um.edu.my/18972/
https://doi.org/10.1007/978-981-10-9023-3_147
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.um.eprints.18972
record_format eprints
spelling my.um.eprints.189722018-08-08T07:46:18Z http://eprints.um.edu.my/18972/ Baby cry recognition using deep neural networks Yong, B.F. Ting, H.N. Ng, K.H. RA Public aspects of medicine Infant cry recognition is a challenging task as it is hard to determine the speech features that can allow researchers to clearly separate between different types of cries. However, baby cry is treated as a different way of communication of speech. The types of baby cry can be differentiated using Mel-Frequency Cepstral Coefficient (MFCC) with appropriate artificial intelligence model. Stacked restricted Boltzmann machine (RBN) is popular in providing few layers of neural networks to convert the high dimensional data to lower dimensional data to fine tune the input data to a better initialized weight for the neural networks. Usually RBN is used with another deep neural network to form the deep belief networks (DBN), and the studies in this direction is heading towards the convolutional-RBN variant. The study on RBN to pre-train Convolutional neural networks (CNN) without convolution function in the RBN meanwhile is scarce due to the Back propagation and principal component analysis can be applied directly to the CNN. In this paper, we describe the hybrid system between RBN and CNN for learning class specific features for baby cry recognition using the feature of Mel-Frequency Cepstral Coefficient. We archived an 78.6% of accuracy on 5 types of baby cries by validating the proposed model on baby cry recognition. 2018 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.um.edu.my/18972/1/reconation_using_deep_neural_networks.pdf Yong, B.F. and Ting, H.N. and Ng, K.H. (2018) Baby cry recognition using deep neural networks. In: World Congress on Medical Physics and Biomedical Engineering 2018, 3-8 June 2018, Prague, Czech Republic. https://doi.org/10.1007/978-981-10-9023-3_147
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
language English
topic RA Public aspects of medicine
spellingShingle RA Public aspects of medicine
Yong, B.F.
Ting, H.N.
Ng, K.H.
Baby cry recognition using deep neural networks
description Infant cry recognition is a challenging task as it is hard to determine the speech features that can allow researchers to clearly separate between different types of cries. However, baby cry is treated as a different way of communication of speech. The types of baby cry can be differentiated using Mel-Frequency Cepstral Coefficient (MFCC) with appropriate artificial intelligence model. Stacked restricted Boltzmann machine (RBN) is popular in providing few layers of neural networks to convert the high dimensional data to lower dimensional data to fine tune the input data to a better initialized weight for the neural networks. Usually RBN is used with another deep neural network to form the deep belief networks (DBN), and the studies in this direction is heading towards the convolutional-RBN variant. The study on RBN to pre-train Convolutional neural networks (CNN) without convolution function in the RBN meanwhile is scarce due to the Back propagation and principal component analysis can be applied directly to the CNN. In this paper, we describe the hybrid system between RBN and CNN for learning class specific features for baby cry recognition using the feature of Mel-Frequency Cepstral Coefficient. We archived an 78.6% of accuracy on 5 types of baby cries by validating the proposed model on baby cry recognition.
format Conference or Workshop Item
author Yong, B.F.
Ting, H.N.
Ng, K.H.
author_facet Yong, B.F.
Ting, H.N.
Ng, K.H.
author_sort Yong, B.F.
title Baby cry recognition using deep neural networks
title_short Baby cry recognition using deep neural networks
title_full Baby cry recognition using deep neural networks
title_fullStr Baby cry recognition using deep neural networks
title_full_unstemmed Baby cry recognition using deep neural networks
title_sort baby cry recognition using deep neural networks
publishDate 2018
url http://eprints.um.edu.my/18972/1/reconation_using_deep_neural_networks.pdf
http://eprints.um.edu.my/18972/
https://doi.org/10.1007/978-981-10-9023-3_147
_version_ 1643690848486948864
score 13.211869