Speech emotion recognition using deep neural networks on multilingual databases

The research community's ever-increasing interest in studying human-computer interactions (HCI), systems deducing, and identifying a speech signal's emotional aspects has emerged as a hot research topic. Speech Emotion Recognition (SER) has brought the development of automated and intellig...

Full description

Saved in:
Bibliographic Details
Main Authors: Ahmad Qadri, Syed Asif, Gunawan, Teddy Surya, Wani, Taiba Majid, Ambikairajah, Eliathamby, Kartiwi, Mira, Ihsanto, Eko
Format: Book Chapter
Language:English
English
English
Published: Springer 2021
Subjects:
Online Access:http://irep.iium.edu.my/88878/1/Paper_110.pdf
http://irep.iium.edu.my/88878/7/88878_Speech%20emotion%20recognition.pdf
http://irep.iium.edu.my/88878/13/88878_Speech%20emotion%20recognition%20using%20deep%20neural_SCOPUS.pdf
http://irep.iium.edu.my/88878/
https://link.springer.com/book/10.1007%2F978-3-030-70917-4
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.iium.irep.88878
record_format dspace
spelling my.iium.irep.888782021-05-11T03:09:59Z http://irep.iium.edu.my/88878/ Speech emotion recognition using deep neural networks on multilingual databases Ahmad Qadri, Syed Asif Gunawan, Teddy Surya Wani, Taiba Majid Ambikairajah, Eliathamby Kartiwi, Mira Ihsanto, Eko TK7885 Computer engineering The research community's ever-increasing interest in studying human-computer interactions (HCI), systems deducing, and identifying a speech signal's emotional aspects has emerged as a hot research topic. Speech Emotion Recognition (SER) has brought the development of automated and intelligent analysis of human ut-terances to reality. Typically, an SER system focuses on extracting the features from speech signals such as pitch frequency, formant features, energy-related and spectral features, tailing it with a classification quest to understand the underlying emotion. The key issues pivotal for a successful SER system are driven by the proper selection of proper emotional feature extraction techniques. In this paper, Mel-frequency Cepstral Coefficient (MFCC) and Teager Energy Operator (TEO) along with a new proposed Feature Fusion of MFCC and TEO referred to as Teager-MFCC (TMFCC) is examined over a multilingual database consisting of English, German and Hindi languages. Deep Neural Networks have been used to classify the different emotions considered, happy, sad, angry, and neutral. Eval-uation results show that the proposed fusion TMFCC with a recognition rate of 92.7% outperforms TEO and MFCC. With TEO and MFCC configurations, the recognition rate has been found as 88.5% and 90.0%, respectively. Springer 2021 Book Chapter PeerReviewed application/pdf en http://irep.iium.edu.my/88878/1/Paper_110.pdf application/pdf en http://irep.iium.edu.my/88878/7/88878_Speech%20emotion%20recognition.pdf application/pdf en http://irep.iium.edu.my/88878/13/88878_Speech%20emotion%20recognition%20using%20deep%20neural_SCOPUS.pdf Ahmad Qadri, Syed Asif and Gunawan, Teddy Surya and Wani, Taiba Majid and Ambikairajah, Eliathamby and Kartiwi, Mira and Ihsanto, Eko (2021) Speech emotion recognition using deep neural networks on multilingual databases. In: Advances in Robotics, Automation and Data Analytics. Advances in Intelligent Systems and Computing, Chapter 3 . Springer, pp. 21-30. ISBN 978-3-030-70916-7 https://link.springer.com/book/10.1007%2F978-3-030-70917-4
institution Universiti Islam Antarabangsa Malaysia
building IIUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider International Islamic University Malaysia
content_source IIUM Repository (IREP)
url_provider http://irep.iium.edu.my/
language English
English
English
topic TK7885 Computer engineering
spellingShingle TK7885 Computer engineering
Ahmad Qadri, Syed Asif
Gunawan, Teddy Surya
Wani, Taiba Majid
Ambikairajah, Eliathamby
Kartiwi, Mira
Ihsanto, Eko
Speech emotion recognition using deep neural networks on multilingual databases
description The research community's ever-increasing interest in studying human-computer interactions (HCI), systems deducing, and identifying a speech signal's emotional aspects has emerged as a hot research topic. Speech Emotion Recognition (SER) has brought the development of automated and intelligent analysis of human ut-terances to reality. Typically, an SER system focuses on extracting the features from speech signals such as pitch frequency, formant features, energy-related and spectral features, tailing it with a classification quest to understand the underlying emotion. The key issues pivotal for a successful SER system are driven by the proper selection of proper emotional feature extraction techniques. In this paper, Mel-frequency Cepstral Coefficient (MFCC) and Teager Energy Operator (TEO) along with a new proposed Feature Fusion of MFCC and TEO referred to as Teager-MFCC (TMFCC) is examined over a multilingual database consisting of English, German and Hindi languages. Deep Neural Networks have been used to classify the different emotions considered, happy, sad, angry, and neutral. Eval-uation results show that the proposed fusion TMFCC with a recognition rate of 92.7% outperforms TEO and MFCC. With TEO and MFCC configurations, the recognition rate has been found as 88.5% and 90.0%, respectively.
format Book Chapter
author Ahmad Qadri, Syed Asif
Gunawan, Teddy Surya
Wani, Taiba Majid
Ambikairajah, Eliathamby
Kartiwi, Mira
Ihsanto, Eko
author_facet Ahmad Qadri, Syed Asif
Gunawan, Teddy Surya
Wani, Taiba Majid
Ambikairajah, Eliathamby
Kartiwi, Mira
Ihsanto, Eko
author_sort Ahmad Qadri, Syed Asif
title Speech emotion recognition using deep neural networks on multilingual databases
title_short Speech emotion recognition using deep neural networks on multilingual databases
title_full Speech emotion recognition using deep neural networks on multilingual databases
title_fullStr Speech emotion recognition using deep neural networks on multilingual databases
title_full_unstemmed Speech emotion recognition using deep neural networks on multilingual databases
title_sort speech emotion recognition using deep neural networks on multilingual databases
publisher Springer
publishDate 2021
url http://irep.iium.edu.my/88878/1/Paper_110.pdf
http://irep.iium.edu.my/88878/7/88878_Speech%20emotion%20recognition.pdf
http://irep.iium.edu.my/88878/13/88878_Speech%20emotion%20recognition%20using%20deep%20neural_SCOPUS.pdf
http://irep.iium.edu.my/88878/
https://link.springer.com/book/10.1007%2F978-3-030-70917-4
_version_ 1701162780226224128
score 13.159267