Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases

In the speech signal, emotion is considered one of the most critical elements. For the recognition of emotions, the field of speech emotion recognition came into ex-istence. Speech Emotion Recognition (SER) is becoming an area of research in-terest in the last few years. A typical SER system focuses...

Full description

Saved in:
Bibliographic Details
Main Authors: Ahmad Qadri, Syed Asif, Gunawan, Teddy Surya, Kartiwi, Mira, Mansor, Hasmah
Format: Book Chapter
Language:English
English
Published: Springer 2020
Subjects:
Online Access:http://irep.iium.edu.my/82534/1/Paper_182.pdf
http://irep.iium.edu.my/82534/2/Acceptance%20Letter_DrTeddy_IIUM.pdf
http://irep.iium.edu.my/82534/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.iium.irep.82534
record_format dspace
spelling my.iium.irep.825342020-09-18T02:58:06Z http://irep.iium.edu.my/82534/ Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases Ahmad Qadri, Syed Asif Gunawan, Teddy Surya Kartiwi, Mira Mansor, Hasmah TK7885 Computer engineering In the speech signal, emotion is considered one of the most critical elements. For the recognition of emotions, the field of speech emotion recognition came into ex-istence. Speech Emotion Recognition (SER) is becoming an area of research in-terest in the last few years. A typical SER system focuses on extracting features such as pitch frequency, formant features, energy-related features, and spectral features from speech, tailing it with a classification quest to foresee different clas-ses of emotion. The critical issue to be addressed for a successful SER system is the emotional feature extraction, which can be solved by using different feature extraction techniques. In this paper, along with Teager Energy Operator (TEO) and Mel Frequency Cepstral Coefficients (MFCC) a trailblazing feature extrac-tion method, a fusion of MFCC and TEO as Teager-MFCC (T-MFCC) is used for the recognition of energy-based emotions. We have used three corpora of emotions in German, English, and Hindi to develop the multilingual SER system. The classification of these energy-based emotions is done by Deep Neural Net-work (DNN). It is found that TEO achieves a better recognition rate compared to MFCC and T-MFCC. Springer 2020-07 Book Chapter NonPeerReviewed application/pdf en http://irep.iium.edu.my/82534/1/Paper_182.pdf application/pdf en http://irep.iium.edu.my/82534/2/Acceptance%20Letter_DrTeddy_IIUM.pdf Ahmad Qadri, Syed Asif and Gunawan, Teddy Surya and Kartiwi, Mira and Mansor, Hasmah (2020) Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases. In: Springer's Lecture Nores in Electrical Engineering (LNEE). Springer, pp. 1-10. (In Press)
institution Universiti Islam Antarabangsa Malaysia
building IIUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider International Islamic University Malaysia
content_source IIUM Repository (IREP)
url_provider http://irep.iium.edu.my/
language English
English
topic TK7885 Computer engineering
spellingShingle TK7885 Computer engineering
Ahmad Qadri, Syed Asif
Gunawan, Teddy Surya
Kartiwi, Mira
Mansor, Hasmah
Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
description In the speech signal, emotion is considered one of the most critical elements. For the recognition of emotions, the field of speech emotion recognition came into ex-istence. Speech Emotion Recognition (SER) is becoming an area of research in-terest in the last few years. A typical SER system focuses on extracting features such as pitch frequency, formant features, energy-related features, and spectral features from speech, tailing it with a classification quest to foresee different clas-ses of emotion. The critical issue to be addressed for a successful SER system is the emotional feature extraction, which can be solved by using different feature extraction techniques. In this paper, along with Teager Energy Operator (TEO) and Mel Frequency Cepstral Coefficients (MFCC) a trailblazing feature extrac-tion method, a fusion of MFCC and TEO as Teager-MFCC (T-MFCC) is used for the recognition of energy-based emotions. We have used three corpora of emotions in German, English, and Hindi to develop the multilingual SER system. The classification of these energy-based emotions is done by Deep Neural Net-work (DNN). It is found that TEO achieves a better recognition rate compared to MFCC and T-MFCC.
format Book Chapter
author Ahmad Qadri, Syed Asif
Gunawan, Teddy Surya
Kartiwi, Mira
Mansor, Hasmah
author_facet Ahmad Qadri, Syed Asif
Gunawan, Teddy Surya
Kartiwi, Mira
Mansor, Hasmah
author_sort Ahmad Qadri, Syed Asif
title Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
title_short Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
title_full Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
title_fullStr Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
title_full_unstemmed Speech emotion recognition using feature fusion of TEO and MFCC on multilingual databases
title_sort speech emotion recognition using feature fusion of teo and mfcc on multilingual databases
publisher Springer
publishDate 2020
url http://irep.iium.edu.my/82534/1/Paper_182.pdf
http://irep.iium.edu.my/82534/2/Acceptance%20Letter_DrTeddy_IIUM.pdf
http://irep.iium.edu.my/82534/
_version_ 1678592805745721344
score 13.214268