Microphone-independent speech features for automatic depression detection using recurrent neural network

Depression is a common mental disorder that has a negative impact on individuals, society, and the economy. Traditional clinical diagnosis methods are subjective and necessitate extensive expert participation. Because it is fast, convenient, and non-invasive, automatic depression detection using spe...

Full description

Saved in:
Bibliographic Details
Main Authors: Nik Hashim, Nik Nur Wahidah, Ahmad Basri, Nadzirah, Ahmad Ezzi, Mogahed Al Ezzi
Format: Conference or Workshop Item
Language:English
English
Published: Springer 2022
Subjects:
Online Access:http://irep.iium.edu.my/98933/1/98933_Microphone-independent%20speech%20features.pdf
http://irep.iium.edu.my/98933/2/98933_Microphone-independent%20speech%20features_SCOPUS.pdf
http://irep.iium.edu.my/98933/
https://link.springer.com/chapter/10.1007/978-981-16-8515-6_54
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Depression is a common mental disorder that has a negative impact on individuals, society, and the economy. Traditional clinical diagnosis methods are subjective and necessitate extensive expert participation. Because it is fast, convenient, and non-invasive, automatic depression detection using speech signals is a promising depression objective biomarker. Acoustic feature extraction is one of the most challenging techniques for speech analysis applications in mobile phones. The values of the extracted acoustic features are significantly influenced by adverse environmental noises, a wide range of microphone specifications, and various types of recording software. This study identified microphone-independent acoustic features and utilized them in developing an end-to-end recurrent neural network model to classify depression from Bahasa Malaysia speech. The dataset includes 110 female participants. Patient Health Questionnaire 9, Malay Beck Depression Inventory-II, and subjects’ declaration of Major Depressive Disorder diagnosis by a trained clinician were used to determine depression status. Multiple combinations of speech types were compared and discussed. Robust acoustic features derived from female spontaneous speech achieved an accuracy of 85%.