On the optimum speech segment length for depression detection

Depression is a worldwide problem, which according to the World Health Organization, is the largest contributor to global disability. According to a study, around 18336 Malaysians are suffering from depression. Therefore, an automated system that can detect depression from human speech is needed. Th...

Full description

Saved in:
Bibliographic Details
Main Authors: Alghifari, Muhammad Fahreza, Gunawan, Teddy Surya, Wan Nordin, Mimi Aminah, Kartiwi, Mira, Borhan, Lihanna
Format: Conference or Workshop Item
Language:English
English
Published: IEEE 2019
Subjects:
Online Access:http://irep.iium.edu.my/80387/1/80387%20On%20the%20Optimum%20Speech%20Segment%20Length.pdf
http://irep.iium.edu.my/80387/2/80387%20On%20the%20Optimum%20Speech%20Segment%20Length%20%20SCOPUS.pdf
http://irep.iium.edu.my/80387/
https://ieeexplore.ieee.org/document/9057319
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Depression is a worldwide problem, which according to the World Health Organization, is the largest contributor to global disability. According to a study, around 18336 Malaysians are suffering from depression. Therefore, an automated system that can detect depression from human speech is needed. The main objective of this paper is to investigate the optimum speech segment length that provide fast and accurate depression detection. An artificial neural network was used as classifier to detect depression using a speech feature, i.e. the averaged Mel-frequency cepstral coefficients (MFCC). The Distress Analysis Interview Corpus Wizard of Oz (DAIC-WOZ) was used to train and test the system, measured in terms of accuracy and processing time, while varying the number of neurons used. The obtained results are further optimized by investigating the ideal segment length for depression detection. Results showed that our proposed system can recognize voiced depression in 3 levels of depression with an accuracy rate up to 98.3% when given previous samples of the same speaker for training. Furthermore, the optimum speech segment length was found to be 7 seconds, when it is tested for the length between 1 to 20 seconds.