Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model

Background and Objective Automatic voice pathology detection using sustained vowels has been widely explored. Because of the stationary nature of the speech waveform, pathology detection with a sustained vowel is a comparatively easier task than that using a running speech. Some disorder detection s...

Full description

Saved in:
Bibliographic Details
Main Authors: Ali, Z., Elamvazuthi, I., Alsulaiman, M., Muhammad, G.
Format: Article
Published: Mosby Inc. 2016
Online Access:https://www.scopus.com/inward/record.uri?eid=2-s2.0-84949997124&doi=10.1016%2fj.jvoice.2015.08.010&partnerID=40&md5=f7cd22682db3f440cbc29e78a693db31
http://eprints.utp.edu.my/30796/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utp.eprints.30796
record_format eprints
spelling my.utp.eprints.307962022-03-25T07:33:28Z Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model Ali, Z. Elamvazuthi, I. Alsulaiman, M. Muhammad, G. Background and Objective Automatic voice pathology detection using sustained vowels has been widely explored. Because of the stationary nature of the speech waveform, pathology detection with a sustained vowel is a comparatively easier task than that using a running speech. Some disorder detection systems with running speech have also been developed, although most of them are based on a voice activity detection (VAD), that is, itself a challenging task. Pathology detection with running speech needs more investigation, and systems with good accuracy (ACC) are required. Furthermore, pathology classification systems with running speech have not received any attention from the research community. In this article, automatic pathology detection and classification systems are developed using text-dependent running speech without adding a VAD module. Method A set of three psychophysics conditions of hearing (critical band spectral estimation, equal loudness hearing curve, and the intensity loudness power law of hearing) is used to estimate the auditory spectrum. The auditory spectrum and all-pole models of the auditory spectrums are computed and analyzed and used in a Gaussian mixture model for an automatic decision. Results In the experiments using the Massachusetts Eye & Ear Infirmary database, an ACC of 99.56 is obtained for pathology detection, and an ACC of 93.33 is obtained for the pathology classification system. The results of the proposed systems outperform the existing running-speech�based systems. Discussion The developed system can effectively be used in voice pathology detection and classification systems, and the proposed features can visually differentiate between normal and pathological samples. © 2015 The Voice Foundation Mosby Inc. 2016 Article NonPeerReviewed https://www.scopus.com/inward/record.uri?eid=2-s2.0-84949997124&doi=10.1016%2fj.jvoice.2015.08.010&partnerID=40&md5=f7cd22682db3f440cbc29e78a693db31 Ali, Z. and Elamvazuthi, I. and Alsulaiman, M. and Muhammad, G. (2016) Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model. Journal of Voice, 30 (6). 757.e7-757.e19. http://eprints.utp.edu.my/30796/
institution Universiti Teknologi Petronas
building UTP Resource Centre
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Petronas
content_source UTP Institutional Repository
url_provider http://eprints.utp.edu.my/
description Background and Objective Automatic voice pathology detection using sustained vowels has been widely explored. Because of the stationary nature of the speech waveform, pathology detection with a sustained vowel is a comparatively easier task than that using a running speech. Some disorder detection systems with running speech have also been developed, although most of them are based on a voice activity detection (VAD), that is, itself a challenging task. Pathology detection with running speech needs more investigation, and systems with good accuracy (ACC) are required. Furthermore, pathology classification systems with running speech have not received any attention from the research community. In this article, automatic pathology detection and classification systems are developed using text-dependent running speech without adding a VAD module. Method A set of three psychophysics conditions of hearing (critical band spectral estimation, equal loudness hearing curve, and the intensity loudness power law of hearing) is used to estimate the auditory spectrum. The auditory spectrum and all-pole models of the auditory spectrums are computed and analyzed and used in a Gaussian mixture model for an automatic decision. Results In the experiments using the Massachusetts Eye & Ear Infirmary database, an ACC of 99.56 is obtained for pathology detection, and an ACC of 93.33 is obtained for the pathology classification system. The results of the proposed systems outperform the existing running-speech�based systems. Discussion The developed system can effectively be used in voice pathology detection and classification systems, and the proposed features can visually differentiate between normal and pathological samples. © 2015 The Voice Foundation
format Article
author Ali, Z.
Elamvazuthi, I.
Alsulaiman, M.
Muhammad, G.
spellingShingle Ali, Z.
Elamvazuthi, I.
Alsulaiman, M.
Muhammad, G.
Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model
author_facet Ali, Z.
Elamvazuthi, I.
Alsulaiman, M.
Muhammad, G.
author_sort Ali, Z.
title Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model
title_short Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model
title_full Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model
title_fullStr Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model
title_full_unstemmed Automatic Voice Pathology Detection With Running Speech by Using Estimation of Auditory Spectrum and Cepstral Coefficients Based on the All-Pole Model
title_sort automatic voice pathology detection with running speech by using estimation of auditory spectrum and cepstral coefficients based on the all-pole model
publisher Mosby Inc.
publishDate 2016
url https://www.scopus.com/inward/record.uri?eid=2-s2.0-84949997124&doi=10.1016%2fj.jvoice.2015.08.010&partnerID=40&md5=f7cd22682db3f440cbc29e78a693db31
http://eprints.utp.edu.my/30796/
_version_ 1738657158018367488
score 13.160551