A 3-level endpoint detection algorithm for isolated speech and frequency-based features

Bibliographic Details
Main Authors: Goh, K. E., Ahmad, A. M.
Format: Conference or Workshop Item
Language: English
Published: 2004
Subjects:
Online Access: http://eprints.utm.my/id/eprint/20757/1/GohKiaEng2004_A3LevelEndpointDetectionAlgorithm.pdf
http://eprints.utm.my/id/eprint/20757/
https://scienceon.kisti.re.kr/srch/selectPORSrchArticle.do?cn=NPAP08127118&SITE
Description
Summary: This paper proposes a new approach to endpoint detection for isolated speech, which is shown to significantly improve endpoint detection performance. The proposed algorithm relies on the root mean square (RMS) energy, zero crossing rate, and spectral characteristics of the speech signal, where a Euclidean distance measure on cepstral coefficients is used to accurately detect the endpoints of isolated speech. The algorithm offers better performance than traditional energy-based algorithms. The experimental vocabulary consists of the English digits from one to nine, and the experiments were conducted on 360 utterances from a male speaker. Experimental results show that the accuracy of the algorithm is quite acceptable. Moreover, the computational overhead of the algorithm is low, since the cepstral coefficients are reused later in the feature extraction stage of the speech recognition procedure.
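
The three feature types named in the summary (RMS energy, zero crossing rate, and Euclidean distance between cepstral coefficients) can be illustrated with a minimal sketch. This is not the authors' algorithm: the record does not give the decision logic, so the frame sizes, the noise estimate from the leading frames, the threshold factor `k`, the way the features are combined, and all function names below are assumptions made only for illustration.

```python
import numpy as np

def frame_signal(x, frame_len, hop):
    """Split a 1-D signal into overlapping frames, one frame per row."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]

def rms_energy(frames):
    """Root mean square energy of each frame."""
    return np.sqrt(np.mean(frames ** 2, axis=1))

def zero_crossing_rate(frames):
    """Fraction of adjacent sample pairs whose sign differs, per frame."""
    signs = np.signbit(frames)
    return np.mean(signs[:, 1:] != signs[:, :-1], axis=1)

def real_cepstrum(frames, n_coeffs=12):
    """Low-order real cepstral coefficients of each frame (c0 excluded)."""
    windowed = frames * np.hamming(frames.shape[1])
    spectrum = np.abs(np.fft.rfft(windowed, axis=1))
    cep = np.fft.irfft(np.log(spectrum + 1e-10), axis=1)
    return cep[:, 1:n_coeffs + 1]

def detect_endpoints(x, frame_len=256, hop=128, noise_frames=10, k=3.0):
    """Return (start_sample, end_sample) of detected speech, or None.

    Assumption: the first `noise_frames` frames contain background only
    and are used to set every threshold as mean + k * std.
    """
    frames = frame_signal(x, frame_len, hop)
    energy = rms_energy(frames)
    zcr = zero_crossing_rate(frames)
    cep = real_cepstrum(frames)

    # Euclidean distance of each frame's cepstrum from the average
    # background cepstrum.
    noise_cep = cep[:noise_frames].mean(axis=0)
    dist = np.linalg.norm(cep - noise_cep, axis=1)

    def threshold(feature):
        ref = feature[:noise_frames]
        return ref.mean() + k * ref.std()

    # Assumed combination rule: a frame is speech if its energy is high,
    # or if both its zero crossing rate and cepstral distance are high
    # (intended to catch weak, noise-like consonants).
    speech = (energy > threshold(energy)) | (
        (zcr > threshold(zcr)) & (dist > threshold(dist))
    )
    active = np.flatnonzero(speech)
    if active.size == 0:
        return None
    return active[0] * hop, active[-1] * hop + frame_len
```

Calling `detect_endpoints(x)` on a mono signal `x` held in a NumPy array returns the sample indices that bracket the first and last frames classified as speech. Because the cepstral coefficients are computed per frame anyway, a recognizer could reuse them as features, which is the source of the low overhead the summary mentions.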