Computer-based stuttered speech detection system using Hidden Markov Model

Stuttering has attracted extensive research interests over the past decades. Most of the available stuttering diagnostics and assessment technique uses human perceptual judgment to overt stuttered speech characteristics. Conventionally, the stuttering severity is diagnosed by manual counting the num...

全面介紹

Saved in:
書目詳細資料
主要作者: Chin, Wee Lip
格式: Thesis
語言:English
出版: 2012
主題:
在線閱讀:http://eprints.utm.my/id/eprint/78550/1/ChinWeeLipMFBME2012.pdf
http://eprints.utm.my/id/eprint/78550/
http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:110409
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
實物特徵
總結:Stuttering has attracted extensive research interests over the past decades. Most of the available stuttering diagnostics and assessment technique uses human perceptual judgment to overt stuttered speech characteristics. Conventionally, the stuttering severity is diagnosed by manual counting the number of occurrences of disfluencies of pre-recorded therapist-patient conversation. It is a time-consuming task, subjective, inconsistent and easily prone to error across clinics. Therefore, this thesis proposes a computerized system by deploying HMM-based speech recognition technique to detect the stuttered speech disfluency. The continuous Malay digit string has been used as the training and testing set for fluency detection. Hidden Markov Model (HMM) is a robust and powerful statistical-based acoustic modeling technique. With their efficient training algorithm (Forward-backward, Baum-Welch algorithms) and recognition algorithm, as well as its modeling flexibility in model topology and other knowledge sources, HMM has been successfully applied in solving various tasks. In this thesis, a set of normal voice for digit string as database is used for training HMM. Then, the pseudo stuttering voice was collected as testing set for proposed system. The generated experimental results were compared with the results made by Speech Language Pathologist (SLP) from Clinic of Audiology and Speech Sciences of Universiti Kebangsaan Malaysia (UKM). As a result, the proposed system is proven to be capable to achieve 100% average syllable repetition detection accuracy with 86.605% average sound prolongation detection accuracy. The SLP agreed with the result generated by the software. This system can be further enhanced for detecting stuttering disorder for daily speaking words where Microsoft Visual C++ 6.0 and Goldwave have been used for developing the software which can be executed under the window-based environment.