Signal segmentation and its application in the feature extraction of speech

Speech is considered as a time-varying signal since the parameters of the signal such as the amplitude, frequency and phase varies in time. Segmenting a duration of captured speech into analysis frames of 20 msecs ensures the assumption of stationarity. If a captured speech segment representing a wo...

詳細記述

保存先:
書誌詳細
主要な著者: Abdul Rahman, Ahmad Idil, Shaikh Salleh, Sheikh Hussain, Sha’ameri, Ahmad Zuri, AI-Attas, Syed Abdul Rahman
フォーマット: 論文
言語:English
出版事項: 2000
主題:
オンライン・アクセス:http://eprints.utm.my/id/eprint/2300/1/Rahman2000__SignalSegmentationandItsApplication.pdf
http://eprints.utm.my/id/eprint/2300/
タグ: タグ追加
タグなし, このレコードへの初めてのタグを付けませんか!
その他の書誌記述
要約:Speech is considered as a time-varying signal since the parameters of the signal such as the amplitude, frequency and phase varies in time. Segmenting a duration of captured speech into analysis frames of 20 msecs ensures the assumption of stationarity. If a captured speech segment representing a word that may last for 600 msec, then a total of 30 analysis frames are required to the word. Due to the possibility that adjacent frames are identical, then it would be of interest to combine these frames into a single long frame. The interval where adjacent frames have identical parameters is referred as the time-invariant interval (TII). It is of interest to determine these intervals and two methods presented are the instantaneous energy and frequency estimation (IEFE) and localized time correlation (LTC) function. A comparison is made in the accuracy in the TII estimate for a set of speech samples