Merging of native and non-native speech for low-resource accented ASR

Full Description

Bibliographic Details
Main Authors: Samson Juan, Sarah; Besacier, Laurent; Lecouteux, Benjamin; Tan, Tien-Ping
Format: E-Article
Language: English
Published: Springer Verlag, 2015
Subjects:
Online Access: http://ir.unimas.my/id/eprint/12098/1/No%2035%20%28abstrak%29.pdf
http://ir.unimas.my/id/eprint/12098/
http://www.scopus.com/inward/record.url?eid=2-s2.0-84952362047&partnerID=40&md5=6bc512988afc29cd7ca4af16a836f0b3
Additional Bibliographic Description
Abstract: This paper presents our recent study on low-resource automatic speech recognition (ASR) systems for accented speech. We propose multi-accent Subspace Gaussian Mixture Models (SGMM) and accent-specific Deep Neural Networks (DNN) for improving non-native ASR performance. In the SGMM framework, we present an original language-weighting strategy to merge the globally shared parameters of two models trained on native and non-native speech, respectively. In the DNN framework, a native deep neural network is fine-tuned to non-native speech. Over the non-native baseline, we achieved relative improvements of 15% with multi-accent SGMM and 34% with accent-specific DNN with speaker adaptation.
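The language-weighting idea described in the abstract, merging the globally shared parameters of a native and a non-native SGMM, can be sketched as a simple linear interpolation. The function name, parameter names, and the single scalar weight below are illustrative assumptions; the paper's actual weighting strategy may differ:

```python
import numpy as np

def merge_shared_params(native_params, nonnative_params, w):
    """Linearly interpolate globally shared SGMM parameters.

    `w` weights the native model and (1 - w) the non-native model.
    This single scalar weight is an illustrative simplification of
    the paper's language-weighting strategy.
    """
    merged = {}
    for name, native in native_params.items():
        nonnative = nonnative_params[name]
        # Elementwise interpolation of each shared parameter matrix.
        merged[name] = w * native + (1.0 - w) * nonnative
    return merged

# Toy example with two hypothetical shared matrices (e.g. phonetic
# subspace projections), not actual model parameters.
native = {"M_1": np.ones((2, 2)), "M_2": np.zeros((2, 2))}
nonnative = {"M_1": np.zeros((2, 2)), "M_2": np.ones((2, 2))}
merged = merge_shared_params(native, nonnative, w=0.25)
```

With `w=0.25`, each merged matrix lies a quarter of the way from the non-native parameters toward the native ones.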