结合高斯混合模型和VOT特征的音素发音错误检测 Phonetic mispronunciation detection based on GMM and VOT期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

结合高斯混合模型和VOT特征的音素发音错误检测

引用本文：	刘明辉,黄中伟.结合高斯混合模型和VOT特征的音素发音错误检测[J].科学技术与工程,2013,13(7):1789-1793.

作者姓名：	刘明辉黄中伟

作者单位：	深圳大学,深圳大学

基金项目：	国家自然科学基金项目（面上项目，重点项目，重大项目）

摘要：	结合高斯混合模型(GMM)和嗓音起始时间(VOT)特征的普通话音素发音错误检测,提出了一种结合语音声道特征信息和音源特征信息的发音错误检测方法。其中GMM用于反映声道特征信息的MFCC参数的建模与评测,并直接对大部分音素的发音质量直接进行错误检测。对于少数通过MFCC参数和GMM难于检测区分的辅音音素,则通过反映VOT信息的音源特征参数进行区分。实验表明,该方法在训练数据有限的情况下取得了较好的性能,非常适合用于聋人语言康复的计算机辅助训练。
关键词：	语音识别发音错误检测高斯混合模型嗓音起始时间
收稿时间：	9/18/2012 9:17:33 AM
修稿时间：	2012/10/11 0:00:00
Phonetic mispronunciation detection based on GMM and VOT

Liu Minghui and Huang Zhongwei.Phonetic mispronunciation detection based on GMM and VOT[J].Science Technology and Engineering,2013,13(7):1789-1793.

Authors:	Liu Minghui and Huang Zhongwei

Institution:	(Phonetic Laboratory,Shenzhen University,Shenzhen 518060,P.R.China)

Abstract:	Based on Gaussian mixture model (GMM) and voice onset time (VOT), a novel Mandarin phonetic mispronunciation detection approach was proposed which combines the spectral feature and the prosodic feature. GMM was used to model the Mel-frequency cepstral coefficients (MFCCs) and assess the pronunciation of most phonemes. For some consonants which are difficult to distinguish, prosodic features reflecting the VOT were extracted and used directly. Mandarin phonetic mispronunciation detection experiments on limited data showed the significant improvement.

Keywords:	speech recognition mispronunciation detection Gaussian mixture model voice onset time
本文献已被 CNKI 等数据库收录！
	点击此处可从《科学技术与工程》浏览原始摘要信息
	点击此处可从《科学技术与工程》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏