面向多口音语音识别的声学模型重构 Acoustic model reconstruction for multi-accent Chinese speech recognition期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

面向多口音语音识别的声学模型重构

引用本文：	张超,刘轶,郑方.面向多口音语音识别的声学模型重构[J].清华大学学报(自然科学版),2011(9):1161-1166.

作者姓名：	张超刘轶郑方

作者单位：	清华信息科学技术国家实验室技术创新与开发部语音和语言技术中心;清华大学计算机科学与技术系;

基金项目：	国家自然科学基金资助项目(60975018); 教育部新教师基金(20090002120012)

摘要：	该文提出了应用声学似然分作为置信度来生成可靠口音相关单元的方法。基于可靠口音相关单元构造声学模型,并通过声学模型重构的方法将它们融合到标准普通话模型中,以改善普通话语音识别器对带多方言口音语音的识别效果。另外,还提出了使用增量式决策树融合及根据支配度选择Gauss混合2种方法来减少冗余的Gauss混合,从而提高了重构后的声学模型的效率。实验表明:该方法在不降低对标准普通话的识别率的前提下,对粤、吴口音的绝对音节错误率分别下降了9.25%和9.21%。
关键词：	语音识别多方言口音可靠口音相关单元声学模型重构
Acoustic model reconstruction for multi-accent Chinese speech recognition

ZHANG Chao,LIU Yi,ZHENG Thomas Fang.Acoustic model reconstruction for multi-accent Chinese speech recognition[J].Journal of Tsinghua University(Science and Technology),2011(9):1161-1166.

Authors:	ZHANG Chao LIU Yi ZHENG Thomas Fang

Institution:	ZHANG Chao1,2,LIU Yi1,ZHENG Thomas Fang1(1.Center for Speech and Language Technologies,Division of Technology Innovation and Development,Tsinghua National Laboratory for Information Science and Technology,Beijing 100084,China,2.Department of Computer Science and Technology,Tsinghua University,China)

Abstract:	The acoustic likelihood score is used as a confidence measure to generate reliable accent-specific units and to merge such reliable accent-specific units through acoustic model reconstruction.The decision tree merge and acoustic model reconstruction efficiencies are improved by reducing redundant Gaussian components through an incremental decision tree merge procedure and selection of Gaussian components according to their dominance.Tests on Cantonese and Wu accents show that this approach yields significan...

Keywords:	speech recognition multiple accents reliable accent-specific unit acoustic model reconstruct
本文献已被 CNKI 等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏