快速口音自适应的动态说话人选择性训练 Dynamic speaker selected training for rapid speaker adaptation期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

快速口音自适应的动态说话人选择性训练

引用本文：	董明,刘加,刘润生.快速口音自适应的动态说话人选择性训练[J].清华大学学报(自然科学版),2005,45(7):912-915.

作者姓名：	董明刘加刘润生

作者单位：	清华大学电子工程系,北京,100084

基金项目：	国家自然科学基金资助项目(60272016)

摘要：	为解决语音识别系统实用中的说话人口音快速自适应问题,提出了一种动态说话人选择性训练方法。基于说话人选择性训练方法,采用基于Gauss混合模型似然分数计算的置信测度选择训练用说话人,改变训练用说话人的绝对数目选取方式,提高了选取的效能并拓展了选取标准的推广性。根据各个训练用说话人同被适应说话人的不同似然程度,加权地合成动态说话人选择性训练的语音模型,提高了自适应训练的效果。实验表明:该方法使识别率从80.16%提高到84.12%,相对误识率降低了19.96%,在实用中提高了基线系统的识别性能。
关键词：	语音识别说话人快速自适应置信测度
文章编号：	1000-0054(2005)07-0912-04
修稿时间：	2004年6月9日
Dynamic speaker selected training for rapid speaker adaptation

DONG Ming,LIU Jia,Liu Runsheng.Dynamic speaker selected training for rapid speaker adaptation[J].Journal of Tsinghua University(Science and Technology),2005,45(7):912-915.

Authors:	DONG Ming LIU Jia Liu Runsheng

Abstract:	Practical speech recognition systems need rapid speaker adaptation to be effective with a wide variety of speakers. A dynamic speaker selected training method developed for rapid speaker adaptation improves the basic speaker selected training method by replacing the absolute number selection method used in the basic method with a confidence measure calculated from the Gaussian mixture model likelihood. The new method enhances both the training speaker selecting efficiency and the selecting adaptability. The dynamic acoustic model, which uses different weightings for each training speaker so that they resemble the adapted speaker, further increases the recognition accuracy rate. Simulation show that the dynamic method improves the baseline recognition accuracy rate from 80.1% to (84.1%), with a decrease of 19.96% in the relative error rate. Thus, the dynamic method rapidly increases practical speech recognition system performance.

Keywords:	speech recognition rapid speaker adaptation confidence measure
本文献已被 CNKI 万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏