Rapid speaker adaptation for continuous speech recognition (连续语音识别中的说话人快速自适应技术)

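Cite this article:	Lü Ping, Wu Ji, Wang Zuoying, Lu Da. Rapid speaker adaptation for continuous speech recognition[J]. Journal of Tsinghua University (Science and Technology), 2002, 42(7): 977-980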

Authors:	Lü Ping (吕萍), Wu Ji (吴及), Wang Zuoying (王作英), Lu Da (陆大)

Affiliation:	Department of Electronic Engineering, Tsinghua University, Beijing 100084, China

Funding:	Tsinghua University "985" Major Project (985校-22-攻关-06)

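Abstract (from the Chinese version):	Rapid speaker adaptation has attracted wide attention in speech recognition. This paper reviews the international state of research on rapid speaker adaptation and presents the method proposed by the authors' group: a maximum likelihood model interpolation framework for rapid adaptation, together with its interpolation algorithms. Compared with existing related adaptation methods, the algorithm adapts both the means and the covariances simultaneously on a more complex recognition system and achieves good adaptation results. With only one sentence of adaptation data, the recognition error rate falls from 28.75% to 24.93%.
Keywords:	continuous speech recognition; rapid speaker adaptation; maximum likelihood model interpolation
Article ID:	1000-0054(2002)07-0977-04
Revision date:	2001-11-20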

Abstract:	Fast speaker adaptation techniques for speech recognition are of great interest. This paper summarizes the state of the art in rapid speaker adaptation and introduces four algorithms within a maximum likelihood model interpolation adaptation framework. These algorithms adapt the means and covariances simultaneously, on a more complicated recognizer than existing related methods address, and adapt well from very little data. With just one adaptation sentence, the error rate fell from 28.75% to 24.93%.

Keywords:	continuous speech recognition; rapid speaker adaptation; maximum likelihood model interpolation
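
To put the reported result in perspective, the two error rates quoted in the abstract can be turned into an absolute and a relative reduction. The short Python sketch below does only that arithmetic; the function name `error_reduction` is hypothetical and is not taken from the paper.

```python
def error_reduction(baseline: float, adapted: float) -> tuple[float, float]:
    """Return (absolute reduction in percentage points, relative reduction as a fraction).

    Hypothetical helper for illustration; both inputs are error rates in percent.
    """
    absolute = baseline - adapted      # percentage points
    relative = absolute / baseline     # fraction of the baseline error removed
    return absolute, relative


# Error rates quoted in the abstract: 28.75% before, 24.93% after one adaptation sentence.
absolute, relative = error_reduction(28.75, 24.93)
print(f"absolute: {absolute:.2f} points, relative: {relative:.1%}")
# prints "absolute: 3.82 points, relative: 13.3%"
```

In other words, a single adaptation sentence removes roughly one eighth of the baseline errors.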
This article is indexed in CNKI, Wanfang Data, and other databases.