Research on speech recognition models in the Chinese dictation machine



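Cite this article:	Zheng Fang (郑方), Wu Wenhu (吴文虎), Fang Ditang (方棣棠). Research on speech recognition models in the Chinese dictation machine [J]. Journal of Tsinghua University (Science and Technology) (清华大学学报(自然科学版)), 1997(9).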

Author names:	Zheng Fang (郑方), Wu Wenhu (吴文虎), Fang Ditang (方棣棠)

Author affiliation:	Department of Computer Science and Technology, Tsinghua University

Funding:	National "863" High-Tech Program

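Abstract (translated from the Chinese):	The speech model and the language model are two very important parts of a speech dictation machine, and the quality of the speech model directly affects the performance of both the language model and the dictation machine. This paper reports extensive comparative experiments, carried out on a large database, on speech recognition units, speech models, and methods for scoring a model's output observation vectors. The experiments show that very good recognition results can be obtained by taking the syllable as the recognition unit, using the center distance continuous probability model based on the center distance normal distribution, and scoring output observation vectors by the nearest-neighbor principle, that is, the embedded multi-template scheme.
Keywords (translated from the Chinese):	center distance normal distribution; center distance continuous probability model; nearest-neighbor principle; embedded multi-template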
Research on speech recognition models in the Chinese dictation machine

Zheng Fang, Wu Wenhu, Fang Ditang. Research on speech recognition models in the Chinese dictation machine [J]. Journal of Tsinghua University (Science and Technology), 1997(9).

Authors:	Zheng Fang, Wu Wenhu, Fang Ditang

Institution:	Department of Computer Science and Technology, Tsinghua University, Beijing 100084

Abstract:	The speech recognition model and the language model are two extremely important components of the Chinese dictation machine, and the performance of the speech model directly affects that of the language model and of the dictation machine as a whole. Extensive comparative experiments on speech recognition units, speech recognition models, and scoring methods for output observation vectors were carried out on a large speech corpus. The results show that the best performance is achieved by choosing the syllable as the speech recognition unit, using the CDCPM (center distance continuous probability model) based on the CDN (center distance normal) distribution, and adopting an NN (nearest neighbor) based scoring scheme, i.e., the embedded multi-model (EMM) scheme.
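The nearest-neighbor scoring idea named in the abstract can be illustrated with a small sketch. The abstract gives no formulas, so everything concrete here is an assumption made for illustration only: each model state is taken to hold several templates (a center vector plus a spread `sigma`), the distance is Euclidean, and the CDN density is assumed to be a half-normal density over the non-negative distance to the center. The function names are hypothetical and are not taken from the paper.

```python
import math


def cdn_log_density(distance, sigma):
    """Log of a half-normal density over a non-negative distance.

    ASSUMPTION: this half-normal form stands in for the CDN distribution;
    the abstract does not state the actual formula.
    """
    if distance < 0 or sigma <= 0:
        raise ValueError("distance must be >= 0 and sigma must be > 0")
    norm = 2.0 / (math.sqrt(2.0 * math.pi) * sigma)
    return math.log(norm) - (distance ** 2) / (2.0 * sigma ** 2)


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest_neighbor_score(observation, templates):
    """Score an observation against the closest of several templates.

    templates: non-empty list of (center, sigma) pairs (hypothetical layout).
    Returns (log_score, index_of_nearest_template).
    """
    if not templates:
        raise ValueError("at least one template is required")
    distances = [euclidean(observation, center) for center, _ in templates]
    # Nearest-neighbor rule: only the closest template contributes to the score.
    nearest = min(range(len(distances)), key=distances.__getitem__)
    _, sigma = templates[nearest]
    return cdn_log_density(distances[nearest], sigma), nearest


if __name__ == "__main__":
    # Two illustrative templates for one state; the values are made up.
    state_templates = [((0.0, 0.0), 1.0), ((3.0, 4.0), 1.0)]
    print(nearest_neighbor_score((0.0, 1.0), state_templates))
```

Under these assumptions, a state with several templates is scored by its single best-matching template instead of by a weighted sum over all of them, which is the contrast a nearest-neighbor rule draws with a mixture-style score.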

This article is indexed in CNKI and other databases.
