Speaker Adaptation with Transformation Matrix Linear Interpolation

Funding:	Science and Technology Committee of Shanghai (01JC14033)

Keywords (Chinese):	speech recognition; speaker adaptation; transformation matrix; linear interpolation; MAP; MLLR; model
Received:	1 March 2004

Xu Xiang-hua, Zhu Jie. Speaker Adaptation with Transformation Matrix Linear Interpolation [J]. Wuhan University Journal of Natural Sciences, 2004, 9(6): 927-930.

Authors:	Xu Xiang-hua, Zhu Jie

Institution:	Department of Electronic Engineering, Shanghai Jiaotong University, Shanghai 200030, China

Abstract:	A transformation matrix linear interpolation (TMLI) approach to speaker adaptation is proposed. TMLI uses the transformation matrices that maximum likelihood linear regression (MLLR) produces for selected training speakers and for the testing speaker. With only 3 adaptation sentences, it achieves a 12.12% reduction in word error rate. As the number of adaptation sentences increases, however, the performance saturates quickly. To improve the behavior of TMLI with large amounts of adaptation data, the TMLI+MAP method, which combines TMLI with the maximum a posteriori (MAP) technique, is proposed. Experimental results show that TMLI+MAP achieves better recognition accuracy than MAP and MLLR+MAP for both small and large amounts of adaptation data.

Keywords:	speech recognition; speaker adaptation; MLLR; MAP; maximum likelihood model interpolation (MLMI)
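
The interpolation idea named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how the interpolation weights are estimated or how TMLI and MAP are combined, so the sketch assumes the weights are already given and sum to one, and it uses the TMLI-adapted mean as the prior in the standard MAP mean update. The function names, the `tau` prior weight, and the bias-first layout of each d x (d+1) transform are assumptions made for illustration.

```python
import numpy as np


def interpolate_transforms(transforms, weights):
    """Weighted sum of MLLR-style transforms, each of shape (d, d+1).

    Assumption: the weights are supplied by the caller and sum to one.
    """
    W = np.stack([np.asarray(t, dtype=float) for t in transforms])  # (K, d, d+1)
    lam = np.asarray(weights, dtype=float)                          # (K,)
    if lam.shape != (W.shape[0],):
        raise ValueError("need exactly one weight per transform")
    if not np.isclose(lam.sum(), 1.0):
        raise ValueError("weights are assumed to sum to one")
    # Contract the speaker axis: sum_k lam[k] * W[k]
    return np.tensordot(lam, W, axes=1)                             # (d, d+1)


def adapt_mean(W, mean):
    """Apply mu' = W @ [1, mu], with the bias stored in the first column."""
    extended = np.concatenate(([1.0], np.asarray(mean, dtype=float)))
    return W @ extended


def map_update_mean(prior_mean, frames, posteriors, tau=10.0):
    """Standard MAP mean update for one Gaussian.

    prior_mean: (d,) prior mean (here assumed to be the TMLI-adapted mean)
    frames:     (T, d) adaptation observations
    posteriors: (T,) occupation probabilities of this Gaussian
    tau:        prior weight (hypothetical default)
    """
    gamma = np.asarray(posteriors, dtype=float)
    obs = np.asarray(frames, dtype=float)
    occupancy = gamma.sum()
    weighted_sum = gamma @ obs                                      # (d,)
    prior = np.asarray(prior_mean, dtype=float)
    return (tau * prior + weighted_sum) / (tau + occupancy)
```

With little adaptation data the occupancy is small and the MAP estimate stays close to the interpolated prior; as data accumulates, the observed frames dominate. That matches the motivation the abstract gives for combining the two techniques.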
This article is indexed in CNKI, VIP, Wanfang Data, SpringerLink, and other databases.
