
Speaker Adaptation with Transformation Matrix Linear Interpolation
Authors: XU Xiang-hua, ZHU Jie
Affiliation: Department of Electronic Engineering, Shanghai Jiaotong University, Shanghai 200030, China
Funding: The Science and Technology Committee of Shanghai (01JC14033)
Abstract: A transformation matrix linear interpolation (TMLI) approach for speaker adaptation is proposed. TMLI uses the transformation matrices produced by MLLR from selected training speakers and from the testing speaker. With only 3 adaptation sentences, it shows a 12.12% word error rate reduction. As the number of adaptation sentences increases, however, the performance saturates quickly. To improve the behavior of TMLI for large amounts of adaptation data, the TMLI+MAP method, which combines TMLI with the MAP technique, is proposed. Experimental results show that TMLI+MAP achieves better recognition accuracy than MAP and MLLR+MAP for both small and large amounts of adaptation data.

Chinese keywords: speech recognition; speaker adaptation; transformation matrix linear interpolation; MAP; MLLR; model
Received: 1 March 2004
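
The interpolation idea named in the abstract can be illustrated with a minimal sketch. It assumes each MLLR transform is stored as a d x (d+1) matrix W = [b | A] applied to the extended mean [1, mu_1, ..., mu_d], which is the usual MLLR mean-transform convention. The function names, the toy matrices, and the fixed weights are hypothetical; the abstract does not say how the interpolation weights are estimated or how training speakers are selected, so the weights are simply passed in here.

```python
from typing import List, Sequence

Matrix = List[List[float]]


def interpolate_transforms(transforms: Sequence[Matrix],
                           weights: Sequence[float]) -> Matrix:
    """Return the weighted sum of equally shaped transform matrices.

    The weights are taken as given (an assumption of this sketch).
    """
    if not transforms or len(transforms) != len(weights):
        raise ValueError("need exactly one weight per transform")
    rows, cols = len(transforms[0]), len(transforms[0][0])
    combined = [[0.0] * cols for _ in range(rows)]
    for weight, transform in zip(weights, transforms):
        for i in range(rows):
            for j in range(cols):
                combined[i][j] += weight * transform[i][j]
    return combined


def adapt_mean(transform: Matrix, mean: Sequence[float]) -> List[float]:
    """Apply W = [b | A] to the extended mean vector [1, mu_1, ..., mu_d]."""
    extended = [1.0, *mean]
    return [sum(w * x for w, x in zip(row, extended)) for row in transform]


# Toy 2-dimensional example with two hypothetical speaker transforms.
identity = [[0.0, 1.0, 0.0],
            [0.0, 0.0, 1.0]]    # leaves the mean unchanged
shifted = [[2.0, 1.0, 0.0],
           [-2.0, 0.0, 1.0]]    # adds the bias (2, -2) to the mean

w = interpolate_transforms([identity, shifted], [0.5, 0.5])
adapted = adapt_mean(w, [3.0, 4.0])
print(adapted)  # [4.0, 3.0]
```

With equal weights the interpolated transform applies half of the second speaker's bias, so the mean (3, 4) moves to (4, 3). In practice the weights would be estimated from the adaptation data rather than fixed by hand.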

XU Xiang-hua, ZHU Jie. Speaker Adaptation with Transformation Matrix Linear Interpolation[J]. Wuhan University Journal of Natural Sciences, 2004, 9(6): 927-930.
Keywords: speech recognition; speaker adaptation; MLLR; MAP; maximum likelihood model interpolation (MLMI)