鉴别性最大后验概率线性回归说话人自适应研究

齐耀辉; 潘复平; 葛凤培; 颜永红

doi:10.15918/j.tbit1001-0645.2015.09.013

鉴别性最大后验概率线性回归说话人自适应研究

Investigation on Discriminative Maximum a Posteriori Linear Regression for Speaker Adaptation

摘要

摘要: 为增强自适应后的声学模型的鉴别能力,提出了一种基于最大互信息(MMI)的鉴别性最大后验概率线性回归(MMI-DMAPLR)说话人自适应方法. 将最大互信息准则和最大后验概率(MAP)准则相结合,设计了一个新的目标函数来估计基于线性变换的自适应方法中的变换参数,在最大后验概率估计中加入了鉴别性. 大词汇量连续语音识别的实验结果表明,新方法在增强声学模型与测试数据的匹配性的同时,可以有效提高声学模型的鉴别能力,在少量自适应数据的情况下,其性能比最大后验概率线性回归(MAPLR)相对提高4.8%.

Abstract: In order to increase the discriminative capability of the adapted acoustic model, the maximum mutual information based discriminative maximum a posteriori linear regression (MMI-DMAPLR) adaptation method was proposed. Combining the maximum mutual information criterion with maximum a posteriori (MAP) criterion, a new objective function was designed to estimate the transform parameters of adaptation method based on the linear transformation, to increase the discriminative capability in maximum a posteriori estimation. The experimental results in large vocabulary continuous recognition show that the proposed method can both enhance the match degree between the acoustic model and the test data and the discriminative power of acoustic model. Compared with maximum a posteriori linear regression (MAPLR), the proposed method can obtain 4.8% relative reduction in word error rate when the amount of data is limited.

HTML全文

参考文献(13)

施引文献

资源附件(0)