
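A Method for Improving Mispronunciation Detection Based on an Optimized Detection Network and MLP Features
Cite this article: YUAN Hua, QIAN Yanmin, ZHAO Junhong, LIU Jia. A method for improving mispronunciation detection based on an optimized detection network and MLP features [J]. Journal of Tsinghua University (Science and Technology), 2012(4): 557-560, 570.
Authors: YUAN Hua, QIAN Yanmin, ZHAO Junhong, LIU Jia
Affiliations: Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering, Tsinghua University; State Key Laboratory on Transducing Technology, Institute of Electronics, Chinese Academy of Sciences
Funding: National Natural Science Foundation of China (60931160443, 90920302, N-CUHK414/09); National Key Technology R&D Program of China (2009BAH41B01)
Abstract: This paper proposes a method, based on an optimized detection network and multi-layer perceptron (MLP) features, that identifies the type of a mispronunciation more accurately. First, basic and combined phonological rules are extracted from a second-language (L2) learner speech corpus, and the prior probability of each rule is computed. These rules, together with their prior probabilities, are used to build an extended detection network that encodes multiple pronunciations. Second, during detection, MLP features derived from articulatory features are used to describe pronunciation probabilities, replacing conventional acoustic features. Finally, a GMM-HMM framework trained on the MLP features selects the most probable phoneme sequence from the detection network. Experiments show that the method improves phoneme recognition accuracy by 3.11% and mispronunciation type accuracy by 7.42%.

Keywords: mispronunciation detection; phonological rules; multi-layer perceptron (MLP); articulatory features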

Mispronunciation detection with an optimized detection network and multi-layer perception based features
YUAN Hua, QIAN Yanmin, ZHAO Junhong, LIU Jia. Mispronunciation detection with an optimized detection network and multi-layer perception based features [J]. Journal of Tsinghua University (Science and Technology), 2012(4): 557-560, 570.
Authors: YUAN Hua, QIAN Yanmin, ZHAO Junhong, LIU Jia
Institution: 1. Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering, Tsinghua University, Beijing 100084, China; 2. State Key Laboratory on Transducing Technology, Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China
Abstract: This paper describes a method that combines an optimized detection network with multi-layer perceptron (MLP) features to identify mispronunciations more accurately. First, basic and combined phonological rules are extracted from an L2 speech corpus, and the prior probability of occurrence is computed for each rule. The rules and their prior probabilities are then used to build an extended detection network based on multiple pronunciations. During detection, articulatory-based MLP features replace conventional acoustic features to describe the pronunciation probability. Finally, a GMM-HMM framework with MLP features picks the most probable phoneme sequence from the detection network. Tests show that this approach improves phoneme recognition accuracy by 3.11% and mispronunciation type accuracy by 7.42%.
Keywords: mispronunciation detection; phonological rules; multi-layer perceptron (MLP); articulatory features
This article is indexed in CNKI and other databases.