首页 | 本学科首页   官方微博 | 高级检索  
     

基于超音段韵律特征和GMM-UBM的文本无关的说话人识别
引用本文:许东星,戴蓓缮,刘青松,许敏强. 基于超音段韵律特征和GMM-UBM的文本无关的说话人识别[J]. 中国科学技术大学学报, 2010, 40(2). DOI: 10.3969/j.issn.0253-2778.2010.02.009
作者姓名:许东星  戴蓓缮  刘青松  许敏强
作者单位:中国科学技术大学电子科学与技术系,安徽合肥,230027
摘    要:提出一种采用超音段韵律特征和GMM-UBM模型结构的文本无关的说话人识别方法,用多尺度小波分析方法从短时倒谱参数MFCC和基频F0随时间变化的韵律中分别提取可用于文本无关说话人识别的超音段韵律特征参数PMFCC和PF0,并组成联合参数PMFCCF0.在NIST068side-1side复杂背景电话手机语音数据库上的说话人确认实验则表明,采用一阶小波分析方法提取的超音段韵律参数PMFCC的识别性能与短时MFCC相当,采用超音段韵律特征PMFCCF0的系统确认性能比采用短时MFCC系统有较大的提高.在微软数据库进行不同信噪比测试语音的说话人辨认实验表明,PMFCCF0有比短时MFCC更好的噪声鲁棒性.

关 键 词:超音段韵律特征  文本无关  说话人识别

Text-independent speaker recognition based on super-segment prosodic feature and GMM-UBM
XU Dongxing,DAI Beiqian,LIU Qingsong,XU Minqiang. Text-independent speaker recognition based on super-segment prosodic feature and GMM-UBM[J]. Journal of University of Science and Technology of China, 2010, 40(2). DOI: 10.3969/j.issn.0253-2778.2010.02.009
Authors:XU Dongxing  DAI Beiqian  LIU Qingsong  XU Minqiang
Affiliation:XU Dongxing,DAI Beiqian,LIU Qingsong,XU Minqiang(Department of Electronic Science , Technology,University of Science , Technology of China,Hefei 230027,China)
Abstract:A text-independent speaker recognition method was proposed based on the super-segment prosodic feature and GMM-UBM.With wavelet multiresolution analysis,the super-segment prosodic feature PF0 from F0~t and PMFCC from MFCC~t were extracted,which were used for text-independent speaker recognition and could be combined as PMFCCF0.Experiments of speaker identification in different SNRs on Microsoft database indicate that PMFCCF0 is more robust than MFCC.Experiments on the 2006 NIST 8side-1side subset speaker re...
Keywords:GMM-UBM  super-segment prosodic feature  GMM-UBM  text-independent  speaker recognition
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号