首页 | 本学科首页   官方微博 | 高级检索  
     检索      

含语音增强模块的i-向量说话人识别性能分析
引用本文:李昕,李为,游寒旭,朱杰.含语音增强模块的i-向量说话人识别性能分析[J].上海师范大学学报(自然科学版),2016,45(2):237-242.
作者姓名:李昕  李为  游寒旭  朱杰
作者单位:上海交通大学,上海交通大学,上海交通大学,上海交通大学
基金项目:国家自然科学基金(61271349,61371147,11433002);上海交通大学医工合作基金(YG2012ZD04)
摘    要:为解决文本无关说话人识别中训练与识别环境不同导致模式失配的问题,提出了一种采用语音增强模块进行前端预处理的i-向量说话人识别系统,从而提高系统对于环境噪声的鲁棒性.为评估不同语音增强算法的性能,利用NIST08核心测试集进行仿真实验.采用IMCRA算法对语音进行噪声估计后,分别用维纳滤波法、MMSE-LSA、传统谱减法和多频带谱减法等4种方法进行语音增强前端处理,在基于i-向量的说话人识别系统下进行实验.实验结果表明采用了语音增强的系统具有一定抗噪声性能,并且在高信噪比条件下,基于多频带的谱减法在此系统下性能最佳,而低信噪比情况下MMSE-LSA算法更有优势.

关 键 词:说话人识别    i-向量    语音增强    维纳滤波    MMSE    谱减法
收稿时间:2016/2/29 0:00:00

Speech enhancement ini-vector speaker verification system
LI Xin,LI Wei,YOU Hanxu and ZHU Jie.Speech enhancement ini-vector speaker verification system[J].Journal of Shanghai Normal University(Natural Sciences),2016,45(2):237-242.
Authors:LI Xin  LI Wei  YOU Hanxu and ZHU Jie
Institution:School of Electronic Information and Electrical Engineering,Shanghai Jiao Tong University,School of Electronic Information and Electrical Engineering,Shanghai Jiao Tong University,School of Electronic Information and Electrical Engineering,Shanghai Jiao Tong University and School of Electronic Information and Electrical Engineering,Shanghai Jiao Tong University
Abstract:To solve the model-mismatch problem in text-independent speaker verification system when training environment differs from recognition environment,We propose a i-vector speaker verification system using speech enhancement in front-end preprocessing it can improve the system robustness to additive noise.To estimate the performance of different speech enhancement methods,we used NIST08 core test set in the experiment.Four speech enhancement methods,including wiener filtering,MMSE-LSA,traditional spectral subtraction and multi-band spectral subtraction,combining with IMCRA noise estimation,were evaluated in the speaker verification system based on i-vector.The result shows the proposed system with speech enhancement had some improvement in noise environment and that multi-band spectral subtraction method performed the best when SNR was relatively high and MMSE-LSA performed the best when SNR was low.
Keywords:speaker verification  i-vector  speech enhancement  wiener filtering  MMSE  spectral subtraction method
本文献已被 CNKI 等数据库收录!
点击此处可从《上海师范大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《上海师范大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号