首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于PNCC与基频的鲁棒电话语音性别检测方案
引用本文:钟顺明,况鹏,庄豪爽,冯韩德,王剑莹,张涵.基于PNCC与基频的鲁棒电话语音性别检测方案[J].华南师范大学学报(自然科学版),2019,51(6):118-122.
作者姓名:钟顺明  况鹏  庄豪爽  冯韩德  王剑莹  张涵
作者单位:广东省心脑血管个体化医疗大数据工程技术研究中心∥华南师范大学物理与电信工程学院,广州510006;深圳壹鸽科技有限公司,深圳518000
基金项目:国家自然科学基金项目61471176教育部蓝火计划(惠州)产学研项目CXZJHZ201705广东省特支计划项目2016TQ03X100广东省自然科学基金项目2018A030313990广东省科技计划项目2017A010101015广东省科技计划项目2017B030308009广东省科技计划项目2017KZ010101蓝盾产学研基金项目LD20170204蓝盾产学研基金项目LD20170207
摘    要:针对电话语音性别检测存在识别准确率较低的问题,提出了一种有效的电话语音性别检测方案(CNN+SVM); 首先,采用卷积神经网络(Convolutional Neural Network, CNN)提取幂律归一化倒谱系数(Power-Normalized Cepstral Coefficient, PNCC)的有效信息;然后, 结合优化后的基频特征,选用支持向量机(Support Vector Machine, SVM)实现性别分类.该方案有效融合了男、女发音和听觉感知特性上的差异,同时利用了CNN特征提取能力以及SVM鲁棒分类能力.仿真结果表明:CNN+SVM方案针对实际场景电话语音数据集的性别识别准确率优于传统识别方法.

关 键 词:幂律归一化倒谱系数  卷积神经网络  性别检测  支持向量机  基频
收稿时间:2019-06-13

A Robust Gender Recognition Scheme for Telephone Speech Based on PNCC and Fundamental Frequency
Institution:1.Guangdong Provincial R&D Center of Cardiovascular Individual Medicine & Big Data//School of Physics and Telecommunication Engineering, South China Normal University, Guangzhou 510006, China2.Shenzhen Egos Science Technology co., LTD., Shenzhen 518000, China
Abstract:In view of the low recognition accuracy of telephone voice gender detection, an effective gender detection scheme for telephone speech is proposed. Firstly, the Convolutional Neural Network (CNN) is used to extract the effective information of Power-Normalized Cepstral Coefficient (PNCC), and then Support Vector Machine (SVM) is selected to realize gender classification based on the optimized fundamental frequency features. The proposed scheme can effectively study the differences of male and female's pronunciation and auditory perception characteristics, and can benefit from the ability of CNN feature extraction and SVM robust classification. Experimental results show that the proposed scheme outperforms the traditional methods in gender recognition accuracy for the telephone speech data set in practical scenarios.
Keywords:
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《华南师范大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《华南师范大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号