首页 | 本学科首页   官方微博 | 高级检索  
     检索      

Mel 频率下基于 LPC 的语音信号深度特征提取算法
引用本文:罗元,吴承军,张毅,黎小松,席兵.Mel 频率下基于 LPC 的语音信号深度特征提取算法[J].重庆邮电大学学报(自然科学版),2016,28(2):174-179.
作者姓名:罗元  吴承军  张毅  黎小松  席兵
作者单位:1. 重庆邮电大学光电信息感测与传输技术重点实验室,重庆,400065;2. 重庆邮电大学信息无障碍工程研发中心,重庆,400065
基金项目:重庆市自然科学基金重点项目(CSTC2015jcyjB0241);重庆市教委科技项目(KJ13051)
摘    要:针对传统语音信号二次特征提取方法在保证识别率的前提下,实时性较差的问题,提出一种Mel频率下基于线性预测系数(linear predictive coefficient,LPC)的改进的语音信号深度特征提取算法.该方法根据人耳的听觉特性把LPC在Mel频率下进行非线性变换,再进行微分、高阶微分和按比例重组等步骤,得到一种既考虑声道激励又兼顾人耳听觉的新特征参数,从而大大减少传统语音信号深度特征提取的计算量,在不影响识别效率的情况下,极大提高系统的实时性.最后,将该算法在智能轮椅平台进行有效性验证,大量实验表明,语音控制系统实时性差的问题在使用该算法后能够得到明显改善,该算法既保证了特征提取识别率,也有效地改善了系统的实时性.在一定程度上使语音控制智能轮椅更具实用性.

关 键 词:语音识别  线性预测系数  Mel频率倒谱系数  Mel-LPC算法  深度特征提取
收稿时间:2014/12/4 0:00:00
修稿时间:2015/10/4 0:00:00

A further speech signal features extraction algorithm based on LPC Mel frequency scale
LUO Yuan,WU Chengjun,ZHANG Yi,LI Xiaosong and Xi Bing.A further speech signal features extraction algorithm based on LPC Mel frequency scale[J].Journal of Chongqing University of Posts and Telecommunications,2016,28(2):174-179.
Authors:LUO Yuan  WU Chengjun  ZHANG Yi  LI Xiaosong and Xi Bing
Institution:Key Lab of Optical Sensing Information and Transmission Technology, Chongqing University of Posts and Telecommunications,Chongqing 400065, P. R. China,Key Lab of Optical Sensing Information and Transmission Technology, Chongqing University of Posts and Telecommunications,Chongqing 400065, P. R. China,Engineering Research & Development Center of Information Accessibility,Chongqing University of Posts and Telecommunications, Chongqing 400065,P. R. China,Engineering Research & Development Center of Information Accessibility,Chongqing University of Posts and Telecommunications, Chongqing 400065,P. R. China and Key Lab of Optical Sensing Information and Transmission Technology, Chongqing University of Posts and Telecommunications,Chongqing 400065, P. R. China
Abstract:According to the bad real-time performance of the traditional further speech signal features extraction algorithm in the premise of ensuring the recognition rate, a further speech signal features extraction algorithm based on linear predictive coefficient(LPC) Mel frequency scale is put forward in this paper. This method transforms LPC with Mel-frequency in a nonlinear way, calculates the derivative, high order differential and combines the feature according to a certain proportion to realize a new features parameter which takes both the channel incentives and the human auditory into account. So the calculation quantity of the traditional speech signal further features extraction is decreased sharply. The real-time performance of the system is improved in the premise of ensuring the recognition rate. Through the intelligent wheelchair platform to verify the validity of the algorithm a lot of experiments show that the problem of real-time performance is not good of traditional algorithm can be improved effectively; this algorithm can improve the real-time performance and the practicability, on the basis of ensuring the recognition rate of the further features extraction.
Keywords:speech recognition  linear prediction coefficient  Mel-frequency cepstrum coefficients  Mel-LPC algorithm  further features extraction
本文献已被 万方数据 等数据库收录!
点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆邮电大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号