首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于Fourier-Bessel展开和Karhunen-Loeve变换的语音变换
引用本文:岳振军,宋巍,王浩,张雄伟.基于Fourier-Bessel展开和Karhunen-Loeve变换的语音变换[J].解放军理工大学学报,2008,9(1):15-19.
作者姓名:岳振军  宋巍  王浩  张雄伟
作者单位:[1]解放军理工大学理学院,江苏南京211101 [2]解放军理工大学通信工程学院,江苏南京210007
摘    要:为了有效地进行语音变换,改善变换后语音的自然度和目标人倾向度.依据语音信号传播机理和(Fourier-Bessel)展开式系数对语音信号的表现能力,提出了利用F-B展开系数作为变换参数.在该算法中,根据F-B展开系数无语音相位信息的特点,提出基于最大基频相位的语音分帧算法;针对F-B展开式数据量过大的问题,提出了基于Karhunen-Loeve变换的参数压缩算法,转换模型使用GMM(Gaussian mixture model)模型.对算法进行了仿真实验.对变换后语音所进行的ABX测试表明,算法能够较好地完成语音变换,变换后语音的目标人趋向度比较高.

关 键 词:语音变换  傅里叶-贝塞尔展开  Karhunen-Loeve变换  高斯混合模型
文章编号:1009-3443(2008)01-0015-05
修稿时间:2007年1月7日

Voice transform method based on Fourier-Bessel expansion and Karhunen-Loeve transform
YUE Zhen-jun,SONG Wei,WANG Hao and ZHANG Xiong-wei.Voice transform method based on Fourier-Bessel expansion and Karhunen-Loeve transform[J].Journal of PLA University of Science and Technology(Natural Science Edition),2008,9(1):15-19.
Authors:YUE Zhen-jun  SONG Wei  WANG Hao and ZHANG Xiong-wei
Institution:Institute of Sciences,PLA Univ.of Sci.& Tech.,Nanjing 211101,China;Institute of Communications Engineering,PLA Univ.of Sci.& Tech.,Nanjing 210007,China;Institute of Sciences,PLA Univ.of Sci.& Tech.,Nanjing 211101,China;Institute of Communications Engineering,PLA Univ.of Sci.& Tech.,Nanjing 210007,China
Abstract:In order to impro ve the ef fect of vo ice conv ersion and reduce the data quant ity, an algo rithm of voice co nv ersion metho d w as pr opo sed based on Fo ur ier-Bessel ex pansion and K -L ( Karhunen-Loeve ) t ransfo rm, w hose transform model is GMM. This mo del of alg orithm appro aches the rule of vo ice t ransmission and can describe voice sig nals in vivid detail. In the alg orithm, a f rame-divided method w as put fo rw ard based on maximum phase of fundamental frequency and a parameter -compressed method based on K-L t ransform to aim at a bet ter ef fect . The co rresponding emulat ional experiment s w ere carried out , and the resul t of the ABX test to the t ransformed signal show s that the algo rithm performs well
Keywords:voice conver sion  F-B( Fourier-Bessel) expansion  K-L ( Karhunen-Loeve) t ransform  GMM model
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《解放军理工大学学报》浏览原始摘要信息
点击此处可从《解放军理工大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号