首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种基于粒子滤波的双模态语音提取方法
引用本文:金乃高,殷福亮.一种基于粒子滤波的双模态语音提取方法[J].大连理工大学学报,2008,48(4):596-601.
作者姓名:金乃高  殷福亮
作者单位:大连理工大学,电子与信息工程学院,辽宁,大连,116024
基金项目:国家自然科学基金资助项目
摘    要:说话入的唇动信息有助于加强对语音的感知.根据说话人语音的双模态特性,将振动信息引入语音提取问题,提出了一种基于粒子滤波的贝叶斯融合架构的双模态语音提取方法.该方法融合说话人的语音和唇动信息,根据信息论中的最大互信息准则与盲源分离中的高阶统计量准则.将音视频互信息与语音峭度的乘积作为代价函数,利用粒子滤波估计混合矩阵.解决时变瞬时混合情况下的语音提取问题.仿真结果表明.该方法在低信噪比情况下仍然能够实现语音信号的有效提取.

关 键 词:语音提取  粒子滤波  高阶统计量  最大互信息

Bimodal speech extraction method based on particle filtering
JIN Naigao YIN Fuliang.Bimodal speech extraction method based on particle filtering[J].Journal of Dalian University of Technology,2008,48(4):596-601.
Authors:JIN Naigao YIN Fuliang
Abstract:Lip movement information helps language comprehension when the auditory signal is degraded. A bimodal speech extraction method is presented based on the method of audio-visual signal processing. The particle filtering is used to construct a Bayesian fusion framework for bimodal speech extraction problem. By combining maximum mutual information criterion with higher-order statistics criterion of blind signal separation and estimating mixed matrices by particle filtering method, the proposed method can extract the interested instaneous time-varying speech signal by maximizing the product of kurtosis and audio-visual mutual information. Simulation results show that the proposed method improves the performance of the speech extraction system in the low SNR environment.
Keywords:speech extraction  particle filtering  higher-order statistics  maximum mutual information
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《大连理工大学学报》浏览原始摘要信息
点击此处可从《大连理工大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号