Bimodal speech extraction method based on particle filtering

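Cite this article:	JIN Naigao, YIN Fuliang. Bimodal speech extraction method based on particle filtering[J]. Journal of Dalian University of Technology, 2008, 48(4): 596-601.

Authors:	JIN Naigao (金乃高), YIN Fuliang (殷福亮)

Affiliation:	School of Electronic and Information Engineering, Dalian University of Technology, Dalian 116024, Liaoning, China

Funding:	Supported by the National Natural Science Foundation of China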

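Abstract:	A speaker's lip-movement information helps strengthen the perception of speech. Exploiting the bimodal nature of a speaker's speech, lip-movement information is introduced into the speech extraction problem, and a bimodal speech extraction method built on a particle-filtering Bayesian fusion framework is proposed. The method fuses the speaker's audio and lip-movement information. Following the maximum mutual information criterion from information theory and the higher-order statistics criterion from blind source separation, it takes the product of the audio-visual mutual information and the speech kurtosis as the cost function and uses particle filtering to estimate the mixing matrix, thereby addressing speech extraction under time-varying instantaneous mixing. Simulation results show that the method still extracts the speech signal effectively at low signal-to-noise ratios.
Keywords:	speech extraction; particle filtering; higher-order statistics; maximum mutual information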
Bimodal speech extraction method based on particle filtering

JIN Naigao, YIN Fuliang. Bimodal speech extraction method based on particle filtering[J]. Journal of Dalian University of Technology, 2008, 48(4): 596-601.

Authors:	JIN Naigao, YIN Fuliang

Abstract:	Lip movement information helps speech comprehension when the auditory signal is degraded. A bimodal speech extraction method based on audio-visual signal processing is presented. Particle filtering is used to construct a Bayesian fusion framework for the bimodal speech extraction problem. The method combines the maximum mutual information criterion with the higher-order statistics criterion of blind signal separation and estimates the mixing matrix by particle filtering. It extracts the speech signal of interest under time-varying instantaneous mixing by maximizing the product of kurtosis and audio-visual mutual information. Simulation results show that the proposed method improves the performance of the speech extraction system in low-SNR environments.

Keywords:	speech extraction; particle filtering; higher-order statistics; maximum mutual information
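
The abstract names the cost function (kurtosis times audio-visual mutual information) and the estimator (particle filtering) but gives no formulas. The following is a minimal sketch of how such an update might look, not the authors' implementation. Everything specific in it is an assumption made for illustration: two pre-whitened mixtures so that the extraction vector can be parametrized by one angle `theta`, short-time log energy as the audio feature, a single scalar visual feature per video frame (for example lip opening), a joint-Gaussian approximation of mutual information, random-walk particle dynamics, and exponential weighting with a hypothetical `sharpness` parameter.

```python
import numpy as np

def excess_kurtosis(y):
    """Excess kurtosis of a 1-D signal (0 for a Gaussian)."""
    y = y - y.mean()
    var = np.mean(y ** 2)
    return np.mean(y ** 4) / (var ** 2 + 1e-12) - 3.0

def gaussian_mutual_information(a, v):
    """Mutual information in nats, ASSUMING a and v are jointly Gaussian scalars."""
    rho = np.corrcoef(a, v)[0, 1]
    rho2 = min(rho ** 2, 1.0 - 1e-9)  # clip to keep the logarithm finite
    return -0.5 * np.log(1.0 - rho2)

def frame_energy(y, frame_len):
    """Short-time log energy, one value per video frame (assumed audio feature)."""
    n = len(y) // frame_len
    frames = y[: n * frame_len].reshape(n, frame_len)
    return np.log(np.mean(frames ** 2, axis=1) + 1e-12)

def cost(theta, x, v, frame_len):
    """Product of |kurtosis| and audio-visual mutual information for one angle."""
    w = np.array([np.cos(theta), np.sin(theta)])  # assumed 2-channel extraction vector
    y = w @ x                                     # candidate extracted signal
    return abs(excess_kurtosis(y)) * gaussian_mutual_information(
        frame_energy(y, frame_len), v
    )

def particle_filter_step(particles, x, v, frame_len, rng,
                         step_std=0.1, sharpness=5.0):
    """One predict / weight / resample cycle over the extraction angle."""
    # Predict: random-walk dynamics model slow drift of the mixing (assumption).
    particles = particles + rng.normal(0.0, step_std, size=particles.shape)
    # Weight: higher cost means a more plausible particle (assumed likelihood).
    costs = np.array([cost(t, x, v, frame_len) for t in particles])
    weights = np.exp(sharpness * (costs - costs.max()))
    weights /= weights.sum()
    # Resample: multinomial resampling proportional to the weights.
    idx = rng.choice(len(particles), size=len(particles), p=weights)
    return particles[idx], costs[idx]
```

In this sketch, `x` is a `(2, n_samples)` block of whitened mixtures and `v` holds one visual value per video frame, so `len(v)` must equal `n_samples // frame_len`. Calling `particle_filter_step` once per block lets the particle cloud follow a slowly changing mixture, and the particle with the largest returned cost gives the current extraction direction.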
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).