两级决策的开集说话人辨认方法 Method of open-set speaker identification with two-level decision strategy期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

两级决策的开集说话人辨认方法

引用本文：	何致远,胡起秀,徐光祐.两级决策的开集说话人辨认方法[J].清华大学学报(自然科学版),2003,43(4):516-520.

作者姓名：	何致远胡起秀徐光祐

作者单位：	清华大学,计算机科学与技术系,北京,100084

基金项目：	国家“八六三”高技术项目 ( 863 -3 0 6-ZT0 3 -0 1-1)，国家教育振兴计划

摘要：	为了减少语音数据量 ,提高处理速度和识别的准确性 ,提出了一种采用公共码本、个人隐 Markov模型 (HMM)和个人拒识阈值进行两级决策来实现开集说话人辨认的新方法。在系统实现时 ,采用了一种改进的语音切分算法来提高输入数据的有效性 ,并将说话人识别和人脸识别融合在一起进行身份验证。实验证明这种融合方法能够有效地降低识别的相等错误率至 1%。
关键词：	说话人识别说话人辨认语音切分隐Markov模型
文章编号：	1000-0054(2003)04-0516-05
修稿时间：	2002年2月25日
Method of open-set speaker identification with two-level decision strategy

HE Zhiyuan,HU Qixiu,XU Guangyou.Method of open-set speaker identification with two-level decision strategy[J].Journal of Tsinghua University(Science and Technology),2003,43(4):516-520.

Authors:	HE Zhiyuan HU Qixiu XU Guangyou

Abstract:	To reduce required speech data and improve the processing speed and the recognition precision, this paper presents a novel speaker identification method using the public codebook, the individual hidden Markov model (HMM) and the individual threshold of rejection to make a two level decision strategy. The system used an improved algorithm of speech segmentation to extract the available speech data from utterances. An approach of integrating the speaker recognition with the face recognition to verify a person's identity could further reduce the equal error rate to 1%.

Keywords:
本文献已被 CNKI 万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏