A novel L-vector representation and improved cosine distance kernel for text-dependent speaker verification
Cite this article: LI Wei, YOU Hanxu, ZHU Jie, CHEN Ning. A novel L-vector representation and improved cosine distance kernel for text-dependent speaker verification[J]. Journal of Shanghai Normal University (Natural Sciences), 2016, 45(2): 243-247.
Authors: LI Wei, YOU Hanxu, ZHU Jie, CHEN Ning
Institution: School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University (LI Wei, YOU Hanxu, ZHU Jie); School of Information Science and Engineering, East China University of Science and Technology (CHEN Ning)
Funding: National Natural Science Foundation of China (NSFC), Grants 61271349, 61371147, and 11433002; Shanghai Jiao Tong University joint research fund for Biomedical Engineering, Grant YG2012ZD04
Abstract: A text-dependent i-vector extraction scheme and a lexicon-based binary vector (L-vector) representation are proposed to improve the performance of text-dependent speaker verification. Each utterance used for enrollment or testing is represented jointly by these two vectors. An improved cosine distance kernel that combines the i-vector and the L-vector is constructed for the back-end support vector machine (SVM) classifier, so that it discriminates both speaker identity and lexical (text) content. Experiments are conducted on Part 1 and Part 2 of the RSR 2015 corpus. The results indicate an improvement of up to 30% over the traditional i-vector baseline.
Keywords: text-dependent speaker verification; i-vector; L-vector; cosine distance kernel
Received: 2016-02-29
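
The abstract does not state the formula of the improved kernel, so the following is only a minimal sketch of the general idea: a kernel that scores high when both the speaker representation (i-vector) and the lexical representation (binary L-vector) agree. The product form, the function names `cosine` and `combined_kernel`, and the toy vectors are all assumptions made for illustration, not the authors' method.

```python
import math
from typing import Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def combined_kernel(
    ivec_a: Sequence[float],
    lvec_a: Sequence[float],
    ivec_b: Sequence[float],
    lvec_b: Sequence[float],
) -> float:
    """Hypothetical combined kernel between two utterances.

    ASSUMPTION: the product of two cosine kernels is used here purely
    for illustration; the paper's actual kernel may differ.
    """
    speaker_score = cosine(ivec_a, ivec_b)  # agreement of i-vectors
    lexical_score = cosine(lvec_a, lvec_b)  # agreement of binary L-vectors
    return speaker_score * lexical_score


# Toy data (made up): same speaker, same phrase vs. same speaker, other phrase.
ivec = [0.6, 0.8]
phrase_1 = [1, 0, 1, 0]
phrase_2 = [0, 1, 0, 1]

same_text = combined_kernel(ivec, phrase_1, ivec, phrase_1)
diff_text = combined_kernel(ivec, phrase_1, ivec, phrase_2)
print(same_text > diff_text)
```

In this sketch a lexical mismatch pulls the score down even when the i-vectors match, which is the behavior the abstract attributes to the improved kernel. A product of two cosine kernels is itself a valid (positive semidefinite) kernel, so a Gram matrix built this way could in principle be passed to an SVM that accepts precomputed kernels, such as scikit-learn's `SVC(kernel="precomputed")`.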