首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于子词PSPL的汉语语音文档索引
引用本文:陆明明,张连海,屈丹.基于子词PSPL的汉语语音文档索引[J].应用科学学报,2013,31(3):259-265.
作者姓名:陆明明  张连海  屈丹
作者单位:解放军信息工程大学信息工程学院,郑州450002
基金项目:国家自然科学基金(No.61175017)资助
摘    要:针对汉语语音文档检索中最优识别单元和检索单元不一致的问题,提出一种基于子词(position specific posterior lattices, PSPL)的语音文档索引方法;该方法以词为识别单元对语音文档进行解码,得到PSPL;然后对PSPL进行子词切分,并根据子词弧与原始词弧的后验概率关系,将PSPL转换为相应的子词PSPL,以子词PSPL为索引进行查询项检索. 实验结果表明,所提出的方法在利用丰富语言信息的同时,解决了词解码器存在的边界分割不正确的问题,检索性能明显优于目前普遍使用的识别单元和检索单元均为词的PSPL索引方法.

关 键 词:语音文档检索  语音文档索引  子词PSPL  词格  子词后验概率  
收稿时间:2011-10-14
修稿时间:2011-12-31

Subword-Based Position Specific Posterior Lattices for Chinese Spoken Document Indexing
LU Ming-ming,ZHANG Lian-hai,QU Dan.Subword-Based Position Specific Posterior Lattices for Chinese Spoken Document Indexing[J].Journal of Applied Sciences,2013,31(3):259-265.
Authors:LU Ming-ming  ZHANG Lian-hai  QU Dan
Institution:School of Information Engineering, PLA Information Engineering University, Zhengzhou 450002, China
Abstract:A spoken document indexing method based on subword-based position specific posterior lattices (SPSPL) is proposed to overcome inconsistency between optimal recognition unit and retrieval unit in the existing Chinese spoken document indexing methods. In the proposed method, a word-based PSPL is generated with a word-based speech recognizer. Each word in the PSPL is replaced by its constituent subword units. According to the posterior probability relationship between each word and its constituent subword units, the original PSPL can be converted to the corresponding S-PSPL to be used in generating a subword-based index for retrieval. Experimental results show that the new method can make use of a well-trained language model, and avoid incorrect segmentation in the word-based recognizer as well. Better performance is obtained compared to the current indexing methods that use words as both recognition and retrieval units.
Keywords:spoken document retrieval  spoken document indexing  subword-based position specific posterior lattices  lattice  subword posterior probability  
点击此处可从《应用科学学报》浏览原始摘要信息
点击此处可从《应用科学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号