首页 | 本学科首页   官方微博 | 高级检索  
     

基于动态特征词的中文句子相似度计算
引用本文:黄莉. 基于动态特征词的中文句子相似度计算[J]. 宝鸡文理学院学报(自然科学版), 2013, 33(3): 49-52
作者姓名:黄莉
作者单位:宝鸡文理学院 计算机科学系,陕西宝鸡,721016
基金项目:宝鸡文理学院重点科研项目(ZK10163)
摘    要:目的针对当前常用的汉语句子相似度计算方法存在的问题,结合语言习得特点,提出了一种基于动态特征词的中文句子相似度计算方法。方法首先以特征词作为语块切分边界,提取左右语块信息,采用语义向量空间模型;然后计算2个句子对应的左右组块的相似度;最终将各组块的相似度量值加权求和作为2个句子的相似度。结果实验表明,提出的方法计算结果较为理想,与人工判断的相似度较为一致。结论基于动态特征词的中文句子相似度计算方法在常用句式中具有更好的效果。

关 键 词:句子相似度  特征词  语义相似度  语义向量

Computation method for Chinese sentence similarity based on dynamic feature words
HUANG Li. Computation method for Chinese sentence similarity based on dynamic feature words[J]. Journal of Baoji College of Arts and Science(Natural Science Edition), 2013, 33(3): 49-52
Authors:HUANG Li
Affiliation:HUANG Li (Dept. Computer Sci. , Baoji University of Arts and Sciences, Baoji 721016, Sbaanxi, China)
Abstract:Objective--To propose a computation method for Chinese sentence similarities based on dynamic feature words in combination with the feature of language acquisition because there is some shortcomings in the current similarity computation methods of Chinese sentence. Methods--First, the left and right chunks are extracted with the feature words as chunks segmentation boundary. Then the similarities between left and right chunks from the two sentences are calculated using a semantic vec- tor space model. Finally the overall sentence similarity is defined with a combination of these chunk similarities by the weighted parameter. Results--The experiments show that the proposed method with great precision is much close to the similarity of manual judgment, Conclusion--The similarity computation method of Chinese sentence based on dynamic feature words has better performance in common sentence pattern.
Keywords:sentence similarity  feature word  semantic similarity  semantic vector
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号