用不同语义单元度量的句子相似度计算 |
| |
引用本文: | 王东,熊世桓. 用不同语义单元度量的句子相似度计算[J]. 信阳师范学院学报(自然科学版), 2014, 0(1): 145-148 |
| |
作者姓名: | 王东 熊世桓 |
| |
作者单位: | ;1.贵州师范学院数学与计算机科学学院 |
| |
摘 要: | 提出了一种基于不同语义单元度量的句子相似度计算方法.将句子按词块分割为对应的公共词块和非公共词块,利用外部语义资源进行同义词替换和语义消歧处理.分别用词、词块和字为语义单元度量句子相似度,以不同的权重调节各语义单元对句子相似度的贡献.实验结果表明,该方法综合考虑的因素更加全面,有较高的准确率.
|
关 键 词: | 句子相似度 词块 公共词块 同义词词林 搭配词库 |
Sentence Similarity Computing with Different Semantic Unit Measure |
| |
Affiliation: | ,Mathematics and Computer Science Institute,Guizhou Normal College |
| |
Abstract: | A method of sentence similarity computing based on different semantic units was proposed. A sentence can be divided into corresponding public word blocks and non-public word blocks according to word blocks,and then synonym substitution and semantic disambiguation processing can be carried by using external semantic resource.Words,word blocks and characters were used as the semantic units to measure the sentence similarity and adjust the contribution of each semantic unit to the sentence similarity with different weights. The experimental results showed that this approach of overall evaluation factor was more comprehensive and higher accuracy can be achieved. |
| |
Keywords: | sentence similarity word block common word block tongyici Cilin collocation dictionary |
本文献已被 CNKI 等数据库收录! |
|