首页 | 本学科首页   官方微博 | 高级检索  
     检索      

一种频率增强的语句语义相似度计算
引用本文:廖志芳,邱丽霞,谢岳山,樊晓平.一种频率增强的语句语义相似度计算[J].湖南大学学报(自然科学版),2013,40(2):82-88.
作者姓名:廖志芳  邱丽霞  谢岳山  樊晓平
作者单位:中南大学软件学院;中南大学信息科学与工程学院;湖南财政经济学院
基金项目:国家科技支撑项目(2012BAH08B01);湖南省自然科学基金资助项目(12JJ3074);湖南省科技计划项目(2012KG3170)(2009FJ3053)
摘    要:目前,在基于HowNet进行语句语义相似度计算的算法中,没有考虑语句中的不同词语对语句之间相似度值的不同贡献程度,以致计算结果不理想.为了更好地解决上述缺陷,提出了一种频率增强语句语义相似度算法.该算法利用HowNet作为词典库,在同时考虑义原距离和义原深度的条件下,进行词语相似度计算;在此基础上算法进一步将词语在语料库中的频率函数作为权重值,引入至语句的语义相似度计算中,以降低高频率词语在语句相似度值中的比重.实验表明,改进的算法在语句相似度计算结果上与人们的主观判断更接近,结果更合理.

关 键 词:HowNet  义原树状结构  语料库  语义相似度

A Frequency Enhanced Algorithm of Sentence Semantic Similarity
LIAO Zhi-fang,QIU Li-xia,XIE Yue-shan,FAN Xiao-ping.A Frequency Enhanced Algorithm of Sentence Semantic Similarity[J].Journal of Hunan University(Naturnal Science),2013,40(2):82-88.
Authors:LIAO Zhi-fang  QIU Li-xia  XIE Yue-shan  FAN Xiao-ping
Institution:2,3(1.School of Software,Central South Univ,Changsha,Hunan 410002,China; 2.School of Information Science and Engineering,Central South Univ,Changsha,Hunan 410075,China; 3.Hunan College of Finance and Economics,Changsha,Hunan 410086,China)
Abstract:Sentence semantic similarity algorithms based on HowNet ignored the fact that different words have different contribution weight to sentence similarity value, and therefore, the similarity result is not quite reasonable. In order to solve this problem, we proposed an improved algorithm based on word frequency. The algorithm calculates the similarity between words based on HowNet, both considering the distance and the height of primitives. Then, a frequency function of words in corpus as a weight factor is embedded into the sentence semantic similarity algorithm, which reduces the proportion value that the high frequency words devote to sentence similarity calculation. The sentence semantic similarity experiment results show that the improved algorithm is much better in rationality as well as in matching with people''s subjective judgment.
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《湖南大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《湖南大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号