首页 | 本学科首页   官方微博 | 高级检索  
     检索      

融入情感信息词向量的评论文本情感分析方法
引用本文:吕妹园,张永健,张永强,孙胜娟.融入情感信息词向量的评论文本情感分析方法[J].河北科技大学学报,2021,42(4):380-388.
作者姓名:吕妹园  张永健  张永强  孙胜娟
作者单位:河北工程大学信息与电气工程学院,河北邯郸 056107
基金项目:河北省创新能力提升计划项目(19456003D)
摘    要:为了解决分布式词表示方法因忽略词语情感信息导致情感分类准确率较低的问题,提出了一种融入情感信息加权词向量的情感分析改进方法。依据专属领域情感词典构建方法,结合词典和语义规则,将情感信息融入到TF-IDF算法中,利用Word2vec模型得到加权词向量表示方法,并运用此方法对采集到的河北省旅游景点的评论文本与对照组进行对比实验。结果表明,与基于分布式词向量表示的情感分析方法相比,采用融入情感信息加权词向量的改进方法进行情感分析,积极文本的准确率提高了6.1%,召回率提高了6.6%,F值达到了90.3%;消极评论文本的准确率提高了6.0%,召回率提高了7.2%,F值达到了89.6%。因此,融入情感信息加权词向量的情感分析改进方法可以有效提高评论文本情感分析的准确率,为用户获得更为准确的评论观点提供参考。

关 键 词:自然语言处理  语义规则  情感信息  TF-IDF  Word2vec  加权词向量  情感分析
收稿时间:2021/3/25 0:00:00
修稿时间:2021/6/11 0:00:00

Sentiment analysis method of comment text based on word vector with sentiment information
LYU Meiyuan,ZHANG Yongjian,ZHANG Yongqiang,SUN Shengjuan.Sentiment analysis method of comment text based on word vector with sentiment information[J].Journal of Hebei University of Science and Technology,2021,42(4):380-388.
Authors:LYU Meiyuan  ZHANG Yongjian  ZHANG Yongqiang  SUN Shengjuan
Abstract:In order to solve the problem of low accuracy of sentiment classification caused by neglecting the sentiment information of words in distributed word representation method,an improved sentiment analysis method incorporating weighted word vectors of sentiment information was proposed.According to the exclusive domain sentiment dictionary,combined with the dictionary and semantic rules,the sentiment information is integrated into the TF-IDF algorithm,and the weighted word vector representation method is obtained by using word2vec model.The method is used to compare the collected comments of tourist attractions in Hebei Province with the control group.The results show that compared with the sentiment analysis method based on distributed word vector representation,the accuracy and recall rate of positive text are increased by 61% and 66%,and the F value reached 903%,the accuracy and recall rate of negative text are increased by 60% and 72%,and the F value reached 896% by using the improved method of sentiment analysis integrated with sentiment information weighted word vector.Therefore,the improved method of sentiment analysis integrated with sentiment information weighted word vector can effectively improve the accuracy of sentiment analysis of comment text,and provide valuable reference for users to obtain more accurate comments.
Keywords:natural language processing  semantic rules  sentiment information  TF-IDF  Word2vec  weighted word vector  [JP]sentiment analysis
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《河北科技大学学报》浏览原始摘要信息
点击此处可从《河北科技大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号