首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于特征加权词向量的在线医疗评论情感分析
引用本文:高慧颖,公孟秋,刘嘉唯.基于特征加权词向量的在线医疗评论情感分析[J].北京理工大学学报,2021,41(9):999-1005.
作者姓名:高慧颖  公孟秋  刘嘉唯
作者单位:北京理工大学管理与经济学院,北京 100081
基金项目:国家自然科学基金资助项目(71972012)
摘    要:针对在线医疗评论文本具有行业专业性强、差异性大、不够规范等特点,提出一种基于特征加权词向量的在线医疗评论情感分析方法.利用Word2vec方法构建词向量模型,抽取情感词集合完善医疗服务领域情感词典,根据句法关系识别主题词与情感词的依存关系,引入期望交叉熵因子,建立特征加权词向量模型,分析在线医疗评论的情感倾向.实验结果表明扩充的医疗服务情感词典在分析性能上的准确率、召回率以及F1值均高于基础情感词典,引入期望交叉熵因子后,基于特征加权词向量的情感分析方法在SVM分类上表现出更好的效果,体现了其在在线医疗评论挖掘领域的良好效用. 

关 键 词:情感分析  在线医疗评论  特征加权词向量  情感词典  主题模型
收稿时间:2021/1/3 0:00:00

Sentiment Analysis of Online Healthcare Reviews Based on Feature Weighted Word Vector
GAO Huiying,GONG Mengqiu,LIU Jiawei.Sentiment Analysis of Online Healthcare Reviews Based on Feature Weighted Word Vector[J].Journal of Beijing Institute of Technology(Natural Science Edition),2021,41(9):999-1005.
Authors:GAO Huiying  GONG Mengqiu  LIU Jiawei
Institution:School of Management and Economics, Beijing Institute of Technology, Beijing 100081, China
Abstract:A sentiment analysis method of online healthcare reviews based on feature weighted word vector was proposed in view of the professional, diverse and less normative features of online healthcare reviews. The Word2vec method was used to construct the word vector model, and the sentiment word set was extracted to improve the sentiment lexicon in the field of healthcare service. The dependency between subject words and sentiment words was identified according to the syntactic relations. The expected cross entropy factor was introduced to establish a feature weighted word vector model to analyze the sentiment tendency of online healthcare reviews. The experimental results show that the accuracy, recall rate and F1 value of the expanded healthcare service sentiment lexicon are higher than those of the basic sentiment lexicon. After the introduction of the expected cross entropy factor, the sentiment analysis method based on the feature weighted word vector shows better effect in the SVM classification, which reflects its good utility in the online healthcare reviews mining.
Keywords:sentiment analysis  online healthcare reviews  feature weighted word vector  sentiment lexicon  topic model
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《北京理工大学学报》浏览原始摘要信息
点击此处可从《北京理工大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号