首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于三层过滤的评价对象抽取
引用本文:牛振东,刘沙.基于三层过滤的评价对象抽取[J].北京理工大学学报,2016,36(11):1154-1159.
作者姓名:牛振东  刘沙
作者单位:北京理工大学计算机科学技术学院,北京 100081;北京市海量语言信息处理与云计算应用工程技术研究中心,北京 100081;北京理工大学计算机科学技术学院,北京,100081
基金项目:国家自然科学基金资助项目(61370137)
摘    要:针对互联网中的产品评论信息,提出一种三层过滤的评价对象抽取方法.该方法采用一个自举式的抽取算法在评论文本中得到候选的评价对象和情感词;利用评价对象与情感词之间的关联度对候选词进行关联置信度计算,提取关联置信度高的评价对象以提高识别的准确率;引入一个不相关的平行领域对剩余的候选词进行领域置信度计算,挖掘低频的评价对象.3个公开数据集中的实验结果表明该方法能够显著地提高评价对象的识别效果. 

关 键 词:评价对象抽取  情感词  关联置信度  领域置信度
收稿时间:2014/8/20 0:00:00

Opinion Targets Extraction with a Three-Level Filter
NIU Zhen-dong and LIU Sha.Opinion Targets Extraction with a Three-Level Filter[J].Journal of Beijing Institute of Technology(Natural Science Edition),2016,36(11):1154-1159.
Authors:NIU Zhen-dong and LIU Sha
Institution:1. School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China;2. Beijing Engineering Research Center of Massive Language Information Processing and Cloud Computing Application, Beijing 100081, China
Abstract:A three-level filter method was proposed to extract the opinion targets for product reviews on the Internet. In the first level, a bootstrapping framework was adopted to extract candidate opinion targets and opinion words from opinion texts. In the second level, the association between the opinion target and opinion word was used to estimate the association confidence of every candidate opinion target and candidate opinion word. The opinion targets with high association confidence were extracted to improve recognition accuracy. In the third level, an uncorrelated domain was adopted to calculate the domain confidence of every opinion target in the rest set which was for mining the opinion targets of low frequency. The experimental results on three public datasets demonstrate the effectiveness of the proposed approach.
Keywords:opinion targets extraction  opinion word  association confidence  domain confidence
本文献已被 万方数据 等数据库收录!
点击此处可从《北京理工大学学报》浏览原始摘要信息
点击此处可从《北京理工大学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号