首页 | 本学科首页   官方微博 | 高级检索  
     检索      

结合主动学习和自动标注的评价对象抽取方法
引用本文:朱珠,李寿山,戴敏,周国栋.结合主动学习和自动标注的评价对象抽取方法[J].山东大学学报(理学版),2015,50(7):38-44.
作者姓名:朱珠  李寿山  戴敏  周国栋
作者单位:苏州大学自然语言处理实验室, 江苏 苏州 215006
基金项目:国家自然科学基金资助项目
摘    要:提出了结合主动学习和自动标注的评价对象抽取方法。具体实现过程中,首先,利用少量的已标注样本训练分类器,对非标注样本进行测试,获取自动标注结果及其置信度:其次,通过置信度计算每个样本的整体置信度,挑选出低置信度即不确定性高的样本待标注:最后,对待标注样本中置信度低的词语进行人工标注,而置信度高的部分则采用自动标注结果。实验表明,该方法可以在确保抽取性能的同时有效地减小人工标注语料的开销。

关 键 词:情感分析  评价对象抽取  主动学习  自动标注  
收稿时间:2015-03-03

Opinion target extraction with active-learning and automatic annotation
ZHU Zhu,LI Shou-shan,DAI Min,ZHOU Guo-dong.Opinion target extraction with active-learning and automatic annotation[J].Journal of Shandong University,2015,50(7):38-44.
Authors:ZHU Zhu  LI Shou-shan  DAI Min  ZHOU Guo-dong
Institution:Natural Language Processing Lab, Soochow University, Suzhou 215006, Jiangsu, China
Abstract:An opinion target extraction method combined active-learning and automatic annotation is introduced. Firstly, the results of automatically annotation with the confidence are obtained by using a few of labeled corpus to train the classifier to test the unlabeled samples: secondly, the samples of low confidence is annotated by calculating the confidence of every sample: finally, the words of low confidence in the selected samples is annotated manually, while the others are adopted the results of automatic annotation. The empirical results demonstrate that the proposed method effectively reduces the annotation cost and achieves good performance on opinion target extraction.
Keywords:sentiment analysis  opinion target extraction  active-learning  automatic annotation
点击此处可从《山东大学学报(理学版)》浏览原始摘要信息
点击此处可从《山东大学学报(理学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号