牛振东, 刘沙. 基于三层过滤的评价对象抽取[J]. 北京理工大学学报自然版, 2016, 36(11): 1154-1159. DOI: 10.15918/j.tbit1001-0645.2016.11.011
引用本文: 牛振东, 刘沙. 基于三层过滤的评价对象抽取[J]. 北京理工大学学报自然版, 2016, 36(11): 1154-1159. DOI: 10.15918/j.tbit1001-0645.2016.11.011
NIU Zhen-dong, LIU Sha. Opinion Targets Extraction with a Three-Level Filter[J]. Transactions of Beijing institute of Technology, 2016, 36(11): 1154-1159. DOI: 10.15918/j.tbit1001-0645.2016.11.011
Citation: NIU Zhen-dong, LIU Sha. Opinion Targets Extraction with a Three-Level Filter[J]. Transactions of Beijing institute of Technology, 2016, 36(11): 1154-1159. DOI: 10.15918/j.tbit1001-0645.2016.11.011

基于三层过滤的评价对象抽取

Opinion Targets Extraction with a Three-Level Filter

  • 摘要: 针对互联网中的产品评论信息,提出一种三层过滤的评价对象抽取方法.该方法采用一个自举式的抽取算法在评论文本中得到候选的评价对象和情感词;利用评价对象与情感词之间的关联度对候选词进行关联置信度计算,提取关联置信度高的评价对象以提高识别的准确率;引入一个不相关的平行领域对剩余的候选词进行领域置信度计算,挖掘低频的评价对象.3个公开数据集中的实验结果表明该方法能够显著地提高评价对象的识别效果.

     

    Abstract: A three-level filter method was proposed to extract the opinion targets for product reviews on the Internet. In the first level, a bootstrapping framework was adopted to extract candidate opinion targets and opinion words from opinion texts. In the second level, the association between the opinion target and opinion word was used to estimate the association confidence of every candidate opinion target and candidate opinion word. The opinion targets with high association confidence were extracted to improve recognition accuracy. In the third level, an uncorrelated domain was adopted to calculate the domain confidence of every opinion target in the rest set which was for mining the opinion targets of low frequency. The experimental results on three public datasets demonstrate the effectiveness of the proposed approach.

     

/

返回文章
返回