
Research on automatic multi-document summarization based on concept co-occurrence graph
Cite this article: ZHOU Jin-hua, LIU Gui-quan, CHEN En-hong. Research on automatic multi-document summarization based on concept co-occurrence graph[J]. Journal of University of Science and Technology of China, 2009, 39(11).
Authors: ZHOU Jin-hua  LIU Gui-quan  CHEN En-hong
Affiliation: Department of Computer Science and Technology, University of Science and Technology of China, Hefei, Anhui 230027
Abstract: A concept co-occurrence graph model is proposed and applied to automatic multi-document summarization. The model is based on concept counting and uses WordNet as its semantic resource for word sense disambiguation and concept merging. It uses co-occurrence information between concepts to construct a concept co-occurrence graph and extract the topic concepts of the multi-document set, then builds a vector space model from those topic concepts and computes the importance of each sentence. Because the concepts are well generalized, the model can uncover topics hidden deep in the document set. Evaluation on the DUC2005 data set indicates that the method performs satisfactorily and can be used in practical applications.

Keywords: WordNet  concept counting  concept co-occurrence graph  multi-document summarization  natural language processing
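
The pipeline outlined in the abstract (co-occurrence graph, topic concepts, vector-space sentence scoring) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no formulas, so everything concrete here is an assumption. Sentences are assumed to arrive already reduced to lists of concept labels (the WordNet disambiguation and merging step is skipped), co-occurrence is counted per sentence, topic concepts are chosen by weighted degree, and sentences are scored by cosine similarity. All function names are hypothetical.

```python
from collections import Counter
from itertools import combinations
import math


def build_cooccurrence_graph(sentences):
    """Edge weight = number of sentences in which two concepts co-occur.

    Assumption: each sentence is already a list of concept labels.
    """
    graph = Counter()
    for concepts in sentences:
        for a, b in combinations(sorted(set(concepts)), 2):
            graph[(a, b)] += 1
    return graph


def topic_concepts(graph, k):
    """Return the k concepts with the highest weighted degree.

    Weighted degree is an assumed selection criterion, not one
    stated in the abstract. Ties are broken alphabetically.
    """
    degree = Counter()
    for (a, b), weight in graph.items():
        degree[a] += weight
        degree[b] += weight
    ranked = sorted(degree.items(), key=lambda kv: (-kv[1], kv[0]))
    return dict(ranked[:k])


def score_sentence(concepts, topics):
    """Cosine similarity between a sentence's concept counts and the topic weights."""
    counts = Counter(concepts)
    dot = sum(counts[c] * w for c, w in topics.items())
    norm_s = math.sqrt(sum(v * v for v in counts.values()))
    norm_t = math.sqrt(sum(w * w for w in topics.values()))
    if norm_s == 0 or norm_t == 0:
        return 0.0
    return dot / (norm_s * norm_t)


def rank_sentences(sentences, k=3):
    """Return sentence indices ordered from most to least important."""
    topics = topic_concepts(build_cooccurrence_graph(sentences), k)
    scores = [score_sentence(s, topics) for s in sentences]
    return sorted(range(len(sentences)), key=lambda i: (-scores[i], i))
```

A summary would then be assembled from the top-ranked sentences up to a length budget; how the paper handles redundancy between selected sentences is not stated in the abstract, so that step is left out.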
This article is indexed in Wanfang Data and other databases.