首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于细粒度可解释矩阵的摘要生成模型
引用本文:王浩男,高扬,冯俊兰,胡珉,王惠欣,柏宇.基于细粒度可解释矩阵的摘要生成模型[J].北京大学学报(自然科学版),2021,57(1):23-30.
作者姓名:王浩男  高扬  冯俊兰  胡珉  王惠欣  柏宇
作者单位:1. 北京理工大学计算机学院, 北京 100081 2. 中国移动通信研究院, 北京 100032 3. 北京市海量语言信息处理与云计算应用工程技术研究中心, 北京 100081
基金项目:教育部-中国移动科研基金
摘    要:针对摘要模型中总结并解释长篇上下文信息存在的困难,提出一种基于细粒度可解释矩阵,先抽取再生成的摘要模型(fine-grained interpretable matrix,FGIM),提升长文本对显著度、更新性和相关度的可解释抽取能力,引导系统自动生成摘要.该模型通过一个句对判别(pair-wise)抽取器对文章内容进...

关 键 词:生成式摘要  可解释抽取  中心度  掩码矩阵  可控生成
收稿时间:2020-06-08

Abstractive Summarization Based on Fine-Grained Interpretable Matrix
WANG Haonan,GAO Yang,FENG Junlan,HU Min,WANG Huixin,BAI Yu.Abstractive Summarization Based on Fine-Grained Interpretable Matrix[J].Acta Scientiarum Naturalium Universitatis Pekinensis,2021,57(1):23-30.
Authors:WANG Haonan  GAO Yang  FENG Junlan  HU Min  WANG Huixin  BAI Yu
Institution:1. School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081 2. China Mobile Research Institute, Beijing 100032

3. Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications, Beijing 100081

Abstract:According to the great challenge of summarizing and interpreting the information of a long article in the summary model. A summary model (Fine-Grained Interpretable Matrix, FGIM), which is retracted and then generated, is proposed to improve the interpretability of the long text on the significance, update and relevance, and then guide to automatically generate a summary. The model uses a pair-wise extractor to compress the content of the article, capture the sentence with a high degree of centrality, and uses the compressed text to combine with the generator to achieve the process of generating the summary. At the same time, the interpretable mask matrix can be used to control the direction of digest generation at the generation end. The encoder uses two methods based on Transformer and BERT respectively. This method is better than the best baseline model on the benchmark text summary data set (CNN/DailyMail and NYT50). The experiment further builds two test data sets to verify the update and relevance of the abstract, and the proposed model achieves corresponding improvements in the controllable generation of the data set.
Keywords:abstractive summarization  interpretable extraction  centrality  mask matrix  controllable  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《北京大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《北京大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号