基于细粒度可解释矩阵的摘要生成模型 Abstractive Summarization Based on Fine-Grained Interpretable Matrix期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

基于细粒度可解释矩阵的摘要生成模型

引用本文：	王浩男,高扬,冯俊兰,胡珉,王惠欣,柏宇.基于细粒度可解释矩阵的摘要生成模型[J].北京大学学报(自然科学版),2021,57(1):23-30.

作者姓名：	王浩男高扬冯俊兰胡珉王惠欣柏宇

作者单位：	1. 北京理工大学计算机学院, 北京 100081 2. 中国移动通信研究院, 北京 100032 3. 北京市海量语言信息处理与云计算应用工程技术研究中心, 北京 100081

基金项目：	教育部-中国移动科研基金

摘要：	针对摘要模型中总结并解释长篇上下文信息存在的困难,提出一种基于细粒度可解释矩阵,先抽取再生成的摘要模型(fine-grained interpretable matrix,FGIM),提升长文本对显著度、更新性和相关度的可解释抽取能力,引导系统自动生成摘要.该模型通过一个句对判别(pair-wise)抽取器对文章内容进...
关键词：	生成式摘要可解释抽取中心度掩码矩阵可控生成
收稿时间：	2020-06-08
Abstractive Summarization Based on Fine-Grained Interpretable Matrix

WANG Haonan,GAO Yang,FENG Junlan,HU Min,WANG Huixin,BAI Yu.Abstractive Summarization Based on Fine-Grained Interpretable Matrix[J].Acta Scientiarum Naturalium Universitatis Pekinensis,2021,57(1):23-30.

Authors:	WANG Haonan GAO Yang FENG Junlan HU Min WANG Huixin BAI Yu

Institution:	1. School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081 2. China Mobile Research Institute, Beijing 100032 3. Beijing Engineering Research Center of High Volume Language Information Processing and Cloud Computing Applications, Beijing 100081

Abstract:	According to the great challenge of summarizing and interpreting the information of a long article in the summary model. A summary model (Fine-Grained Interpretable Matrix, FGIM), which is retracted and then generated, is proposed to improve the interpretability of the long text on the significance, update and relevance, and then guide to automatically generate a summary. The model uses a pair-wise extractor to compress the content of the article, capture the sentence with a high degree of centrality, and uses the compressed text to combine with the generator to achieve the process of generating the summary. At the same time, the interpretable mask matrix can be used to control the direction of digest generation at the generation end. The encoder uses two methods based on Transformer and BERT respectively. This method is better than the best baseline model on the benchmark text summary data set (CNN/DailyMail and NYT50). The experiment further builds two test data sets to verify the update and relevance of the abstract, and the proposed model achieves corresponding improvements in the controllable generation of the data set.

Keywords:	abstractive summarization interpretable extraction centrality mask matrix controllable
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《北京大学学报(自然科学版)》浏览原始摘要信息
	点击此处可从《北京大学学报(自然科学版)》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏