首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于时间卷积网络的机器阅读理解
引用本文:林世平,陈 璐,陈开志,吴运兵,廖祥文.基于时间卷积网络的机器阅读理解[J].福州大学学报(自然科学版),2020,48(3):276-282.
作者姓名:林世平  陈 璐  陈开志  吴运兵  廖祥文
作者单位:福州大学数学与计算机科学学院,福州大学数学与计算机科学学院,福州大学数学与计算机科学学院,福州大学数学与计算机科学学院,福州大学数学与计算机科学学院
基金项目:国家自然科学基金项目 、中国科学院网络数据科学与技术重点实验室开放基金课题、模式识别国家重点实验室开放课题基金项目、福建省自然科学基金面上项目、 赛尔网络下一代互联网技术创新项目、北邮可信分布式计算与服务教育部重点实验室主任基金
摘    要:针对目前机器阅读理解任务中缺乏有效的上下文信息融合方式和丢失文本的整体信息等情况,提出基于时间卷积网络的机器阅读理解模型.首先将文本的单词转化成词向量并加入词性特征;接着通过时间卷积网络获取问题和文章的上下文表示;之后采用注意力机制来计算出问题感知的文章表示;最后由循环神经网络模拟推理过程得到多步预测结果,并用加权和的方式来综合结果得到答案.实验使用了SQuAD2.0数据集,在EM和F1值上相比基准实验分别提升了6.6%和8.1%,证明了该方法的有效性.

关 键 词:机器阅读理解  整体信息  时间卷积网络
收稿时间:2019/8/14 0:00:00
修稿时间:2019/10/12 0:00:00

Machine read comprehension based on temporal convolutional network
LIN Shiping,CHEN Lu,CHEN Kaizhi,WU Yunbing and LIAO Xiangwen.Machine read comprehension based on temporal convolutional network[J].Journal of Fuzhou University(Natural Science Edition),2020,48(3):276-282.
Authors:LIN Shiping  CHEN Lu  CHEN Kaizhi  WU Yunbing and LIAO Xiangwen
Institution:College of Mathematics and Computer Science, Fuzhou University,College of Mathematics and Computer Science, Fuzhou University,College of Mathematics and Computer Science, Fuzhou University,College of Mathematics and Computer Science, Fuzhou University,College of Mathematics and Computer Science, Fuzhou University
Abstract:Regarding most current studies in the task of machine read comprehension using recurrent neural network to encode text, lack of effective method for fusing the information of the above and below text, and the fact that the way of sequential processing over time may lose the overall information of the text, this paper proposes a model based on temporal convolutional network for machine read comprehension. The model first transforms words of text into word embeddings and adds part-of-speech features. Then the contextual representations of question and passage are obtained by applying temporal convolution network. Next the attention mechanism is used to get question-aware passage representations. Finally, the recurrent neural network simulates reasoning process to obtain the multi-step prediction results, and the answer is based on the weighted sum of the results.The experiment uses the SQuAD2.0 dataset, and the EM and F1 are 6.6% and 8.1% higher than the baseline experiments respectively on the development set, demonstrating the effectiveness of the proposed method.
Keywords:machine read comprehension  overall information  temporal convolutional network
本文献已被 CNKI 等数据库收录!
点击此处可从《福州大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《福州大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号