首页 | 本学科首页   官方微博 | 高级检索  
     检索      

密集池化连接和短语注意力下的文本分类算法
引用本文:黄卫春,陶自强,熊李艳.密集池化连接和短语注意力下的文本分类算法[J].科学技术与工程,2021,21(17):7193-7199.
作者姓名:黄卫春  陶自强  熊李艳
作者单位:华东交通大学软件学院, 南昌330013;华东交通大学信息学院, 南昌330013
基金项目:国家自然科学基金项目(面上项目,重点项目,重大项目), 江西省教育厅科学技术研究项目
摘    要:为了解决在文本分类中神经网络训练时产生的梯度消失、特征信息丢失以及注意力机制短语维度组合不匹配的问题,提出一种基于密集池化连接和短语注意力机制的文本分类算法.首先,通过密集池化连接中的残差网络部分进行特征提取,可有效缓解梯度消失问题;其次,通过池化层复用重要特征,改善特征信息丢失问题;最后,通过改进常规注意力机制,提出短语注意力机制,可灵活得到不同阶短语之间的联系,解决常规注意力机制短语维度不匹配问题.结果表明,该模型在对比模型中取得了最好的效果,在相同的新闻数据集中准确率可达92.7%,同时还对3个对比模型的收敛性和分类准确性进行分析,可见改进后的模型可以有效缓解梯度消失,并且解决短语维度组合不匹配问题,从而提高了分类准确性.

关 键 词:文本分类  密集池化连接  特征提取  注意力机制
收稿时间:2020/10/6 0:00:00
修稿时间:2021/3/31 0:00:00

Text classification based on dense-pooling connection and phrase attention
Huang Weichun,Tao Ziqiang,Xiong Liyan.Text classification based on dense-pooling connection and phrase attention[J].Science Technology and Engineering,2021,21(17):7193-7199.
Authors:Huang Weichun  Tao Ziqiang  Xiong Liyan
Institution:School of Software,East China Jiaotong University; Department of Information,East China Jiaotong University
Abstract:In order to solve problems for the disappearance of gradients, the deficiency of text feature and the mismatch of extracting phrase features in attention mechanism during the training of neural network in text classification, a new method base on dense-pool connection and phrase attention mechanism is proposed. Firstly, the method is used to extracting features while alleviating the gradient disappearance problem through the residual network and reuse important features through dense pooling connection. Then, the phrase attention mechanism is used to solve the problem of phrase dimension mismatch in the traditional attention mechanism. Finally, the results show that the accuracy of the model can achieve 92.7% in the AG news dataset for all variants. In addition, the convergence and classification accuracy of three comparison models are analyzed in different hyperparameters. It is concluded that the improved model can effectively alleviate the disappearance of gradients and solve the problem of phrase feature extraction, thereby improving the classification accuracy.
Keywords:text classification  dense-pool connection  feature extraction  attention mechanism
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《科学技术与工程》浏览原始摘要信息
点击此处可从《科学技术与工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号