首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于XLNet+BiGRU+Att(Label)的文本分类模型
引用本文:刘柏霆,管卫利,李陶深.基于XLNet+BiGRU+Att(Label)的文本分类模型[J].广西科学院学报,2022,38(4):412-419.
作者姓名:刘柏霆  管卫利  李陶深
作者单位:广西大学计算机与电子信息学院, 广西南宁 530004;广西大学计算机与电子信息学院, 广西南宁 530004;南宁学院数字经济学院, 广西南宁 530299
基金项目:国家自然科学基金项目(61762010)和广西科技计划项目(桂科AD20297125)资助。
摘    要:传统的词向量嵌入模型,如Word2Vec、GloVe等模型无法实现一词多义表达;传统的文本分类模型也未能很好地利用标签词的语义信息。基于此,提出一种基于XLNet+BiGRU+Att(Label)的文本分类模型。首先用XLNet生成文本序列与标签序列的动态词向量表达;然后将文本向量输入到双向门控循环单元(BiGRU)中提取文本特征信息;最后将标签词与注意力机制结合,选出文本的倾向标签词,计算倾向标签词与文本向量的注意力得分,根据注意力得分更新文本向量。通过对比实验,本文模型比传统模型在文本分类任务中的准确率更高。使用XLNet作为词嵌入模型,在注意力计算时结合标签词能够提升模型的分类性能。

关 键 词:文本分类  XLNet  BiGRU  标签词  注意力机制
收稿时间:2022/4/15 0:00:00
修稿时间:2022/7/11 0:00:00

Text Classification Model Based on XLNet+BiGRU+Att (Label)
LIU Boting,GUAN Weili,LI Taoshen.Text Classification Model Based on XLNet+BiGRU+Att (Label)[J].Journal of Guangxi Academy of Sciences,2022,38(4):412-419.
Authors:LIU Boting  GUAN Weili  LI Taoshen
Institution:School of Computer, Electronics and Information, Guangxi University, Nanning, Guangxi, 530004, China;School of Computer, Electronics and Information, Guangxi University, Nanning, Guangxi, 530004, China;College of Digital Economics, Nanning University, Nanning, Guangxi, 530299, China
Abstract:Traditional word vector embedding models,such as Word2Vec and GloVe,cannot realize polysemy expression.Traditional text classification models also fail to make good use of the semantic information of label words.Based on this,a classification model based on XLNet+BiGRU+Att (Label) is proposed.Firstly,the dynamic word vector expression of text sequence and label sequence is generated by XLNet.Then,the text vector is input into the Bidirectional Gated Recurrent Unit (BiGRU) to extract the text feature information.At last,the label words are combined with the attention mechanism to select the tendency label words of the text,calculate the attention score of the tendency label words and the text vector,and update the text vector according to the attention score.Through comparative experiments,the model in this paper has higher accuracy than the traditional model in text classification tasks.Using XLNet as the word embedding model and combining label words in attention calculation can improve the classification performance of the model.
Keywords:text classification  XLNet  BiGRU  label word  attention mechanism
点击此处可从《广西科学院学报》浏览原始摘要信息
点击此处可从《广西科学院学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号