
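Short Text Classification Model Based on Improved Convolutional Neural Network
Cite this article: GAO Yunlong, WU Chuan, ZHU Ming. Short Text Classification Model Based on Improved Convolutional Neural Network[J]. Journal of Jilin University (Science Edition), 2020, 58(4): 923-930.
Authors: GAO Yunlong, WU Chuan, ZHU Ming
Institution: 1. Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033; 2. Key Laboratory of Airborne Optical Imaging and Measurement, Chinese Academy of Sciences, Changchun 130033
Funding: National Natural Science Foundation of China; Science and Technology Development Program of Jilin Province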
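Abstract: Building on the convolutional neural network, we propose a short text classification model based on an improved convolutional neural network. First, different encoding schemes map the short text to distributed representations in different spaces, numerical features of different granularities are extracted as the multi-channel input of the classification model, and concept features are extracted from a standard knowledge base as prior knowledge, which improves the semantic representation of the short text. Second, an autoencoding learning strategy is added to the fully connected layer, which further combines the numerical features on the basis of an approximate identity mapping to model the correlations within the data. Finally, a sparsity constraint based on relative entropy is added to the model, which reduces model complexity while improving generalization. Short text classification experiments on open-source datasets verify the effectiveness of the model.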

Keywords: convolutional neural network; short text; concept distributed representation; sparsity; autoencoding
Received: 2019-11-13

Short Text Classification Model Based onImproved Convolutional Neural Network
GAO Yunlong,WU Chuan,ZHU Ming.Short Text Classification Model Based onImproved Convolutional Neural Network[J].Journal of Jilin University: Sci Ed,2020,58(4):923-930.
Authors:GAO Yunlong  WU Chuan  ZHU Ming
Institution: 1. Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun 130033, China;
2. Key Laboratory of Airborne Optical Imaging and Measurement, Chinese Academy of Sciences, Changchun 130033, China
Abstract: We propose a short text classification model based on an improved convolutional neural network. First, different encoding methods are used to map the short text to distributed representations in different spaces, and numerical features of different granularities are extracted as the multi-channel input of the classification model; concept features are also extracted from a standard knowledge base as prior knowledge to improve the semantic representation of the short text. Second, an autoencoding learning strategy is added to the fully connected layer, which, starting from an approximate identity mapping, further combines the numerical features to model the correlations within the data. Finally, the principle of relative entropy is used to impose a sparsity constraint on the model, which reduces its complexity and improves its generalization ability. Short text classification experiments on an open-source dataset verify the effectiveness of the proposed model.
Keywords: convolutional neural network; short text; concept distributed representation; sparsity; autoencoding
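
The abstract says a sparsity constraint is imposed using relative entropy but gives no formula. The following is a minimal sketch, assuming the common sparse-autoencoder formulation in which the mean activation of each hidden unit is pushed toward a small target value by a Kullback-Leibler penalty. The function name `kl_sparsity_penalty`, the target value `rho`, and the use of NumPy are assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def kl_sparsity_penalty(activations, rho=0.05, eps=1e-8):
    """Sum over hidden units of KL(rho || rho_hat_j) for Bernoulli variables.

    activations: array of shape (batch, hidden) with values in (0, 1),
                 e.g. sigmoid outputs of a fully connected layer.
    rho:         assumed target mean activation (hypothetical default).
    eps:         clipping margin that keeps the logarithms finite.
    """
    activations = np.asarray(activations, dtype=float)
    # Mean activation of each hidden unit over the batch.
    rho_hat = np.clip(activations.mean(axis=0), eps, 1.0 - eps)
    # Relative entropy between Bernoulli(rho) and Bernoulli(rho_hat_j).
    kl = (rho * np.log(rho / rho_hat)
          + (1.0 - rho) * np.log((1.0 - rho) / (1.0 - rho_hat)))
    return float(kl.sum())


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    hidden = rng.uniform(0.0, 1.0, size=(8, 4))  # made-up activations
    print(kl_sparsity_penalty(hidden))
```

In training, a penalty of this kind would typically be added to the classification loss with a weighting coefficient; the abstract does not specify that coefficient or the target activation.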
This article is indexed in CNKI, Wanfang Data, and other databases.