首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于随机森林和支持向量机的分类特征选择
引用本文:迟呈英,梁晓月.基于随机森林和支持向量机的分类特征选择[J].辽宁科技大学学报,2016(2):146-152.
作者姓名:迟呈英  梁晓月
作者单位:辽宁科技大学 软件学院,辽宁 鞍山,114051
摘    要:由于数据具有海量、高相关性和非线性的特点,所以如何选择原始数据的本质特征,是关系到能否有效提高问题分类器推广能力的关键问题。本文讨论了目前基于所有特征以及词袋和词序列袋的特征选择方法,提出了采用随机森林和支持向量机(SVM)相结合的方法来进行特征选择。实验证明,此方法能够有效地选择分类特征,从而提升问题分类的效率和精度。

关 键 词:支持向量机  随机森林  特征选择

Feature selection in question classification based on random forests and support vector machine
CHI Chengying,LIANG Xiaoyue.Feature selection in question classification based on random forests and support vector machine[J].Journal of University of Science and Technology Liaoning,2016(2):146-152.
Authors:CHI Chengying  LIANG Xiaoyue
Abstract:The key points to improve the generalization ability of question classifier is how to extract the es-sence and internal characteristics from the high scale,high correlation and nonlinear original data. The feature selection method based on all features,word bag and word sequence is discussed in this paper. A combination approach of random forest and support vector machine (SVM) is proposed for feature selection. Experiments show that this method is simple and effective in selection of classification features,and can improve the effi-ciency and accuracy of question classification.
Keywords:support vector machine  random forest  feature selection
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号