
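A Semi-supervised Classification Method Based on the Idea of Cross-Validation
Cite this article: ZHAO Jian-hua. A semi-supervised classification method based on the idea of cross-validation[J]. Journal of Southwest University of Science and Technology, 2014(1): 34-38, 48.
Author: ZHAO Jian-hua
Affiliations: [1] School of Computer Science, Northwestern Polytechnical University, Xi'an 710072, Shaanxi; [2] Department of Computer Science, Shangluo College, Shangluo 726000, Shaanxi
Funding: Supported by the Scientific Research Program of the Shaanxi Provincial Department of Education (12JK0748).
Abstract: To improve the effectiveness of semi-supervised classification, a semi-supervised classification method based on the idea of cross-validation (CV-S3VM) is proposed. Each unlabeled sample is given a pseudo-label, and the pseudo-labeled sample is added to the labeled sample set to take part in cross-validation; the label that minimizes the error of the SVM classifier is selected as the final label for that sample. The hidden information in the unlabeled samples is mined one sample after another in this way, which increases the number of labeled samples. A semi-supervised classification setting was simulated with UCI datasets, and the results show that CV-S3VM achieves a relatively high classification rate, with a more pronounced effect when few labeled samples are available.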

Keywords: machine learning; semi-supervised classification; cross-validation; support vector machine

A Semi-supervised Classification Algorithm Based on the Idea of Cross Validation
Institution: ZHAO Jian-hua (1. College of Computer Science and Technology, Northwestern Polytechnical University, Xi'an 710072, Shaanxi, China; 2. Department of Computer Science and Technology, Shangluo College, Shangluo 726000, Shaanxi, China)
Abstract: To improve the performance of semi-supervised classifiers, a semi-supervised classification algorithm, CV-S3VM, based on the idea of cross validation is proposed. Unlabeled samples are given candidate labels and added to the labeled sample set to participate in cross validation. The labels that minimize the error of the SVM classifier are selected as the final labels for the unlabeled samples. In this way the information embedded in the unlabeled samples is mined and the number of labeled samples is expanded. Finally, UCI datasets were used to simulate a semi-supervised classification experimental environment. The results show that CV-S3VM achieves a higher classification rate, and the effect is more obvious when few labeled samples are available.
Keywords: Machine learning; Semi-supervised classification; Cross validation; Support vector machine
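
The labeling loop described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the author's implementation: the abstract does not state the SVM kernel, the number of folds, or the order in which unlabeled samples are processed, so everything here is an assumption. In particular, `train_linear_svm` is a hypothetical stand-in (a bias-free linear SVM fitted by subgradient descent on the hinge loss, so features are assumed to be roughly centered), labels are assumed to be binary with values -1 and +1, and all function names and default parameters are invented for illustration.

```python
from typing import List, Sequence

Point = Sequence[float]


def train_linear_svm(X: List[Point], y: List[int], lam: float = 0.01,
                     lr: float = 0.1, epochs: int = 20) -> List[float]:
    """Stand-in SVM (assumption): bias-free linear SVM, hinge-loss subgradient descent."""
    w = [0.0] * len(X[0])
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * sum(wj * xj for wj, xj in zip(w, xi))
            # L2 shrinkage is applied at every step.
            w = [(1.0 - lr * lam) * wj for wj in w]
            # The hinge term contributes only when the margin is violated.
            if margin < 1.0:
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
    return w


def predict(w: List[float], x: Point) -> int:
    """Return +1 or -1 according to the sign of the linear score."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) >= 0.0 else -1


def cv_error(X: List[Point], y: List[int], k: int = 3) -> float:
    """k-fold cross-validation error; sample i goes to fold i % k. Needs >= 2 samples."""
    n = len(X)
    k = max(2, min(k, n))
    mistakes = 0
    for fold in range(k):
        train = [i for i in range(n) if i % k != fold]
        test = [i for i in range(n) if i % k == fold]
        w = train_linear_svm([X[i] for i in train], [y[i] for i in train])
        mistakes += sum(predict(w, X[i]) != y[i] for i in test)
    return mistakes / n


def cv_pseudo_label(X_lab: List[Point], y_lab: List[int], X_unlab: List[Point],
                    labels: Sequence[int] = (-1, 1), k: int = 3) -> List[int]:
    """Label unlabeled samples one at a time by minimum cross-validation error."""
    X_grow, y_grow = list(X_lab), list(y_lab)
    assigned = []
    for x in X_unlab:
        # Try each candidate pseudo-label and keep the one with the lowest CV error.
        best = min(labels, key=lambda c: cv_error(X_grow + [x], y_grow + [c], k))
        # The newly labeled sample joins the labeled set for later iterations.
        X_grow.append(x)
        y_grow.append(best)
        assigned.append(best)
    return assigned
```

Calling `cv_pseudo_label(X_lab, y_lab, X_unlab)` returns one label per unlabeled sample, in input order. Each sample costs one cross-validation run per candidate label, so the sketch retrains the classifier many times; that is acceptable for the small labeled sets the abstract targets but would need care on larger data.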
This article is indexed in CNKI, VIP, and other databases.