首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于K-means聚类和遗传算法的少数类样本采样方法研究
引用本文:杨永.基于K-means聚类和遗传算法的少数类样本采样方法研究[J].科学技术与工程,2010,10(10).
作者姓名:杨永
作者单位:大庆石油学院计算机与信息技术学院,大庆,163318
基金项目:黑龙江省教育厅科学技术研究项目
摘    要:传统的分类器对不均衡数据集的分类严重倾向于多数类.为了有效地提高不均衡数据集中少数类的分类性能,针对此问题提出了一种基于K-means聚类和遗传算法的少数类样本采样方法.通过K-means算法将少数类样本聚类分组,在每个聚类内使用遗传算法获取新样本并进行有效性验证,最后通过使用KNN和SVM分类器,在仿真实验中证明了方法的有效性.

关 键 词:K-means算法  聚类  遗传算法  不均衡数据集
收稿时间:1/7/2010 12:00:00 AM
修稿时间:3/9/2010 12:00:00 AM

The research of minority kind of sample sampling method based on K-means cluster and genetic algorithm
yangyong.The research of minority kind of sample sampling method based on K-means cluster and genetic algorithm[J].Science Technology and Engineering,2010,10(10).
Authors:yangyong
Abstract:The classification favors seriously to the most kinds when we use the traditional sorter to classify the imbalanced data set. In order to effectively enhance classified performance of the minority kind in the imbalanced data set, we proposed one kind minority kind of sample sampling method based on the K-means cluster and the genetic algorithm in view of this question. We used K-means algorithm to cluster and group the minority kind of sample, and in each cluster we use the genetic algorithm to gain the new sample and to carry on the valid confirmation. Finally, through using KNN and SVM sorter we proved the method validity in the simulation experiment.
Keywords:K-means algorithm  Cluster  Genetic algorithm  Imbalanced data set
本文献已被 万方数据 等数据库收录!
点击此处可从《科学技术与工程》浏览原始摘要信息
点击此处可从《科学技术与工程》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号