首页 | 本学科首页   官方微博 | 高级检索  
     检索      

多目标监督聚类GA研究
引用本文:索飞,张洪伟,邹书蓉.多目标监督聚类GA研究[J].成都大学学报(自然科学版),2013,32(1):58-60,63.
作者姓名:索飞  张洪伟  邹书蓉
作者单位:成都信息工程学院计算机学院,四川成都,610225;成都信息工程学院计算机学院,四川成都,610225;成都信息工程学院计算机学院,四川成都,610225
摘    要:提出了多目标监督聚类GA算法,即:根据样本的类标签有监督地将样本聚类,在每个类中根据样本属性的相似性有监督地聚成类簇.如果分属不同类标签的类簇出现相交,则相交类簇再次聚类,直到所有类簇均不相交.适应度矢量函数由类簇数和类内距离2个目标确定,类簇数和类簇中心由目标函数自动确定,从而类簇数和中心就不受主观因素的影响,并且保证了这2个关键要素的优化性质.预测分类时,删去单点类簇,并根据类簇号和离某个类簇中心距离的最近邻法则以及该类簇的类标签进行分类.算法模型采用C#实现,采用3个UCI数据集进行实例分析,实验结果表明,本算法优于著名的Native Bayes、Boost C4.5和KNN算法.

关 键 词:多目标GA  监督聚类  类标签  最近邻法则

Research of Multi-objective Supervised Clustering GA
SUO Fei , ZHANG Hongwei , ZOU Shurong.Research of Multi-objective Supervised Clustering GA[J].Journal of Chengdu University (Natural Science),2013,32(1):58-60,63.
Authors:SUO Fei  ZHANG Hongwei  ZOU Shurong
Institution:(College of Computer Science & Technology,Chengdu University of Information Technology,Chengdu 610225,China)
Abstract:This paper presents a new multi-objective supervised clustering genetic algorithm. Samples are supervisedly clustered into several classes by class labels. In each class, samples are supervisedly clustered into class clusters according to the similarity of the sample properties. If the class clusters which belong to different class labels intersect, these intersecting class clusters are clustered again into class clusters until all the class clusters don' t intersect. The fitness vector function is determined by the number of class clus- ters and within-class distance. The number and center of class clusters can be determined automatically by using the fitness vector function. The two key elements can be unaffected by subjective factors and have op- timization natures. During classification forcast, the single-point class cluster is deleted and then classifica- tion is done according to the class cluster number, the nearest neighbor rule and the class labels. The algo- rithm model is implemented with C #, using three UCI data sets as the experiment data. The experimental results indicate that this algorithm is better than Native Bayes, Boost C4.5 and KNN algorithms.
Keywords:multi-objective GA  supervised clustering  class label  nearest neighbor rule
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号