
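Research on Optimizing the Number of Clusters in the K-means Algorithm
Cite this article: HAN Ling-Bo. Research on Optimizing the Number of Clusters in the K-means Algorithm[J]. Journal of Sichuan University of Science & Engineering (Natural Science Edition), 2012, 0(2): 77-80
Author: HAN Ling-Bo
Affiliation: Theory and Information Office, Party School of the CPC Zhanjiang Municipal Committee
Funding: Guangxi Science Foundation (0640067); Guangxi Graduate Education Innovation Program (2007106020812M73)
Abstract: In the traditional K-means clustering algorithm, the number of clusters K must be specified in advance. In practice, however, K is hard to determine precisely, and whether K is chosen reasonably directly affects the quality of the K-means result. To address this shortcoming, an algorithm for optimizing the number of clusters is proposed. Following the basic clustering principle that within-cluster similarity should be maximal and within-cluster dissimilarity minimal, while between-cluster dissimilarity should be maximal and between-cluster similarity minimal, a distance evaluation function F(S,K) is constructed as the test function for the optimal number of clusters. A corresponding mathematical model is established, and simulation experiments further verify the effectiveness of the new algorithm.

Keywords: K-means algorithm; number of clusters; distance cost function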

Optimization Study on Class Number of K-means Algorithm
HAN Ling-Bo. Optimization Study on Class Number of K-means Algorithm[J]. Journal of Sichuan University of Science & Engineering (Natural Science Edition), 2012, 0(2): 77-80
Author: HAN Ling-Bo
Affiliation: Department of Theory and Information, Zhanjiang Party Institute, Zhanjiang 524032, China
Abstract: In the traditional K-means algorithm, the class number must be fixed in advance. In practice, however, it cannot be determined clearly or easily. Whether the class number is well chosen directly affects the performance of the K-means algorithm. To address this defect, an improved algorithm is proposed. Following the basic clustering principles that within-class similarity should be maximal and within-class difference minimal, while inter-class difference should be maximal and inter-class similarity minimal, a distance cost function F(S,K) is proposed for determining the optimal class number. A corresponding mathematical model is set up, and experimental results further verify the effectiveness of the new algorithm.
Keywords: K-means algorithm; clustering center; distance cost
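
The abstract names a distance cost function F(S,K) but does not give its form. As a rough illustration only, the sketch below assumes one plausible form: F(S,K) = L + D, where L is the sum of distances from each cluster center to the global mean (between-cluster term) and D is the sum of distances from each sample to its own cluster center (within-cluster term), with the chosen K being the one that minimizes F. That form, the sqrt(n) upper bound on K, the farthest-point seeding, and every function name here are assumptions made for this example, not details taken from the paper.

```python
import math


def farthest_point_init(points, k):
    """Deterministic seeding (an assumption of this sketch): start from the
    first point, then keep adding the point whose nearest chosen center is
    farthest away."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    return centers


def assign(points, centers):
    """Index of the nearest center for every point."""
    return [min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))
            for p in points]


def kmeans(points, k, max_iter=100):
    """Plain Lloyd iteration; returns (centers, labels)."""
    centers = farthest_point_init(points, k)
    for _ in range(max_iter):
        labels = assign(points, centers)
        updated = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                updated.append(tuple(sum(x) / len(members) for x in zip(*members)))
            else:
                updated.append(centers[j])  # leave an empty cluster's center in place
        if updated == centers:
            break
        centers = updated
    return centers, assign(points, centers)


def distance_cost(points, centers, labels):
    """Assumed form of F(S,K): between-cluster term L plus within-cluster term D."""
    mean = tuple(sum(x) / len(points) for x in zip(*points))
    between = sum(math.dist(c, mean) for c in centers)
    within = sum(math.dist(p, centers[lab]) for p, lab in zip(points, labels))
    return between + within


def choose_k(points, k_max=None):
    """Evaluate F for K = 1..k_max and return (best K, all scores)."""
    if k_max is None:
        k_max = max(1, math.isqrt(len(points)))  # assumed heuristic: K <= sqrt(n)
    scores = {}
    for k in range(1, k_max + 1):
        centers, labels = kmeans(points, k)
        scores[k] = distance_cost(points, centers, labels)
    return min(scores, key=scores.get), scores


if __name__ == "__main__":
    # Three small, well-separated groups of 2-D points (made-up demo data).
    demo = [(cx + dx, cy + dy)
            for cx, cy in [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
            for dx in (-0.3, 0.0, 0.3)
            for dy in (-0.3, 0.0, 0.3)]
    best, scores = choose_k(demo)
    print(best, scores)
```

Under this assumed form, adding a cluster lowers the within-cluster term but raises the between-cluster term, so F has a minimum at some intermediate K rather than decreasing monotonically the way the within-cluster sum alone would.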
This article is indexed in CNKI and other databases.