
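A Kind of Improvement for K-means Algorithm
Cite this article: LI Guang-ming, LI Liang, ZHANG Jian-gang. A Kind of Improvement for K-means Algorithm [J]. Journal of Chongqing Technology and Business University (Natural Science Edition), 2012, 29(8): 47-51.
Authors: LI Guang-ming  LI Liang  ZHANG Jian-gang
Affiliation: School of Computer Science and Engineering, Chongqing University of Technology, Chongqing 400054, China
Abstract: Data clustering is a powerful technique that groups objects with similar features into the same class, but not every clustering algorithm produces the same clustering result, and the result of the K-means algorithm depends heavily on the choice of its initial centers. This paper proposes a novel strategy for selecting the initial centers of K-means. The strategy is based on reverse nearest neighbor (RNN) search, which retrieves all points in a given data set whose nearest neighbor is a given query point. The initial cluster centers computed this way are found to be very close to the centers that the clustering algorithm reaches through iteration. The application of the proposed algorithm to K-means clustering is verified, and experimental results on several popular data sets show its advantages.

Keywords: clustering  nearest neighbor query  reverse nearest neighbor search  K-means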
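
The reverse nearest neighbor (RNN) query named in the abstract can be illustrated with a small brute-force sketch. It is not the authors' implementation: the abstract does not say how the RNN sets are turned into initial centers, so the code only shows the query itself. The function names, the Euclidean metric, and the restriction that the query point is itself a member of the data set are assumptions made for illustration.

```python
import numpy as np

def nearest_neighbor_indices(points):
    """For each point, return the index of its nearest *other* point."""
    # Pairwise Euclidean distances, shape (n, n).
    diffs = points[:, None, :] - points[None, :, :]
    dist = np.sqrt((diffs ** 2).sum(axis=-1))
    # A point must not count as its own nearest neighbor.
    np.fill_diagonal(dist, np.inf)
    return dist.argmin(axis=1)

def reverse_nearest_neighbors(points, query_index):
    """Return indices of all points whose nearest neighbor is the query point."""
    nn = nearest_neighbor_indices(points)
    return np.flatnonzero(nn == query_index)

# Hypothetical toy data: four points on a line.
pts = np.array([[0.0, 0.0], [1.0, 0.0], [1.5, 0.0], [10.0, 0.0]])
rnn_of_1 = reverse_nearest_neighbors(pts, 1)  # points 0 and 2 both have point 1 as nearest neighbor
rnn_of_3 = reverse_nearest_neighbors(pts, 3)  # no point has the outlier as nearest neighbor
```

In this toy data the outlier at x = 10 has an empty RNN set while the points in the dense region do not, which is the kind of density signal an RNN-based initialization could plausibly exploit.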

A Kind of Improvement for K-means Algorithm
LI Guang-ming,LI Liang,ZHANG Jian-gang.A Kind of Improvement for K-means Algorithm[J].Journal of Chongqing Technology and Business University:Natural Science Edition,2012,29(8):47-51.
Authors:LI Guang-ming  LI Liang  ZHANG Jian-gang
Institution:(School of Computer Science and Engineering,Chongqing University of Technology,Chongqing 400054,China)
Abstract: Data clustering is a powerful technique that classifies objects with similar data characteristics into the same class. However, not all clustering algorithms produce the same clustering results, and the results of the K-means algorithm depend largely on the selection of the initial cluster centers. This paper proposes a novel strategy for selecting the initial cluster centers of K-means. The strategy is based on reverse nearest neighbor (RNN) search, which retrieves all points in a given data set whose nearest neighbor is a given query point. The initial cluster centers calculated with this method turn out to be very close to the centers that the clustering algorithm reaches through iteration. The paper also verifies the application of the proposed algorithm to K-means clustering and uses experiments on several popular data sets to demonstrate its advantages.
Keywords: clustering  nearest neighbor query  reverse nearest neighbor search  K-means
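
The abstract's claim that K-means results depend on the initial centers can be seen in a minimal sketch of the standard Lloyd iteration that takes the initial centers as an argument. This is a generic illustration, not the paper's method; the function name `kmeans`, the toy data, and both sets of initial centers are hypothetical.

```python
import numpy as np

def assign_labels(points, centers):
    """Index of the closest center for every point (Euclidean distance)."""
    dist = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=-1)
    return dist.argmin(axis=1)

def kmeans(points, init_centers, max_iter=100):
    """Plain Lloyd iteration starting from caller-supplied initial centers."""
    centers = np.asarray(init_centers, dtype=float).copy()
    for _ in range(max_iter):
        labels = assign_labels(points, centers)
        new_centers = centers.copy()
        for j in range(len(centers)):
            members = points[labels == j]
            if len(members) > 0:  # keep the old center if a cluster is empty
                new_centers[j] = members.mean(axis=0)
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    labels = assign_labels(points, centers)
    # Sum of squared distances from each point to its assigned center.
    sse = float(((points - centers[labels]) ** 2).sum())
    return centers, labels, sse

# Hypothetical toy data: corners of a 4 x 1 rectangle, k = 2.
pts = np.array([[0.0, 0.0], [0.0, 1.0], [4.0, 0.0], [4.0, 1.0]])

# Initial centers near the left and right pairs give the compact clustering.
_, labels_a, sse_a = kmeans(pts, [[0.0, 0.5], [4.0, 0.5]])

# Initial centers on the bottom and top edges are already a fixed point,
# so the iteration stays at a much worse clustering.
_, labels_b, sse_b = kmeans(pts, [[2.0, 0.0], [2.0, 1.0]])
```

With the first initialization the sum of squared errors is 1.0; with the second it is 16.0, even though the data and the algorithm are identical. An initialization strategy that starts near the final centers avoids such poor fixed points and tends to need fewer iterations.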
This article is indexed in CNKI, Wanfang Data, and other databases.