首页 | 本学科首页   官方微博 | 高级检索  
     

一种新的密度加权粗糙K-均值聚类算法
引用本文:谢娟英,张琰,谢维信,高新波. 一种新的密度加权粗糙K-均值聚类算法[J]. 山东大学学报(理学版), 2010, 45(7): 1-6
作者姓名:谢娟英  张琰  谢维信  高新波
作者单位:陕西师范大学计算机科学学院,陕西,西安,710062;西安电子科技大学电子工程学院,陕西,西安,710071;陕西师范大学计算机科学学院,陕西,西安,710062;西安电子科技大学电子工程学院,陕西,西安,710071;深圳大学信息工程学院,广东,深圳,518060;西安电子科技大学电子工程学院,陕西,西安,710071
基金项目:中央高校基本科研业务费专项资金重点资助项目,陕西省自然科学基础研究计划项目 
摘    要:为了克服粗糙K-均值聚类算法初始聚类中心点随机选取,以及样本密度函数定义所存在的缺陷,基于数据对象所在区域的样本点密集程度,定义了新的样本密度函数,选择相互距离最远的K个高密度样本点作为初始聚类中心,克服了现有粗糙K-均值聚类算法的初始中心随机选取的缺点,从而使得聚类结果更接近于全局最优解。同时在类均值计算中,对每个样本根据定义的密度赋以不同的权重,得到不受噪音点影响的更合理的质心。利用UCI机器学习数据库的6组数据集,以及随机生成的带有噪音点的人工模拟数据集进行测试,证明本文算法具有更好的聚类效果,而且对噪音数据有很强的抗干扰性能。

关 键 词:聚类算法  粗糙K-均值  聚类中心  加权  密度
收稿时间:2010-04-02

A novel rough K-means clustering algorithm based on the weight of density
XIE Juan-ying,ZHANG Yan,XIE Wei-xin,GAO Xin-bo. A novel rough K-means clustering algorithm based on the weight of density[J]. Journal of Shandong University, 2010, 45(7): 1-6
Authors:XIE Juan-ying  ZHANG Yan  XIE Wei-xin  GAO Xin-bo
Affiliation:1. School of Computer Science, Shaanxi Normal University, Xi’an 710062, Shaanxi, China;
2. School of Electronic Engineering, Xidian University,  Xi’an 710071,  Shaanxi, China;
3. School of Information Engineering, Shenzhen University, Shenzhen 518060, Guangdong, China
Abstract: A novel rough K-means clustering algorithm was presented  based on the weight of exemplar density to overcome the drawback of selecting initial seeds randomly of available rough K-means algorithms. A new density function was defined for each sample according to the denseness of samples around it without any arbitrary parameter, and the top K samples with higher density and far away from each other were selected as initial centers of rough K-means clustering algorithm. Further more the new weight was defined for each exemplar according to the value of the new density function, so that the better could croids of each cluster could be calculated out without influenced by noisy data. Experiments on six UCI data sets and on synthetically geterated  data sets  with noise points proved that our algorithm got a better clustering result, and had a strong anti-interference performance for noise data.
 
Keywords:clustering algorithm   rough K-means   clustering center   weight   density
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《山东大学学报(理学版)》浏览原始摘要信息
点击此处可从《山东大学学报(理学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号