首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于最小生成树的多层次k-Means聚类算法及其在数据挖掘中的应用
引用本文:金晓民,张丽萍.基于最小生成树的多层次k-Means聚类算法及其在数据挖掘中的应用[J].吉林大学学报(理学版),2018,56(5):1187-1192.
作者姓名:金晓民  张丽萍
作者单位:1. 内蒙古大学 交通学院, 呼和浩特 010021;2. 内蒙古自治区桥梁检测与维修加固工程技术研究中心, 呼和浩特 010070;3. 内蒙古师范大学 计算机科学技术学院, 呼和浩特 010022
摘    要:针对传统聚类算法存在挖掘效率慢、 准确率低等问题, 提出一种基于最小生成树的多层次k-means聚类算法, 并应用于数据挖掘中. 先分析聚类样本的数据类型, 根据分析结果设计聚类准则函数; 再通过最小生成树对样本数据进行划分, 并选取初始聚类中心, 将样本的数据空间划分为矩形单元, 在矩形单元中对样本对象数据进行计算、 降序和选取, 得到有效的初始聚类中心, 减少数据挖掘时间. 实验结果表明, 与传统算法相比, 该算法可快速、 准确地挖掘数据, 且挖掘效率提升约50%.

关 键 词:最小生成树    多层次k-means聚类算法    数据挖掘  
收稿时间:2018-01-24

Multi level k-Means Clustering Algorithm Based onMinimum Spanning Tree and Its Application in Data Mining#br#
JIN Xiaomin,ZHANG Liping.Multi level k-Means Clustering Algorithm Based onMinimum Spanning Tree and Its Application in Data Mining#br#[J].Journal of Jilin University: Sci Ed,2018,56(5):1187-1192.
Authors:JIN Xiaomin  ZHANG Liping
Institution:1. Institute of Transportation, Inner Mongolia University, Hohhot 010021, China;2. Inner Mongolia Engineering Research Center of Testing and Strengthening for Bridges, Hohhot 010070, China;3. College of Computer Science and Technology, Inner Mongolia Normal University, Hohhot 010022, China
Abstract:Aiming at the problem of slow mining efficiency and lowaccuracy in traditional clustering algorithm, we proposed a multi level k-means clustering algorithm based on minimum spanning tree, and applied to datamining. Firstly, we analyzed the data types of the clustering samples and designed the clustering criterion function according to the analysis results. Secondly, we divided the sample data bythe minimum spanning tree, and selected the initial clustering center. The data space of the sample was divided into rectangular unit, the sample object data was calculated, descended and selected in the rectangular unit, the effective initial clustering center was obtained to reduce the time spent in data mining. The experimental results show that, compared with the traditional algorithm, the proposed method can quickly and accurately excavate the data, and the efficiency ofmining is increased by about 50%.
Keywords:minimum spanning tree  multi level k-means clustering algorithm  data mining
点击此处可从《吉林大学学报(理学版)》浏览原始摘要信息
点击此处可从《吉林大学学报(理学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号