
An Efficient Yinyang k-Means Clustering Algorithm
Cite this article: LI Changming, ZHANG Hongchen, WANG Chao, LI Xiaoguang, LU Yang, QIAN Chaoyue. An Efficient Yinyang k-Means Clustering Algorithm[J]. Journal of Jilin University: Sci Ed, 2021, 59(6): 1455-1460. DOI: 10.13413/j.cnki.jdxblxb.2020406
Authors: LI Changming, ZHANG Hongchen, WANG Chao, LI Xiaoguang, LU Yang, QIAN Chaoyue
Affiliations: 1. Engineering Technology Research and Development Center, Changchun Guanghua University, Changchun 130033, China; 2. School of Electrical Information, Changchun Guanghua University, Changchun 130033, China; 3. School of Computer Science and Technology, Changchun University of Science and Technology, Changchun 130022, China
Received: 2020-12-08

Abstract: To address the low computational efficiency of the traditional Yinyang k-means algorithm, which does not exploit the structure of the data, we propose an efficient Yinyang k-means clustering algorithm. The algorithm decomposes the original data layer by layer according to data similarity and builds a full m-ary tree to store the data of each layer. Weighted data are constructed from the information stored in the leaf nodes of the tree, and a weighted Yinyang k-means algorithm is run on them to obtain converged centers. The traditional Yinyang k-means algorithm is then run on the original data, initialized with these converged centers, to further optimize the objective function value. Comparative experiments on five UCI data sets against k-means, traditional Yinyang k-means, and two other accelerated algorithms show that the proposed algorithm achieves a high speedup ratio, while its solution accuracy is essentially the same as that of traditional Yinyang k-means clustering.

Keywords: cluster analysis; Yinyang k-means algorithm; k-means algorithm; data weighting
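
The two-stage idea in the abstract (summarize the data by the leaves of a tree, cluster the weighted summaries, then refine on the full data) can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say how each tree node is split, so the sketch assumes a short m-way k-means split per node, and it substitutes a plain weighted Lloyd iteration for the weighted Yinyang k-means step. The function names `assign`, `weighted_kmeans`, `leaf_summaries`, and `two_stage_kmeans` and the parameters `m`, `depth`, and `seed` are hypothetical.

```python
import numpy as np


def assign(X, centers):
    """Index of the nearest center (squared Euclidean distance) for each row of X."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)


def weighted_kmeans(X, w, centers, n_iter=100):
    """Weighted Lloyd iteration; a stand-in for weighted Yinyang k-means (assumption)."""
    centers = np.array(centers, dtype=float)
    for _ in range(n_iter):
        labels = assign(X, centers)
        new_centers = centers.copy()
        for j in range(len(centers)):
            mask = labels == j
            if mask.any():  # keep the old center if a cluster becomes empty
                new_centers[j] = np.average(X[mask], axis=0, weights=w[mask])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers, assign(X, centers)


def leaf_summaries(X, m=2, depth=3, rng=None):
    """Split the data m ways per level (assumed rule); return leaf means and sizes."""
    rng = np.random.default_rng(0) if rng is None else rng
    nodes = [X]
    for _ in range(depth):
        children = []
        for node in nodes:
            if len(node) <= m:  # too small to split, so the tree may not stay full
                children.append(node)
                continue
            init = node[rng.choice(len(node), size=m, replace=False)]
            _, labels = weighted_kmeans(node, np.ones(len(node)), init, n_iter=10)
            children.extend(node[labels == j] for j in range(m) if (labels == j).any())
        nodes = children
    means = np.array([node.mean(axis=0) for node in nodes])
    sizes = np.array([len(node) for node in nodes], dtype=float)
    return means, sizes


def two_stage_kmeans(X, k, m=2, depth=3, seed=0):
    """Cluster the weighted leaf means first, then refine on the original data."""
    rng = np.random.default_rng(seed)
    means, sizes = leaf_summaries(X, m, depth, rng)
    init = means[rng.choice(len(means), size=k, replace=False)]  # needs >= k leaves
    coarse_centers, _ = weighted_kmeans(means, sizes, init)
    return weighted_kmeans(X, np.ones(len(X)), coarse_centers)
```

Calling `two_stage_kmeans(X, k=3)` on an `(n, d)` array returns the refined centers and a label per point. The first stage touches only the leaf summaries, so most of the iterations run on far fewer points than the original data; the refinement stage then starts from centers that are already close to a local optimum.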