首页 | 本学科首页   官方微博 | 高级检索  
     检索      

K近邻的自适应谱聚类快速算法
引用本文:范敏,王芬,李泽明,李志勇,张晓波.K近邻的自适应谱聚类快速算法[J].重庆大学学报(自然科学版),2015,38(6):147-152.
作者姓名:范敏  王芬  李泽明  李志勇  张晓波
作者单位:1. 重庆大学 自动化学院,重庆,400030;2. 国网重庆市电力公司 江北供电分公司,重庆,401147
基金项目:国家电网公司科技资助项目(SGZQJB00FZJS1400341),重庆市科技攻关资助项目(CSTC2012GG-YYJS40008)。
摘    要:谱聚类算法建立在谱图划分理论基础上,与传统的聚类算法相比,它具有能在任意形状的样本空间上聚类且收敛于全局最优解的优点。然而,谱聚类算法涉及如何选取合适的尺度参数σ构造相似度矩阵的问题。并且,在处理大规模数据集时,聚类的过程需要较大的时间和内存开销。研究从构造相似度矩阵入手,以传统NJW算法为基础,提出一种基于K近邻的自适应谱聚类快速算法FA-SC。该算法能自动确定尺度参数σ;同时,对输入数据集分块处理,并用基于K近邻的稀疏相似度矩阵保存样本信息,减少计算的内存开销,提高了运行速度。通过实验,与传统谱聚类算法比较,FA-SC算法在人工数据集和UCI数据集上能够取得更好的聚类效果。

关 键 词:谱聚类  K  近邻  稀疏矩阵  自适应  快速算法
收稿时间:2015/7/12 0:00:00

A fast algorithm for adaptive spectral clustering based on K-nearest neighbors
FAN Min,WANG Fen,LI Zeming,LI Zhiyong and ZHANG Xiaobo.A fast algorithm for adaptive spectral clustering based on K-nearest neighbors[J].Journal of Chongqing University(Natural Science Edition),2015,38(6):147-152.
Authors:FAN Min  WANG Fen  LI Zeming  LI Zhiyong and ZHANG Xiaobo
Institution:School of Automation, Chongqing University, Chongqing 400030, P.R.China;,School of Automation, Chongqing University, Chongqing 400030, P.R.China;,School of Automation, Chongqing University, Chongqing 400030, P.R.China;,Chongqing Jiangbei Branch of State Grid Corporation of China, Chongqing 401147, P.R.China and Chongqing Jiangbei Branch of State Grid Corporation of China, Chongqing 401147, P.R.China
Abstract:Based on spectral partition theory, spectral clustering algorithms are effective to solve the clustering of arbitrary sphere of sample spaces, and they can converge to global optimal solution. However, spectral clustering algorithms have to adopt the appropriate scaling parameter to calculate the whole similarity matrix, which may have a great impact on the clustering results. Moreover, when the number of data instances is large, computational complexity and memory use of the algorithm will greatly increase. So, we propose a fast algorithm for adaptive spectral clustering based on K-nearest neighbors, which can choose the scaling parameter automatically. Meantime, we divide the data set into different blocks and compute it separately. We also construct sparse matrix via retaining nearest neighbors to overcome the computational and the memory difficulties. Compared with traditional spectral clustering algorithms, experimental results show this algorithm can achieve better clustering effect on artificial datasets and UCI public databases.
Keywords:spectral clustering  K-nearest neighbors  sparse matrix  adaptive  fast algorithm
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《重庆大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号