首页 | 本学科首页   官方微博 | 高级检索  
     

噪声环境下复杂流形数据的势能层次聚类算法
引用本文:于晓飞,葛洪伟. 噪声环境下复杂流形数据的势能层次聚类算法[J]. 重庆邮电大学学报(自然科学版), 2018, 30(6): 848-854
作者姓名:于晓飞  葛洪伟
作者单位:江南大学 轻工过程先进控制教育部重点实验室,江苏 无锡 214122; 江南大学 物联网工程学院,江苏 无锡 214122,江南大学 轻工过程先进控制教育部重点实验室,江苏 无锡 214122; 江南大学 物联网工程学院,江苏 无锡 214122
基金项目:江苏省普通高校研究生科研创新计划项目(KYLX16_0781);江苏省普通高校研究生科研创新计划项目(KYLX16_0782);江苏省高校优势学科建设工程项目
摘    要:基于势能的快速凝聚层次聚类算法使用一种全新的相似性度量准则,可以更高效地得到聚类结果。针对该算法无法有效处理含噪声的复杂流形数据的缺陷,提出噪声环境下复杂流形数据的势能层次聚类算法。通过势能递增曲线识别噪声点,在新定义的势能最大、最小2层数据上进行自动聚类,以确定类簇的大体框架,并在此基础上对整个数据集进行层次聚类。人工数据集上的实验表明,新算法可以有效处理噪声环境下复杂流形数据;真实数据集上的实验表明,新算法具有更优的聚类效果。

关 键 词:聚类  PHA  势能分层  层次聚类  噪声识别
收稿时间:2017-11-15
修稿时间:2018-10-19

Hierarchical clustering algorithm based on potential in complex flow data sets with noise
YU Xiaofei and GE Hongwei. Hierarchical clustering algorithm based on potential in complex flow data sets with noise[J]. Journal of Chongqing University of Posts and Telecommunications, 2018, 30(6): 848-854
Authors:YU Xiaofei and GE Hongwei
Affiliation:Key Laboratory of Advanced Process Control for Light Industry, Ministry of Education, Jiangnan University, Wuxi 214122, P. R. China; School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, P. R. China and Key Laboratory of Advanced Process Control for Light Industry, Ministry of Education, Jiangnan University, Wuxi 214122, P. R. China; School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, P. R. China
Abstract:Potential-based hierarchical agglomerative (PHA) clustering uses a new similarity metric to get clustering results more efficiently. Aiming at the problem that PHA can not effectively deal with the complex structure data sets with noise, we proposed a hierarchical clustering algorithm based on potential in complex flow data sets with noise. Firstly, the noise points were identified by means of the potential increase curve; Secondly, the maximum and minimum layers of the data set based on potential were defined, and these two data layers were made to cluster automatically to determine the general framework of the clusters; Finally, clustering of the entire data set was hierarchied based on the precious clustering results. The experiments on artificial data sets show that the new algorithm can deal with the complex flow data sets with noise effectively. The experiments on real data sets show that the new algorithm can get the better clustering results.
Keywords:clustering   potential-based hierarchical agglomerative (PHA)   potential hierarchy   hierarchical agglomerative clustering   noise recognition
点击此处可从《重庆邮电大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《重庆邮电大学学报(自然科学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号