首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于密度峰值剪枝后的最短路径聚类算法
引用本文:胡恩祥,汪春雨,潘美芹.基于密度峰值剪枝后的最短路径聚类算法[J].应用科学学报,2020,38(5):792-802.
作者姓名:胡恩祥  汪春雨  潘美芹
作者单位:1. 上海外国语大学 国际工商管理学院, 上海 201600;2. 华东师范大学 计算机科学与技术学院, 上海 200062
基金项目:上海外国语大学规划项目基金(No.2019114009)资助
摘    要:聚类是通过数据标签或者属性,将一系列经验数据按照相似性或者相近性进行归类.基于密度属性展开的聚类算法,主要聚焦在聚类中心的确定和剩余点如何分配的问题上展开讨论.针对基于密度峰值的可训练最短路径算法,通过密度峰值确定聚类中心,提出使用截断阈值、对路径图进行剪枝的算法改进.然后基于最短路径法对剩余点进行全局分配.实验结果证明,在保持聚类精度的同时,有效地提升了算法执行效率.

关 键 词:聚类  密度峰值  最短路径法  路径剪枝  
收稿时间:2020-05-25

Clustering by Pruning Paths Based on Shortest Paths from Density Peaks
HU Enxiang,WANG Chunyu,PAN Meiqin.Clustering by Pruning Paths Based on Shortest Paths from Density Peaks[J].Journal of Applied Sciences,2020,38(5):792-802.
Authors:HU Enxiang  WANG Chunyu  PAN Meiqin
Institution:1. School of Business and Management, Shanghai International Studies University, Shanghai 201600, China;2. School of Computer Science and Technology, East China Normal University, Shanghai 200062, China
Abstract:Clustering is to classify multiple empirical data according to their similarity or proximity based on data labels and properties. For the clustering algorithm based on the density peaks, it mainly focuses on the determination of the clustering center and how to allocate the remaining points. In this paper, according to a trainable clustering algorithm based on shortest paths to density peaks, the clustering center is determined by the density peaks. We propose that using a cutoff threshold and pruning the path graph to improve the algorithm. The remaining points are allocated globally based on the shortest path method. It is proved that the algorithm can significantly improve the efficiency while maintaining the clustering accuracy.
Keywords:clustering  density peak  shortest path method  pruning path  
本文献已被 CNKI 等数据库收录!
点击此处可从《应用科学学报》浏览原始摘要信息
点击此处可从《应用科学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号