首页 | 本学科首页   官方微博 | 高级检索  
     检索      

快速鲁棒的密度基聚类算法
引用本文:侯越先,石陆魁,何丕廉,张莹.快速鲁棒的密度基聚类算法[J].天津大学学报(自然科学与工程技术版),2005,38(12):1091-1095.
作者姓名:侯越先  石陆魁  何丕廉  张莹
作者单位:天津大学电子信息工程学院,天津大学电子信息工程学院,河北工业大学计算机科学与软件学院 天津 300130,天津大学电子信息工程学院,天津大学管理学院,天津 300072,天津 300072,天津 300072,天津 300072
基金项目:天津市科技发展计划基金资助(04310941R) 天津市应用基础研究计划基金资助(05YFJMJC11700)
摘    要:为有效改善基于密度的聚类算法的聚类效果,提出了判定顺序聚类算法的聚类合理性的形式判据,简述了其神经生物学证据,并据此给出了可有效改善DBSCAN算法的聚类精度和时间效率的新算法DBSCANJZPoll.该算法首先以随机处理次序多次执行一个顺序依赖的子进程;再根据子进程的各次执行结果,由“合理聚类”的形式判据和简单的统计原则确定最终聚类结果.在聚类精度方面,仿真实验表明,DBSCANJZPoll可有效处理密度不均匀的样本集,聚类效果显著优于DBSCAN.在时间效率方面,理论分析表明,对于中高维样本集,DBSCANJZPoll的时间复杂性渐近于O(N),优于DBSCAN算法族中其他算法至少O(N log N)的复杂性;对于低维样本集的仿真也表明, DBSCANJZPoll具有相对优化的时间效率.

关 键 词:聚类  顺序无关性  盒索引
文章编号:0493-2137(2005)12-1091-05
收稿时间:2004-08-11
修稿时间:2004-08-112005-02-20

Improved Density-Based Algorithm for Robust Clustering
HOU Yue-xian,SHI Lu-kui, HE Pi-lian, ZHANG Ying.Improved Density-Based Algorithm for Robust Clustering[J].Journal of Tianjin University(Science and Technology),2005,38(12):1091-1095.
Authors:HOU Yue-xian  SHI Lu-kui  HE Pi-lian  ZHANG Ying
Institution:1. School of Electronic Information Engineering, Tianjin University, Tianjin 300072, China; 2. School of Computer Science and Engineering, Hebei University of Technology, Tianjin 300130, China; 3. School of Management, Tianjin University, Tianjin 300072, China
Abstract:A formal criterion to verify the validity of sequential clustering algorithms is presented. Based on the criterion, DBSCANJZPoll, an algorithmic variation of DBSCAN clustering algorithm, is proposed. DBSCANJZ-Poll performs an order-dependent clustering sub-procedure a few times with random processing order, and determines the last clustering results by means of a simple statistical principle, which is derived from the above formal criterion. Simulation results show that in the case of noisy data, the clustering precision of DBSCANJZPoll is superior to DBSCAN algorithm. Moreover, theoretical analysis indicates that, for high dimensionality data, the time complexity of DBSCANJZPoll is asymptotical to O(N) , which is superior to other algorithms in DBSCAN family. The simulation of low dimensionality data shows superior time efficiency of DBSCANJZPoll.
Keywords:clustering  order-independency  boxes-based index
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号