首页 | 本学科首页   官方微博 | 高级检索  
     检索      

DBSCAN算法在高性能计算中心用户分类的应用研究
引用本文:徐海啸,麻婧,吴旗.DBSCAN算法在高性能计算中心用户分类的应用研究[J].吉林大学学报(信息科学版),2013,31(5):528-534.
作者姓名:徐海啸  麻婧  吴旗
作者单位:吉林大学计算机科学与技术学院,长春130012;吉林大学高性能计算中心,长春130012;吉林大学计算机科学与技术学院,长春,130012
基金项目:大学生创新实验国家级基金资助项目(2011A53101)
摘    要:为提高集群资源使用效率, 管理员需要对用户进行分类, 从而对不同用户提出资源使用策略。DBSCAN(Density Based Spatial Clustering of Applications with Noise)聚类算法可对用户进行分类, 但对初始参数敏感。为此, 提出改进算法, 首先将密度进行层次划分, 由此得出各层次的密度阈值, 在每种阈值下采用DBSCAN算法, 解决全局参数问题。在此基础上, 创新地使用一个直接可达距离排序队列, 将排序信息作为可变参数, 减小初始参数对结果的影响。通过高性能计算中心用户数据的实例验证了其可行性。实验结果表明, 改进后的算法提高了用户分类的准确性和全面性。

关 键 词:聚类分析  DBSCAN算法  高性能计算中心  用户分类  数据挖掘
收稿时间:2013-02-19

Application Research of DBSCAN Algorithm Based on High-Performance Computing Center Users Classification
XU Hai-xiao;MA Jing;WU Qi.Application Research of DBSCAN Algorithm Based on High-Performance Computing Center Users Classification[J].Journal of Jilin University:Information Sci Ed,2013,31(5):528-534.
Authors:XU Hai-xiao;MA Jing;WU Qi
Institution:a. College of Computer Science and Technology; b. High Performance Computing Center, Jilin University, Changchun 130012, China
Abstract:To enhance service efficiency on cluster resource, administrator needs to make classification of users, and provide various strategies on resource utilization to different users. DBSCAN (Density Based Spatial Clustering of Applications with Noise) algorithm can achieve usersclassification, but the initial parameters are very sensitive. The improved algorithm classifies the level of density firstly, then gets the densitythreshold of each level, and uses DBSCAN under each threshold which solves the problem of global parameters. It uses a sorted queue of directly accessible distance as an innovation, makes the sorting information as variable parameter to decrease the influence of initial parameter. The algorithm has verified its feasibility through example data of HPC users. The experimental result demonstrates that this improved algorithm can achieve a more accurate and comprehensive user classification.
Keywords:clustering analysis  density based spatial clustering of applications with noise (DBSCAN)  high-performance computing center  users classification  data mining  
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《吉林大学学报(信息科学版)》浏览原始摘要信息
点击此处可从《吉林大学学报(信息科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号