首页 | 本学科首页   官方微博 | 高级检索  
     检索      

分级与密度相结合的Web文本聚类算法
引用本文:林国平.分级与密度相结合的Web文本聚类算法[J].太原师范学院学报(自然科学版),2008,7(3):45-48.
作者姓名:林国平
作者单位:漳州师范学院,数学与信息科学系,福建,漳州,363000
摘    要:考虑到实验数据的大规模及样本数据形状的复杂性等特点,提出一种基于分级聚类与DBSCAN聚类相结合的HL-DBSCAN聚类算法,避免了DBSCAN的聚类算法较大的时间复杂度,适用性更广,更能体现一个聚簇的规律,提高分类精度.通过实验与结果分析,取得较好的聚类结果,证明了该算法在文本聚类处理中的可行性.

关 键 词:分级聚类  DBSCAN算法  Web文本分类

Algorithm of Web Text Classification Based on Hierarchical and Density Clustering
Lin Guoping.Algorithm of Web Text Classification Based on Hierarchical and Density Clustering[J].Journal of Taiyuan Normal University:Natural Science Edition,2008,7(3):45-48.
Authors:Lin Guoping
Institution:Lin Guoping (Department of Mathematics Information Science ,Zhangzhou Teachers University,Zhangzhou 363000,China)
Abstract:Due to the complexity of text classification. The DBSCAN algorithm is modified with hierarchical idea to overcome its thread limitation, which can only adapt to small spatial data structure so that its clustering result can be more widely used and reflect the character of clustering better. The modified algorithm can also increase classification accuracy. According to the result of experiments for HL-DBSCAN algorithm,it is proved that the clustering result is not bad. At the same time,it also indicates that HL-DBSCAN algorithm is feasible for text clustering miming.
Keywords:hierarchical clustering  DBSCAN algorithm  web text clustering
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号