首页 | 本学科首页   官方微博 | 高级检索  
     检索      

CFSBC: Clustering in High-Dimensional Space Based on Closed Frequent Item Set
作者姓名:NIWei-wei  SUNZhi-hui
作者单位:DepartmentofComputerScienceandEngineering.SoutheastUniversity,Nanjing210096,Jiangsu,China
基金项目:theNationalNaturalScienceFoundationofChina(70371015)
摘    要:Clustering in high-dimensional space is an important domain in data mining. It is the process of discovering groups in a high-dimensional dataset, in such way, that the similarity between the elements of the same cluster is maximum and between different clusters is minimal. Many clustering algorithms are not applicable to high dimensional space for its sparseness and decline properties. Dimensionality reduction is an effective method to solve this problem. The paper proposes a novel clustering algorithm CFSBC based onclosed frequent hemsets derived from association rule mining. which can get the clustering attributes with high efficiency. The algorithm has several advantages. First, it deals effectively with the problem of dimensionality reduction. Second, it is applicable to different kinds of attributes, Third, it is suitable for very large data sets. Experiment shows that the proposed algorithm is effective and efficient

关 键 词:CFSBC  聚类算法  多维空间  闭频项目设置  聚类归因  数据挖掘
收稿时间:26 May 2004

CFSBC: Clustering in high-dimensional space based on closed frequent item set
NIWei-wei SUNZhi-hui.CFSBC: Clustering in high-dimensional space based on closed frequent item set[J].Wuhan University Journal of Natural Sciences,2004,9(5):590-594.
Authors:Ni?Wei-wei  Email author" target="_blank">Sun?Zhi-huiEmail author
Institution:(1) Department of Computer Science and Engineering, Southeast University, 210096 Nanjing, Jiangsu, China
Abstract:Clustering in high-dimensional space is an important domain in data mining. It is the process of discovering groups in a high-dimensional dataset, in such way, that the similarity between the elements of the same cluster is maximum and between different clusters is minimal. Many clustering algorithms are not applicable to high-dimensional space for its sparseness and decline properties. Dimensionality reduction is an effective method to solve this problem. The paper proposes a novel clustering algorithm CFSBC based on closed frequent itemsets derived from association rule mining, which can get the clustering attributes with high efficiency. The algorithm has several advantages. First, it deals effectively with the problem of dimensionality reduction. Second, it is applicable to different kinds of attributes. Third, it is suitable for very large data sets. Experiment shows that the proposed algorithm is effective and efficient.
Keywords:clustering  closed frequent itemsets  association rule  clustering attributes
本文献已被 CNKI 维普 万方数据 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号