首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于映射簇的Web数据挖掘研究
引用本文:陈晓红,秦杨.基于映射簇的Web数据挖掘研究[J].系统工程,2004,22(7):80-83.
作者姓名:陈晓红  秦杨
作者单位:中南大学,商学院,湖南,长沙,410083
基金项目:国家自然科学基金委国家杰出青年科学基金资助项目(70125002)
摘    要:传统特征选择算法在多维Web数据中由于其数据对象自身固有的稀缺性而常常失效。在典型多维Web数据挖掘应用中,不同数据对象集合对于不同雏度集合而言可能聚类会更好,且在每个簇的具体子空间中维度数将可能非常大。事实上,为所有簇查找出单个的小雏度集合是不可能的。本文应用映射簇的概念来明确簇与雏度的关系,将聚类问题转化为映射簇问题,从而简化计算提高挖掘效率。最后给出相应的算法。

关 键 词:多维Web数据  Web数据挖掘  聚类  映射簇
文章编号:1001-4098(2004)07-0080-04

Study on the Projected Cluster Based Web Data Mining
CHEN Xiao-hong,QIN Yang.Study on the Projected Cluster Based Web Data Mining[J].Systems Engineering,2004,22(7):80-83.
Authors:CHEN Xiao-hong  QIN Yang
Abstract:Traditional feature selection algorithms trends to break down in high dimensional Web spaces because of the (inherent) sparsity of the data object. In the typical high dimensional Web data mining applications different sets of points may cluster better for different subsets of dimensions and the number of dimensions in each such cluster-specific subspace may also vary. In fact, it may be impossible to find a single small subset of dimensions for all the clusters. So in the paper we use the concept of projected cluster to discuss the relation of cluster and its dimensions, and realize clustering in high (dimensional) data by solving the projected cluster problem. Finally, corresponding fast algorithm is developed based on (Projected Cluster.)
Keywords:High Dimensional Web Data  Web Data Mining  Clustering  Projected Cluster
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号