首页 | 本学科首页   官方微博 | 高级检索  
     检索      

密度相关的数据流偏倚抽样
引用本文:杨宜东,孙志挥.密度相关的数据流偏倚抽样[J].应用科学学报,2006,24(2):203-207.
作者姓名:杨宜东  孙志挥
作者单位:东南大学计算机科学与工程系, 江苏南京 210096
基金项目:中国科学院资助项目;教育部高等学校博士学科点科研项目;南瑞继保学位基金
摘    要:利用数据空间动态网格划分的方法,对数据流空间的数据分布密度情况进行模拟,并在此基础上提出了一种基于密度的偏倚抽样方法.为验证该抽样方法的有效性,将其应用到数据流中的聚类挖掘,实验结果表明该算法具有良好的适用性和有效性.

关 键 词:数据流  偏倚抽样  聚类  
文章编号:0255-8297(2006)02-0203-05
收稿时间:2004-12-29
修稿时间:2004-12-292005-03-29

Biased Sampling of Data Streams Based on Density
YANG Yi-dong,SUN Zhi-hui.Biased Sampling of Data Streams Based on Density[J].Journal of Applied Sciences,2006,24(2):203-207.
Authors:YANG Yi-dong  SUN Zhi-hui
Institution:Department of Computer Science and Engineering, Southeast University, Nanjing 210096, China
Abstract:As an important kind of data source,data stream has received increasing attention.Data stream management systems and data mining based on data streams have also attracted much research interest.With dynamical grid-partitioning of the data space,distribution density of data streams is approximated,and based on which a density biased sampling method is presented.To test its efficiency,the proposed sampling method is applied to clustering data streams.Experimental results show promising applicability of the approach.
Keywords:data streams  biased sampling  clustering
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《应用科学学报》浏览原始摘要信息
点击此处可从《应用科学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号