首页 | 本学科首页   官方微博 | 高级检索  
     

基于近似等深柱状图的数据流并行聚集算法
引用本文:侯燕,王永利. 基于近似等深柱状图的数据流并行聚集算法[J]. 解放军理工大学学报(自然科学版), 2008, 9(1): 29-33
作者姓名:侯燕  王永利
作者单位:解放军94402部队,山东,济南,250002;南京理工大学,计算机科学与技术学院,江苏,南京,210094
摘    要:针对数据流并行聚集问题,提出了一种不同于关系数据和时间序列数据处理的并行聚集方法.为解决已经划分出的数据流元组无法再现的特点,提出能够感知数据流变化的采样算法对数据流采样.利用近似等深柱状图技术描述采样数据的分布特征,平均分配数据流量.使用时间聚集森林结构计算时间窗聚集.通过验证采样个数对并行聚集的影响,数据分布对近似划分向量算法性能的影响,测试数据流量与并行聚集加速比的关系,证明本算法能够高效地计算数据流聚集查询.

关 键 词:数据流  并行处理  近似技术  柱状图  聚集
文章编号:1009-3443(2008)01-0029-05
修稿时间:2006-12-21

Parallel aggregation algorithm over data streams basedon approximate equal depth histogram
HOU Yan and WANG Yong-li. Parallel aggregation algorithm over data streams basedon approximate equal depth histogram[J]. Journal of PLA University of Science and Technology(Natural Science Edition), 2008, 9(1): 29-33
Authors:HOU Yan and WANG Yong-li
Affiliation:No.94402 Unit of PLA,Jinan 250002,China;School of Computer Science and Technology,Nanjing Univ.of Sci.& Tech.,Nanjing 210094,China
Abstract:In light of parallel ag gregat ing algor ithm for data st ream, a new parallel agg reg ating st rateg yw ith relational data and temporal sequence data was put forw ard. To tackle the pro blem that part it ionedtuples w ere unable to recur over data st reams, a chang e-aw are sampling algorithm w as used to sample datast reams f irst , and an appr oximate equal depth histog ram algorithm w as used to describe data dist ributionin quer y w indow to part it io n data st reams av erag ely , and then a nov el m-order Temporal Ag greg ate forestdata str ucture w as applied to comput ing aggr eg ate in t ime w indow . It ver if ied the effect of numbers ofsampling on parallel ag gregat ion, the ef fect of data dist ribut io n on the perfo rmance o f approx imate part ition vector algorithm, and the relat io n between quant ity of data st reams and par al lel ag gr eg at ion speed ratio. Ex periment s pr ove that the proposed algorithm can pr ovide accurate answ er to parallel aggr eg ationover data str eams ef ficiently .
Keywords:data st reams   parallel processing   approx imate technique   histogr am   ag gregat ion
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《解放军理工大学学报(自然科学版)》浏览原始摘要信息
点击此处可从《解放军理工大学学报(自然科学版)》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号