首页 | 本学科首页   官方微博 | 高级检索  
     检索      

基于时间序列的Global Skyline并行算法
引用本文:李媛媛,曲雯毓,栗志扬,季长清,吴俊峰.基于时间序列的Global Skyline并行算法[J].系统工程与电子技术,2016,38(1):215-222.
作者姓名:李媛媛  曲雯毓  栗志扬  季长清  吴俊峰
作者单位:1. 大连海事大学信息科学技术学院, 辽宁 大连 116026; 2. 大连交通大学软件学院,; 辽宁 大连 116028; 3. 大连大学物理科学与技术学院, 辽宁 大连 116622;; 4. 大连海洋大学信息工程学院, 辽宁 大连 116023
摘    要:Global Skyline 查询是Skyline查询的一种变种,它和动态Skyline查询、反Skyline查询关系密切,已被广泛应用于多目标决策、网络监控、数据挖掘等方面。随着数据的积累,传统集中式的Skyline查询已经不能满足大数据的处理要求。为了高效解决大规模的基于时间序列的数据处理难题,提出了基于MapReduce框架并行的Global Skyline Cell查询算法。首先,通过对实际应用需求进行分析,本文提出了基于时间序列数据Skyline查询的时间倒排索引模型;并提出了Global Skyline格概念,利用格间的支配关系进行粗粒度高效剪枝,避免了大部分的无效运算;其次查询点将数据空间分割成不同象限,基于各象限进行轮询,实现了Global Skyline 格的查询,在此候选结果中得到Global Skyline点,为下一步实现动态Skyline和反Skyline查询奠定基础。最后,我们在Hadoop集群环境中实现了该算法。实验结果表明,该算法能有效解决基于时间序列的大规模数据Skyline查询的时间和空间矛盾,能够满足实际应用需求。


Parallel algorithm of Global Skyline on time series
LI Yuan-yuan,QU Wen-yu,LI Zhi-yang,JI Chang-qing,WU Jun-feng.Parallel algorithm of Global Skyline on time series[J].System Engineering and Electronics,2016,38(1):215-222.
Authors:LI Yuan-yuan  QU Wen-yu  LI Zhi-yang  JI Chang-qing  WU Jun-feng
Institution:1. College of Information Science and Technology, Dalian Maritime University, Dalian 116026, China;; 2. College of Software technology, Dailian Jiaotong University, Dalian 116028, China; 3. College of; Physical Science and Technology, Dalian University, Dalian 116622, China; 4. College of; Information Engineering, Dalian Ocean University, Dalian 116023, China
Abstract:Global Skyline query is a variant of the Skyline query which has been used for multiple objective decision making, business planning, network monitoring and data mining etc. The result set of Global Skyline query is close to the ones of dynamic Skyline query and reverse Skyline query. With the number of historical data increases, Skyline query on centralized system is not competent for big data and Skyline query for large scale data on time series is a challenge. A parallel algorithm of Global Skyline on time series is proposed. Firstly, we present a inverted index based on data on time series. Secondly, we provide the concept of Global Skyline cell which can eliminate the dominated cells according to the cell dominance relationship. The coarse grained pruning strategy can help to avoid a lot of meaningless computation. The query point divides the data space into the four quadrants, Global Skyline query can be executed in eachquadrant circularly. Lastly through extensive experiments with both real world and synthetic datasets, we show that our algorithm is much more efficient for big data on time series.
Keywords:
本文献已被 万方数据 等数据库收录!
点击此处可从《系统工程与电子技术》浏览原始摘要信息
点击此处可从《系统工程与电子技术》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号