首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Concurrent and Storage-Aware Data Streaming for Data Processing Workflows in Grid Environments
Authors:Wen Zhang  Junwei Cao  Yisheng Zhong  Lianchen Liu  Cheng Wu
Institution:1. Department of Automation, Tsinghua University, Beijing 100084, China;2. Research Institute of Information Technology, Tsinghua University, Beijing 100084, China;3. Tsinghua National Laboratory for Information Science and Technology, Beijing 100084, China
Abstract:Data streaming applications, usually composed of sequential/parallel data processing tasks organized as a workflow, bring new challenges to workflow scheduling and resource allocation in grid environments. Due to the high volumes of data and relatively limited storage capability, resource allocation and data streaming have to be storage aware. Also to improve system performance, the data streaming and processing have to be concurrent. This study used a genetic algorithm (GA) for workflow scheduling, using on-line measurements and predictions with gray model (GM). On-demand data streaming is used to avoid data overflow through repertory strategies. Tests show that tasks with on-demand data streaming must be balanced to improve overall performance, to avoid system bottlenecks and backlogs of intermediate data, and to increase data throughput for the data processing workflows as a whole.
Keywords:
本文献已被 CNKI ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号