首页 | 本学科首页   官方微博 | 高级检索  
     

Fault-Tolerant Technique in the Cluster Computation of the Digital Watershed Model
作者单位:State Key Laboratory of Hydroscience and Engineering Tsinghua University,State Key Laboratory of Hydroscience and Engineering,Tsinghua University,State Key Laboratory of Hydroscience and Engineering,Tsinghua University,State Key Laboratory of Hydroscience and Engineering,Tsinghua University,Beijing 100084,China,Beijing 100084,China,Beijing 100084,China,Beijing 100084,China
摘    要:This paper describes a parallel computing platform using the existing facilities for the digital watershed model. In this paper, distributed multi-layered structure is applied to the computer cluster system, and the MPI-2 is adopted as a mature parallel programming standard. An agent is introduced which makes it possible to be multi-level fault-tolerant in software development. The communication protocol based on checkpointing and rollback recovery mechanism can realize the transaction reprocessing. Compared with conventional platform, the new system is able to make better use of the computing resource. Experimental results show the speedup ratio of the platform is almost 4 times as that of the conventional one, which demonstrates the high efficiency and good performance of the new approach.


Fault-Tolerant Technique in the Cluster Computation of the Digital Watershed Model
SHANG Yizi,WU Baosheng,LI Tiejian,FANG Shenguang. Fault-Tolerant Technique in the Cluster Computation of the Digital Watershed Model[J]. Tsinghua Science and Technology, 2007, 12(Z1): 162-168
Authors:SHANG Yizi  WU Baosheng  LI Tiejian  FANG Shenguang
Abstract:This paper describes a parallel computing platform using the existing facilities for the digital watershed model. In this paper, distributed multi-layered structure is applied to the computer cluster system, and the MPI-2 is adopted as a mature parallel programming standard. An agent is introduced which makes it possible to be multi-level fault-tolerant in software development. The communication protocol based on checkpointing and rollback recovery mechanism can realize the transaction reprocessing. Compared with conventional platform, the new system is able to make better use of the computing resource. Experimental results show the speedup ratio of the platform is almost 4 times as that of the conventional one, which demonstrates the high efficiency and good performance of the new approach.
Keywords:digital watershed model  computer cluster  MPI-2  fault-tolerant
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号