Fault-Tolerant Technique in the Cluster Computation of the Digital Watershed Model |
| |
作者单位: | State Key Laboratory of Hydroscience and Engineering Tsinghua University,State Key Laboratory of Hydroscience and Engineering,Tsinghua University,State Key Laboratory of Hydroscience and Engineering,Tsinghua University,State Key Laboratory of Hydroscience and Engineering,Tsinghua University,Beijing 100084,China,Beijing 100084,China,Beijing 100084,China,Beijing 100084,China |
| |
摘 要: | This paper describes a parallel computing platform using the existing facilities for the digital watershed model. In this paper, distributed multi-layered structure is applied to the computer cluster system, and the MPI-2 is adopted as a mature parallel programming standard. An agent is introduced which makes it possible to be multi-level fault-tolerant in software development. The communication protocol based on checkpointing and rollback recovery mechanism can realize the transaction reprocessing. Compared with conventional platform, the new system is able to make better use of the computing resource. Experimental results show the speedup ratio of the platform is almost 4 times as that of the conventional one, which demonstrates the high efficiency and good performance of the new approach.
|
Fault-Tolerant Technique in the Cluster Computation of the Digital Watershed Model |
| |
Authors: | SHANG Yizi WU Baosheng LI Tiejian FANG Shenguang |
| |
Abstract: | This paper describes a parallel computing platform using the existing facilities for the digital watershed model. In this paper, distributed multi-layered structure is applied to the computer cluster system, and the MPI-2 is adopted as a mature parallel programming standard. An agent is introduced which makes it possible to be multi-level fault-tolerant in software development. The communication protocol based on checkpointing and rollback recovery mechanism can realize the transaction reprocessing. Compared with conventional platform, the new system is able to make better use of the computing resource. Experimental results show the speedup ratio of the platform is almost 4 times as that of the conventional one, which demonstrates the high efficiency and good performance of the new approach. |
| |
Keywords: | digital watershed model computer cluster MPI-2 fault-tolerant |
本文献已被 CNKI 万方数据 等数据库收录! |