An Asychronous Checkpointing Algorithm Using Virtual Checkpointing On Distributed Systems

분산시스템에서 가상 체크포인팅을 이용한 비동기화 체크포인팅 알고리즘

  • 김도형 (한국전자통신연구원 컴퓨터소프트웨어기술연구소) ;
  • 박창순 (한국전자통신연구원 컴퓨터소프트웨어기술연구소) ;
  • 김종 (포항공과대학교 전산학과)
  • Published : 1999.05.01

Abstract

Checkpointing is the one of fault-tolerant techniques to restore faults and to restart job fast. Checkpointing algorithms in distributed systems have been studied for many years. These algorithms can be classified into synchronous Checkpointing algorithms and asynchronous Checkpoiting algorithms. In this paper, we propose an independent Checkpointing algorithm that has a minimum Checkpointing counts equal to periodic Checkpointing algorithm, and relatively short rollback distance at faulty situation. Checkpointing count is directly related to task completion time in a fault-free situation and short rollback distance is directly related to task completion time in a faulty situation. The proposed algorithm is compared with the previously proposed asynchronous Checkpointing algorithms using simulation. In the simulation, the proposed Checkpointing algorithm produces better results than other algorithms in terms of task completion time in fault-free as well as faulty situations.

Keywords