DOI QR코드

DOI QR Code

Efficient Checkpoint Algorithm for Message-Passing Parallel Applications on Cloud Computing

클라우드컴퓨팅에서 메시지패싱방식 응용프로그램의 효율적인 체크포인트 알고리즘

  • Le, Duc Tai (School of Information and Communication Engineering, Sungkyunkwan University) ;
  • Dao, Manh Thuong Quan (School of Information and Communication Engineering, Sungkyunkwan University) ;
  • Ahn, Min-Joon (School of Information and Communication Engineering, Sungkyunkwan University) ;
  • Choo, Hyun-Seung (School of Information and Communication Engineering, Sungkyunkwan University)
  • Published : 2011.04.30

Abstract

In this work, we study the checkpoint/restart problem for message-passing parallel applications running on cloud computing environment. This is a new direction which arises from the trend of enabling the applications to run on the cloud computing environment. The main objective is to propose an efficient checkpoint algorithm for message-passing parallel applications considering communications with external systems. We further implement the novel algorithm by modifying gSOAP and OpenMPI (the open source libraries) which support service calls and checkpoint message-passing parallel programs, especially. The simulation showed that additional costs to the executing and checkpointing application of the algorithm are negligible. Ultimately, the algorithm supports efficiently the checkpoint/restart service for message-passing parallel applications, that send requests to external services.

Keywords