Page-level Incremental Checkpointing for Efficient Use of Stable Storage

A Page-based Incremental Checkpointing Technique for Efficient Use of Stable Storage

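  • 허준영 (School of Computer Science and Engineering, Seoul National University) ;
  • 이상호 (School of Computer Science and Engineering, Seoul National University) ;
  • 구본철 (School of Computer Science and Engineering, Seoul National University) ;
  • 조유근 (School of Computer Science and Engineering, Seoul National University) ;
  • 홍지만 (Department of Computer Engineering, Soongsil University)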
  • Published : 2007.12.15

Abstract

Incremental checkpointing, which is intended to minimize checkpointing overhead, saves only the modified pages of a process. However, the cumulative size of incremental checkpoints increases at a steady rate over time because a number of updated values may be saved for the same page. In this paper, we present a comprehensive overview of Pickpt, a page-level incremental checkpointing facility. Pickpt provides space-efficient techniques that aim to minimize the use of disk space. Our experimental results show that Pickpt uses significantly less disk space than existing incremental checkpointing.

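Page-level incremental checkpointing reduces checkpointing overhead by saving only the pages of a process's memory state that have been modified. However, the cumulative size of the incremental checkpoints grows gradually as the number of checkpoints increases. This is because a page is repeatedly modified after one checkpoint and saved again at the next, and the earlier copies are never deleted. Checkpoints cannot be deleted arbitrarily, because all of them may be needed to reconstruct the saved state of the process at recovery time. This paper introduces Pickpt, a page-level checkpointing tool, and explains how Pickpt addresses the growth in cumulative checkpoint size. Experiments show that Pickpt significantly reduces the cumulative size of incremental checkpoints compared with existing incremental checkpointing.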
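
The growth problem described in the abstract can be illustrated with a small simulation. The sketch below is not Pickpt's implementation; the function names, the 8-page address space, and the write pattern are all hypothetical and chosen only to show why the cumulative size exceeds what recovery actually needs.

```python
# Toy model of page-level incremental checkpointing.
# Assumption: page contents are abstracted to a version number, and
# every name here is illustrative rather than part of Pickpt.


def take_incremental_checkpoint(dirty_pages, memory):
    """Save only the pages modified since the previous checkpoint."""
    return {page_no: memory[page_no] for page_no in sorted(dirty_pages)}


def merge(checkpoints):
    """Rebuild the process state by replaying checkpoints oldest to newest.

    A later copy of a page overwrites an earlier one, so the result
    holds exactly one (the newest) copy of each page.
    """
    state = {}
    for ckpt in checkpoints:
        state.update(ckpt)
    return state


def cumulative_pages(checkpoints):
    """Total number of page copies stored across all checkpoints."""
    return sum(len(ckpt) for ckpt in checkpoints)


def simulate():
    memory = {page_no: 0 for page_no in range(8)}  # 8 pages, version 0
    checkpoints = [dict(memory)]                   # first checkpoint is full
    writes_per_interval = [{0, 1}, {1, 2}, {0, 1}]  # hypothetical dirty sets
    for version, dirty in enumerate(writes_per_interval, start=1):
        for page_no in dirty:
            memory[page_no] = version
        checkpoints.append(take_incremental_checkpoint(dirty, memory))
    return memory, checkpoints


memory, checkpoints = simulate()
stored = cumulative_pages(checkpoints)
needed = len(merge(checkpoints))
print(f"stored page copies: {stored}, needed for recovery: {needed}")
```

In this run the checkpoints hold 14 page copies, while recovery needs only the newest copy of each of the 8 pages. The 6 stale copies are spread across several checkpoint files, which is why no single checkpoint can simply be deleted without first merging.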
