Browse > Article

A Checkpointing Framework for Dependable Real-Time Systems  

Lee, Hyo-Soon (서울대학교 컴퓨터공학부)
Shin, Heonshik-Sin (서울대학교 컴퓨터공학부)
Abstract
We provide a checkpointing framework reflecting both the timeliness and the dependability in order to make checkpointing applicable to dependable real-time systems. The predictability of real-time tasks with checkpointing is guaranteed by the worst case execution time (WCET) based on the allocated number of checkpoints and the permissible number of failures. The permissible number of failures is derived from fault tolerance requirements, thus guaranteeing the dependability of tasks. Using the WCET and the permissible number of failures of tasks, we develop an algorithm that determines the minimum number of checkpoints allocated to each task in order to guarantee the schedulability of a task set. Since the framework is based on the amount of time redundancy caused by checkpointing, it can be extended to other time redundancy techniques.
Keywords
Fault tolerance; Checkpointing; Real-time systems; Schedulability;
Citations & Related Records
연도 인용수 순위
  • Reference
1 N. Bowen and D. Pradhan, 'Processor and memory based chekpoint and rollback recorery,' IEEE Computer, vol. 26, pp. 22-29, Feb. 1993   DOI   ScienceOn
2 J. Plank, M. Beck, G. Kingsley, and K. Li, 'Libckpt:Transparent checkpointing under unix,' Proceedings of USENIX Winter 1995 Technical Conference, pp. 213-223, 1995
3 D. Pradhan and N. H. Vaidya, 'Roll-forward checkpointing scheme: A novel fault-tolerant architecture,' IEEE Trans. Computers, vol. 43, pp. 1163-1174, Oct. 1994   DOI   ScienceOn
4 J. Long, W. Fuchs, and J. Abraham, 'A forward recovery strategy using checkpointing in parallel systems,' in Proceedings of International Conference on Parallel Porcessing, pp. 272-275, 1990
5 A. Burns, R. Davis, and S. Punnekkat, 'Feasibility analysis of fault-tolerant real-time task sets,' Euromicro Workshop on Real-Time Systems, pp. 29-33, 1996
6 S. Punnekkat, Schedulability Analysis for Fault Tolerant Real-Time Systems, PhD thesis, University of York, 1997
7 H. Lee, H. Shin, N. Chang, 'Checkpoint placement for fault-tolerant real-time systems,' in Preprints of IFAC Workshop on Distributed Computer Control Systems, pp. 61-66, 2000
8 S. Punnekkat, A. Burns and R. Davis, 'Analysis of checkpointing for real-time systems,' Journal of Real-Time Systems, Jan. 2001   DOI
9 R. Iyer, D. Rossetti, and M. Hsueh, 'Measurement and modelings of computer reliability as affected by system activity,' ACM Trans. Computer Systems, vol. 4, pp. 214-237, Aug. 1986   DOI
10 X. Castillo, S. McConnel, and D. Siewiorek, 'Derivation and calibration of a transient error reliability model,' IEEE Trans. Computers, vol. 31, pp. 658-671, July 1982   DOI   ScienceOn
11 A. Tantawi and M. Ruschitzka, 'Performance analysis of checkpointing strategies,' ACM Trans. Computer Systems, vol. 2, pp. 123-144, May 1984   DOI
12 D. Pradhan, Fault-Tolerant Computer System Design, Prentice-Hall, 1995
13 J. Lehoczky, L. Shar, and Y. Ding, 'The rate monotonic scheduling algorithm:Exact characterization and average case behavior,' in Proceedings of Real-Time Systems Symposium, pp. 166-171, 1989   DOI
14 A. Bertossi and L. Mancini, 'Scheduling algorithms for fault-tolerance in hard real-time systems,' Journal of Real-Time Systems, vo. 7, pp. 229-245, Nov. 1994   DOI
15 S. Ghosh, R. Melhem, D. Mosse, and J. Sarma, 'Enhancing real-time schedules to tolerate transient faults,' in Proceedings of Real-Time Systems Symposium, 1995   DOI
16 V. Nicola, 'Checkpointing and the modeling of propram execution time,' in Software Fault Tolerance (M. Lyu, ed.), ch. 7, pp. 167-188, Chichester: John Wiley & Sons, 1995
17 S. Ghosh, Guaranteeing Fault Tolerance Through Scheduling in Real-Time Systems, PhD thesis, University of Pittsburgh, 1996
18 M. Pandya and M. Malek, 'Minimum achievable utilization for fault-tolerant processing of periodic tasks,' IEEE Trans. Computers, vol. 47, pp. 1102-1112, Oct. 1998   DOI   ScienceOn
19 K. Shin, T. Lin, and Y. Lee, 'Optimal checkpointing of real-time tasks,' IEEE Trans Computers, vol. 36, pp. 1328-1341, Nov. 1987   DOI