Determination of Optimal Checkpoint Interval for RM Scheduled Real-time Tasks

RM 스케줄링된 실시간 태스크에서의 최적 체크 포인터 구간 선정

  • Published : 2007.06.01

Abstract

For a system with multiple real-time tasks of different deadlines, it is very difficult to find the optimal checkpoint interval because of the complexity in considering the scheduling of tasks. In this paper, we determine the optimal checkpoint interval for multiple real-time tasks that are scheduled by RM(Rate Monotonic) algorithm. Faults are assumed to occur with Poisson distribution. Checkpoints are inserted in the execution of task with equal distance in the same task, but different distances in other tasks. When faults occur, rollback to the latest checkpoint and re-execute task after the checkpoint. We derive the equation of maximum slack time for each task, and determine the number of re-executable checkpoint intervals for fault recovery. The equation to check the schedulibility of tasks is also derived. Based on these equations, we find the probability of all tasks executed within their deadlines successfully. Checkpoint intervals which make the probability maximum is the optimal.

Keywords

References

  1. C. M. Krishna and A. D. Singh, 'Optimal configuration of redundant real-time systems in the face of correlated failure.' IEEE Trans. on Reliability, vol 44. pp. 587-594. Dec.1995 https://doi.org/10.1109/24.475977
  2. Seong Woo Kwak and Dyung Kook Kim. 'Task Scheduling Strategies for Reliable TMR Controllers using Task Grouping and Assignment'. IEEE Trans. on Reliability. vol. 49, no.4. pp. 355-362. Dec. 2000 https://doi.org/10.1109/24.922488
  3. C. M. Krishina and Kang G. Shin. Real-Time Systems. New York: McGraw-Hill. 1997
  4. D. P. Siewiorek and R S. Swarz, Reliable Computer Systems. Digital Press. 1992
  5. Avi Ziv and Jehoshua Bruck, 'An on-line algorithm for checkpoint placement,' IEEE Trans. on Computers, vol. 46, pp. 976-984, Sep. 1997 https://doi.org/10.1109/12.620479
  6. R. Geist. R. Reynolds, and J. Westall. 'Selection of a checkpoint interval in a critical-task environment,' IEEE Trans. on Reliability. vol. 37, pp. 395-400, Oct. 1988 https://doi.org/10.1109/24.9847
  7. Kang G. Shin. Tein-Hsiang Lin. and Yann -Hang Lee, 'Optimal checkpointing of real-time tasks,' IEEE Trans. on Computers, vol. C-36, pp. 1328-1341. Nov. 1987 https://doi.org/10.1109/TC.1987.5009472
  8. C. M. Krishna and A. D. Singh. 'Reliability of checkpointed real-time systems using time redundancy.' IEEE Trans. on Reliability, vol. 42, pp 427-435, Sep. 1993 https://doi.org/10.1109/24.257826
  9. Seong Woo Kwak. Byung Jae Choi and Byung Kook Kim. 'Optimal Checkpointing Strategy for Real-Time Control Systems under Faults with Exponential Duration'. IEEE Trans. on Reliability, vol.50. no.3. pp. 293-301. Sep. 2001 https://doi.org/10.1109/24.974127
  10. Seong Woo, Kwak, 'Reliability Analysis and Design of Real-time Fault Tolerant Control Systems under Transient Faults', Ph.D thesis. KAIST. 2000
  11. John W. Young, 'A first order approximation to the optimal checkpoint intervals.' Comm. of the ACM. vol. 17, pp.530-531. Nov. 1974 https://doi.org/10.1145/361147.361115
  12. Hagbae Kim and Kang G. Shin. 'Modeling of externally-induced/common-cause faults In fault-tolerant systems.' IEEE/ AIAA Digital Avionics System Conference. pp. 402-407, Oct. 1994 https://doi.org/10.1109/DASC.1994.369450
  13. Sunondo Ghosh.Rami G. Melhem. Daniel Mosse.' Fault-Tolerance Through Scheduling of Aperiodic Tasks in Hard Real-Time Multiprocessor Systems'. IEEE Trans. on Parallel Disgrib. Syst. Vol.8, No. 3 pp. 272-284, 1997 https://doi.org/10.1109/71.584093
  14. Frank Liberato. Rami Melhem. Daniel Mosse. 'Tolerance to Multiple Transient Faults for Aperiodic Tasks in Hard Real-Time Systems'. vol49 No.9. IEEE Trans. on Computers. pp.906-914. sep. 2000 https://doi.org/10.1109/12.869322
  15. H. Kim and K. G. Shin, 'Design and Analysis of an Optimal Instruction Retry Policy for TMR Controller Computers', IEEE Trans. on Computers, vol 45, pp. 1217-1225, Nov. 1996 https://doi.org/10.1109/12.544478
  16. 곽성우, 하드데드라인을 가지는 다중 실시간 주기적 태스크에서의 체크포인팅 기법, 전기학회논문지-D, 제53권 제8호, pp. 594-601, 2004. 8