DOI QR코드

DOI QR Code

Research on Data Replication Method for Building an Enterprise Disaster Recovery System

엔터프라이즈 재해복구시스템 구축을 위한 데이터 복제 방안 연구

  • Hyun-sun Kang (College of General Education, Namseoul University)
  • 강현선 (남서울대학교 교양대학)
  • Received : 2023.10.05
  • Accepted : 2023.11.10
  • Published : 2024.01.31

Abstract

In the event of a disaster, it is essential to establish a disaster recovery plan and disaster recovery system to minimize disruption to major IT infrastructure and provide continuous business services. In the process of building a disaster recovery system, data replication is a key element of data recovery to provide uninterrupted and continuous business services in the event of a disaster. The data replication method can be determined depending on the system configuration environment and disaster recovery goal level. In this paper, we present a method for determining a data replication method suitable for the configuration environment and disaster recovery target level when building a disaster recovery system. In addition, the replication method decision procedure is applied to build a disaster recovery system and analyze the construction results. After establishing the disaster recovery system, a test was conducted to determine whether the service was transferred to the disaster recovery center in a disaster situation and normal service was provided, and the results were analyzed. As a result, it was possible to systematically select the optimal data replication method during the disaster recovery system construction phase. The established disaster recovery system has an RTO of 3.7 hours for service conversion to the disaster recovery center to provide continuous business services, and the disaster recovery level, which was Tier 2, has been improved to the target level within 4 hours of RTO and RPO=0.

재해 발생 시 주요 IT 인프라 중단을 최소화하고 연속적인 비즈니스 서비스를 제공하기 위한 재해복구 계획 및 재해복구시스템의 구축은 반드시 필요하다. 재해복구시스템 구축과정에서 데이터 복제는 재해 발생 시 중단 없는 연속적인 비즈니스 서비스 제공을 위한 데이터 복구의 핵심요소로 데이터 복제방식은 시스템 구성환경과 재해복구 목표수준에 따라 결정할 수 있다. 본 논문에서는 재해복구시스템 구축에서 구성환경과 재해복구 목표수준에 적합한 데이터 복제방식 결정 방안에 대해 제시한다. 또한 복제방식 결정 절차를 적용하여 재해복구시스템을 구축하고 구축 결과를 분석한다. 재해복구시스템 구축 후 재해 상황에서 재해복구센터로 서비스가 전환, 정상적인 서비스가 진행되는지를 판단하기 위한 모의 테스트를 진행하고 결과를 분석하였다. 그 결과 재해복구시스템 구축 단계에서는 체계적으로 최적의 데이터 복제방식의 선정이 가능했다. 구축된 재해복구시스템은 연속적인 비즈니스 서비스 제공을 위해 재해복구센터로 서비스 전환되는 시간 RTO는 3.7시간으로, Tier 2였던 재해복구 수준이 목표수준 RTO 4시간 이내, RPO=0으로 개선되었다.

Keywords

Acknowledgement

이 논문은 2023년도 남서울대학교 학술연구비 지원에 의해 연구되었음

References

  1. C. Brooks, C. Leung, A. Mirza, C. Neal, Y. L. Qiu, J. Sing, F. TH Wong, and I. R Wright, "IBM System Storage Business Continuity: Part 1 Planning Guide," ibm.com/Redbooks, March 2007.
  2. C. Brooks, M. Bedernjak, I. Juran, and J. Merryman, "Disaster Recovery Strategies with Tivoli Storage Management," Red Books Series, IBM, Chap. 2, pp. 21-36, 2002.
  3. H. A. R. Mohamed, "A Proposed Model for IT Disaster Recovery Plan," I.J. Modern Education and Computer Science, Vol. 4, pp. 57-67, April 2014. DOI: 10.5815/ijmecs.2014.04.08
  4. V. Jorrigala, "Business Continuity and Disaster Recovery Plan for Information Security," Culminating Projects in Information Assurance. 44, 2017.
  5. F. C. Benavente, M. R. Gallardo, M. B. Esquivel, Y. Akakura, and K. Ono, "Methodology and procedure of business impact analysis for improving port logistics business continuity management," Journal of Integrated Disaster Risk Management, June 2016. DOI: 10.5595/idrim.2016.0114
  6. I. H. Sawalha, "Views on business continuity and disaster recovery," International Journal of Emergency Services, Vol. 10(3), pp. 351-365, October 2021. DOI: 10.1108/IJES-12-2020-0074
  7. C. Dwyer and J. Horney, "Validating Indicators of Disaster Recovery with Qualitative Research," Version 1. PLoS Curr, December 2014.
  8. N. Mansouri, M. M. Javidi, and B. M. H. Zade, "Hierarchical data replication strategy to improve performance in cloud computing," Frontiers of Computer Science, Vol. 15, December 2020.
  9. R. Mokadem and A. Hameurlain, "Data replication strategies with performance objective in data grid systems: a survey," International Journal of Grid and Utility Computing, Vol. 6, pp. 30-46, 2015. DOI: 10.1504/IJGUC.2015.066395
  10. A. Natanzon, P. Shilane, M. Abashkin, L. Baruch, E. Bachmat, "Hybrid Replication: Optimizing Network Bandwidth and Primary Storage Performance for Remote Replication," 2016 IEEE International Conference on Networking, Architecture and Storage (NAS), August 2016. DOI: 10.1109/NAS.2016.7549405
  11. A. Natanzon and E. Bachmat, "Dynamic Synchronous/Asynchronous Replication," ACM Transactions on Storage, Vol. 9, pp. 1-19, August 2013. DOI: 10.1145/2508011
  12. R. Chen and H. Chen, "Asymmetric virtual machine replication for low latency and high available service," Science China Information Sciences, Vol. 61, June 2018.