http://dx.doi.org/10.1633/JISTaP.2022.10.S.6

A Disk-based Archival Storage System Using the EOS Erasure Coding Implementation for the ALICE Experiment at the CERN LHC  

Ahn, Sang Un (Korea Institute of Science and Technology Information (KISTI), Global Science experimental Data hub Center (GSDC))
Betev, Latchezar (European Organization for Nuclear Research (CERN))
Bonfillou, Eric (European Organization for Nuclear Research (CERN))
Han, Heejune (Korea Institute of Science and Technology Information (KISTI))
Kim, Jeongheon (Korea Institute of Science and Technology Information (KISTI))
Lee, Seung Hee (Korea Institute of Science and Technology Information (KISTI))
Panzer-Steindel, Bernd (European Organization for Nuclear Research (CERN))
Peters, Andreas-Joachim (European Organization for Nuclear Research (CERN))
Yoon, Heejun (Korea Institute of Science and Technology Information (KISTI))
Publication Information
Journal of Information Science Theory and Practice / v.10, no.spc, 2022, pp. 56-65
Abstract
Korea Institute of Science and Technology Information (KISTI) is a Worldwide LHC Computing Grid (WLCG) Tier-1 center mandated to preserve raw data produced by A Large Ion Collider Experiment (ALICE) at the world's largest particle accelerator, the Large Hadron Collider (LHC) at the European Organization for Nuclear Research (CERN). Tape is the physical medium most widely used for long-term data preservation, thanks to its reliability and its lower price per capacity compared with other media such as optical disks, hard disks, and solid-state disks. However, the decreasing number of manufacturers of both tape drives and cartridges, together with patent disputes among them, has escalated the market risk. As an alternative to a tape-based data preservation strategy, we proposed a disk-only erasure-coded archival storage system, Custodial Disk Storage (CDS), powered by Exascale Open Storage (EOS), open-source storage management software developed by CERN. The CDS system consists of 18 high-density Just-Bunch-Of-Disks (JBOD) enclosures attached to 9 servers through 12 Gbps Serial Attached SCSI (SAS) Host Bus Adapter (HBA) interfaces via multiple paths for redundancy and multiplexing. For data protection, we introduced a Reed-Solomon (RS) (16, 4) Erasure Coding (EC) layout, in which each stripe of 16 blocks holds 12 data blocks and 4 parity blocks, giving an annual data loss probability of 5×10⁻¹⁴. In this paper, we discuss the CDS system design based on JBOD products, its performance limitations, and the data protection strategy accommodating the EOS EC implementation. We also present CDS operations for the ALICE experiment and long-term power consumption measurements.
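The arithmetic behind the RS (16, 4) layout can be sketched in a few lines of Python. This is only an illustration: the function names and the per-block loss probability `p_block` are assumptions, not part of the paper or of EOS. The simple binomial model below does not reproduce the quoted 5×10⁻¹⁴ figure, which depends on disk failure rates and repair times that the abstract does not state.

```python
from math import comb


def ec_layout(total_blocks, parity_blocks):
    """Summarize an erasure-coding layout given total and parity block counts."""
    data_blocks = total_blocks - parity_blocks
    return {
        "data_blocks": data_blocks,
        "parity_blocks": parity_blocks,
        # A Reed-Solomon stripe survives the loss of up to `parity_blocks` blocks.
        "tolerated_failures": parity_blocks,
        # Fraction of raw capacity that holds user data.
        "usable_fraction": data_blocks / total_blocks,
        # Raw capacity needed per unit of user data.
        "raw_per_usable": total_blocks / data_blocks,
    }


def stripe_loss_probability(total_blocks, parity_blocks, p_block):
    """Probability that more than `parity_blocks` blocks of one stripe are lost.

    Assumes (hypothetically) that each block is lost independently with
    probability `p_block` within the window of interest.
    """
    return sum(
        comb(total_blocks, k) * p_block**k * (1 - p_block) ** (total_blocks - k)
        for k in range(parity_blocks + 1, total_blocks + 1)
    )


layout = ec_layout(16, 4)
print(layout["data_blocks"], layout["usable_fraction"])
# p_block = 0.001 is an arbitrary illustrative value, not a measured rate.
print(stripe_loss_probability(16, 4, 0.001))
```

Under these definitions, 12 of every 16 blocks carry data, so 75% of the raw capacity is usable, and a stripe becomes unreadable only when five or more of its blocks are lost at the same time.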
Keywords
Worldwide LHC Computing Grid Tier-1; A Large Ion Collider Experiment; Custodial Disk Storage; Exascale Open Storage; Erasure Coding