DOI QR코드

DOI QR Code

A Study on SIARD Verification as a Preservation Format for Data Set Records

행정정보 데이터세트 보존포맷으로서 SIARD 검증에 관한 연구

  • 윤성호 (전북대학교 기록관리학과) ;
  • 이정은 (전북대학교 기록관리학과 4단계 BK21 교육연구단) ;
  • 양동민 (전북대학교 기록관리학과, 문화융복합아카이빙연구소)
  • Received : 2021.07.20
  • Accepted : 2021.08.03
  • Published : 2021.08.31

Abstract

As the importance of data grows because of the advent of the next industrial revolution, foreign countries are pushing for long-term data preservation technology research. On the other hand, in Korea, administrative information data sets have been legislated as records management areas without specific long-term preservation measures. As a response, this study conducted basic, cross-validation tests on the Software Independent Archiving of Relational Database (SIARD), which was proposed as an administrative information data set preservation format in several prior works. First, the underlying verification test focuses on deriving the data, structure, and functionality of the data set that SIARD can preserve. The second cross-validation test aimed at verifying the interoperability of SIARD independent of the DBMS class. In addition, two verification tests have confirmed the SIARD feature delivery range. Consequently, the differences between the feature types specified in the SIARD 2.0 standard and those provided by the actual SIARD Suite have been derived. Based on verification test results, we are proposing a development plan to broaden SIARD functionality and set a direction to efficiently enhance SIARD for local situations.

4차 산업혁명의 도래로 데이터의 중요성이 커지는 상황에 따라, 해외 각국은 데이터 장기보존 기술 연구를 추진하고 있다. 반면 우리나라는 행정정보 데이터세트가 기록관리 영역으로 법제화됐으나, 구체적인 장기보존 방안이 부재한 상황이다. 이에 본 연구는 여러 선행연구에서 행정정보 데이터세트 보존포맷으로 제안된 SIARD(Software Independent Archiving of Relational Database)에 대한 기초, 교차 검증 시험을 수행했다. 먼저 기초 검증 시험은 SIARD 포맷이 보존할 수 있는 데이터세트의 데이터, 구조, 기능 등을 도출하는데 방점을 두었다. 두 번째 교차 검증 시험은 DBMS 종류에 구애받지 않는 SIARD의 상호호환성 검증에 목적을 두었다. 2차례 검증 시험 결과, SIARD 포맷으로 JSON, UROWID 데이터 타입, FK(Foreign Key), 함수 계열 요소를 보존할 수 없으며, SIARD 2.0 표준에 명시된 기능과 실제 SIARD Suite이 제공하는 기능에 차이가 있음을 확인하였다. 본 연구는 실증적 검증 시험을 진행했으며, SIARD Suite의 기능을 보완하는 개발 방안과 SIARD Suite을 국내 환경에 맞춰 효율적으로 개발할 수 있는 방향성을 제시했다는 점에서 의의가 있다.

Keywords

Acknowledgement

본 연구는 "2019년 행정안전부 국가기록원기록관리연구개발사업"의 연구비를 지원받아 수행되었음. 이 논문은 2019년 대한민국 교육부와 한국연구재단의 지원을 받아 수행된 연구임(NRF-2019S1A5B8099507).

References

  1. Han, Hui-Jeong, Yoon, Sung-Ho, Oh, Hyo-Jung, & Yang, Dong-Min (2020). Empirical Verification of Conversion and Restoration of Preservation Format for Dataset: Application of Dataset with Disaster Safety Information to SIARD. Journal of Korean Society for Information Management, 37(2), 251-287. http://dx.doi.org/10.3743/KOSIM.2020.37.2.251
  2. Kim, Joo-Yeon (2020). A Study on the Long-term Preservation of Administrative Information Datasets Using SIARD. Master's thesis, Graduate School of Records, Archives & Information Science, Myongji University.
  3. Korea. Ministry of the Interior and Safety (2020). 2020 National Government EA based Public Sector Information Resources Statistical Report.
  4. Korean Library Association (2010). Dictionary of Libraries and Information Sciences. Available: http://www.kla.kr/jsp/fileboard/termdic.do
  5. Lee, Kyu-Chul (2016). Understanding for administration information system dataset and considerations for recordkeeping. Records Management Standard Forum Resources, 72-78.
  6. National Archives of Korea (2019a). Study on long-term preservation technology of dataset-type electronic records.
  7. National Archives of Korea (2019b). Policy of Format by Electronic Records Type.
  8. National Archives of Korea (2019c). Policy of Long-Term Preservation of Electronic Records.
  9. Oh, Seh-Ra & Rieh, Hae-young (2019). Managing Data Set in Administrative Information Systems as Records. Journal of Korean Society of Archives and Records Management, 19(2), 51-76. https://doi.org/10.14404/JKSARM.2019.19.2.051
  10. Oh, Seh-Ra, Park, Seung-Hoon, & Yim, Jin-Hee (2018). A Case Study of Dataset Records in Information Management System. Journal of Korean Society of Archives and Records Management, 18(2), 109-133. https://doi.org/10.14404/JKSARM.2018.18.2.109
  11. Record Keeping Criteria for Dataset: Composition of Dataset Management Reference Table & Exchange of Dataset. NAK 35:2020(v1.0).
  12. Roh, Jong-Won & So, Jeong-Eui (2020). A Study on the Management Plan for Preservation and Long-Term Use of Datasets. Journal of D-Culture Archives, 3(1), 51-64.
  13. So, Jeong-Eui, Han, Hui-Jeong, & Yang, Dong-Min (2018). A Comparative Analysis of Long-Term Preservation Policies in Foreign Electronic Records: NARA, LAC, TNA, NAA, and SFA. Journal of Korean Society of Archives and Records Management, 18(4), 125-148. https://doi.org/10.14404/JKSARM.2018.18.4.125
  14. Telecommunications Technology Association (2018). Dictionary of Information Technology. Available: http://terms.tta.or.kr/main.do
  15. Wang, Ho-Sung & Seol, Moon-won (2017). A Study on Managing Dataset Records in Government Information Systems. Journal of Korean Society of Archives and Records Management, 17(3), 23-47. https://doi.org/10.14404/JKSARM.2017.17.3.023
  16. Digital Preservation Guidance Note 1 - Selecting File Formats for Long-Term Preservation. DPGN-01.
  17. Swiss Federal Archives (2021.07.01.). SIARD Suite. Swiss Federal Archives. Available: https://www.bar.admin.ch/bar/en/home/archiving/tools/siard-suite.html