DOI QR코드

DOI QR Code

메타데이터의 통합을 위한 스키마 매핑 및 데이터 변환 시스템

Schema Mapping and Data Conversion System for Integrating Article Metadata

  • 이민호 (한국과학기술정보연구원 소프트웨어연구실) ;
  • 이원구 (한국과학기술정보연구원 소프트웨어연구실) ;
  • 최윤수 (한국과학기술정보연구원 소프트웨어연구실) ;
  • 윤화묵 (한국과학기술정보연구원 소프트웨어연구실) ;
  • 송사광 (한국과학기술정보연구원 소프트웨어연구실) ;
  • 정한민 (한국과학기술정보연구원 소프트웨어연구실)
  • Lee, Min-Ho (Dept. of Software Research, Korea Institute of Science and Technology Information) ;
  • Lee, Won-Goo (Dept. of Software Research, Korea Institute of Science and Technology Information) ;
  • Choi, Yun-Soo (Dept. of Software Research, Korea Institute of Science and Technology Information) ;
  • Yoon, Hwa-Mook (Dept. of Software Research, Korea Institute of Science and Technology Information) ;
  • Song, Sa-Kwang (Dept. of Software Research, Korea Institute of Science and Technology Information) ;
  • Jung, Han-Min (Dept. of Software Research, Korea Institute of Science and Technology Information)
  • 투고 : 2012.09.27
  • 심사 : 2012.10.22
  • 발행 : 2012.10.31

초록

본 논문에서는 논문 메타 데이터 특성 분석 연구를 토대로 데이터 변환 방법들을 고안하고 스키마 매핑 및 변환시스템을 구현한다. 빅 데이터 분석을 위해서는 다양한 시스템의 데이터베이스에 축적된 데이터를 공통의 형식으로 변환하는데, 현재의 데이터 변환 시스템들은 구문 의존적 문제와 사용의 불편함을 가지고 있다. 본 논문에서 구현된 시스템은 논문 메타데이터 분야에 특화된 시스템으로, 사용하기 쉬운 스키마 매핑 인터페이스를 가지고 있으며 다양한 논문 데이터 구문을 변환할 수 있다. 또한 시스템에 등록되지 않은 새로운 스키마를 가진 데이터가 입력되더라도 시스템의 재 컴파일이 필요 없다. 본 시스템은 사용성 평가를 통하여 시스템 사용성 평균 점수로 89.25점을 받았다.

We devise data conversion methods and implement schema mapping and conversion system based on the study on research paper metadata characteristics analysis. Data conversion in unified form from databases of various systems is necessary for big data analysis. Legacy data conversion systems have drawbacks of syntax dependent problem and inconvenience for use. The implemented system, which is specialized system for research paper metadata, has easy schema mapping interface and can convert data with various syntax. In addition to that, Recompiling of the system is not necessary even if new schema which is not preregistered in the system comes in. We proved its usefulness by usability evaluation.

키워드

참고문헌

  1. Margaret St. Pierre, "Issues in Crosswalking Content Metadata Standards", NISO White Paper, 1998.
  2. Erhard Rahm, and Phillip A. Bernstein, "A survey of approached to automatic schema matching", The VLDB Journal, Vol.10, pp.334-350, 2001. https://doi.org/10.1007/s007780100057
  3. Metadata Registry, http://metadata-stds.org/11179/
  4. Md. Sumon Shahriar, and Jixue Liu, "Towards the Preservation of Referential Constraints in XML Data Transformation for Integration", International Journal of Database Theory and Application, Vol.3, No.2, 2010.
  5. Won-Goo Lee, Hwa-Mook Yoon, and Won-Kyung Sung, "An Efficient Management System on Digital Contents: Literature Data", Poceedings of the International Conference on Convergence Content, Nara, Japan, 2010 December.
  6. Min-Ho Lee, Won-Goo Lee, Hwa-Mook Yoon, Sung-Ho Shin, and Jae-Jeol Ryou, "Comparison and Analysis of Science and Technology Journal Metadata", Vol. 11, No. 9, pp. 515-523, 2011.
  7. Priscilla Caplan, and Dong-Geun Oh, "The Understanding of Metadata", Tae-il publishers, 2004.
  8. Kyung-Ho Lee, and Jun-Seoung Lee, "XMLSchema Matching based on Ontology Renewal for XML Document Conversation", KIISE Journal, Vol. 33, No. 7, pp. 727-740, 2006.
  9. Md. Sumon Shahriar, and Jixue Liu, "Constraint-Based Data Transformation for Integration: An Information System Approach", International Journal of Database Theory and Application, Vol. 3, No. 1, 2010.
  10. Jae-Hong Kim, and Sang-Jo Lee, "An Algorithm for Ontology Merging and Alignment using Local and Global Semantic Set", IEEK Journal, Vol. 41, No. 4, pp. 23-30, 2004.
  11. Tae-jin Ha, "Realization of XSLT-based Schema Mapper using RCP", thesis paper, Seoul National University of Science and Technology, 2009.
  12. Turner, Steven, "The HEP test for grading web site usability, Computers in Libraries", Complete coverage of library information technology, Vol.10, pp.37-39, 2002.
  13. Ho-Wan Gwak, Ji-Eun Gwak, Su-jin Kim, and Jeong-Mo Lee. "Usability Survey on Domestic Website Design: Questionnaire Survey and Heuristic Evaluation", Cognitive Science, Vol.1, pp.33-45, 2000.

피인용 문헌

  1. '바다지도' 데이터 입력 모듈 설계 및 구현 vol.19, pp.2, 2012, https://doi.org/10.9708/jksci.2014.19.2.091