• Title/Summary/Keyword: Data Repository

Search Result 423, Processing Time 0.029 seconds

Global Data Repository Status and Analysis: Based on Korea, China and Japan Data in re3data.org

  • Kim, Suntae
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.8 no.1
    • /
    • pp.79-89
    • /
    • 2018
  • We collected and analyzed data from e3data.org, which is a global registry of data repository services. We analyzed data profile for three leading Asian economies-Korea, China, and Japan-against the reference data for other participating countries. In particular, we examined how individual countries contribute to the repository, organizational type, versioning and product quality management, and subject tagging. We come to the conclusion that all three Asian countries still fall short in terms of involvement. As for participating institutions, there are 7 from Korea, 64 from China, and 120 from Japan. Among Chinese organizations, 3 are profit, 61 non-profit, and 37 organizations (which yields 1.8%) are involved in repository building. In Japan, there is 1 is commercial and 119 non-profit organizations, of which 57 (3.0%) are involved in repository building. All 7 organizations from Korea are non-profit, and 6 of them (0.3%) are involved in repository building. As regards versioning and product quality management, Korea, China, and Japan are up to par with other countries. Subject analysis reveals that Korea contributes more to geosciences, Japan to physics and geosciences, while China, unlike Korea and Japan, is more active in life sciences. It is hoped that this study will help planning domestic infrastructure for research data repositories with proper consideration for specific research domains and national characteristics.

Functional Requirements of Data Repository for DMP Support and CoreTrustSeal Authentication

  • Kim, Sun-Tae
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.10 no.1
    • /
    • pp.7-20
    • /
    • 2020
  • For research data to be shared without legal, financial and technical barriers in the Open Science era, data repositories must have the functional requirements asked by DMP and CoreTrustSeal. In order to derive functional requirements for the data repository, this study analyzed the Data Management Plan (DMP) and CoreTrustSeal, the criteria for certification of research data repositories. Deposit, Ethics, License, Discovery, Identification, Reuse, Security, Preservation, Accessibility, Availability, and (Meta) Data Quality, commonly required by DMP and CoreTrustSeal, were derived as functional requirements that should be implemented first in implementing data repositories. Confidentiality, Integrity, Reliability, Archiving, Technical Infrastructure, Documented Storage Procedure, Organizational Infrastructure, (Meta) Data Evaluation, and Policy functions were further derived from CoreTrustSeal. The functional requirements of the data repository derived from this study may be required as a key function when developing the repository. It is also believed that it could be used as a key item to introduce repository functions to researchers for depositing data.

Comparative Analysis of Centralized Vs. Distributed Locality-based Repository over IoT-Enabled Big Data in Smart Grid Environment

  • Siddiqui, Isma Farah;Abbas, Asad;Lee, Scott Uk-Jin
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2017.01a
    • /
    • pp.75-78
    • /
    • 2017
  • This paper compares operational and network analysis of centralized and distributed repository for big data solutions in the IoT enabled Smart Grid environment. The comparative analysis clearly depicts that centralize repository consumes less memory consumption while distributed locality-based repository reduce network complexity issues than centralize repository in state-of-the-art Big Data Solution.

  • PDF

Analysis of the Current Status of Data Repositories in the Field of Ecological Research

  • Kim, Suntae
    • Proceedings of the National Institute of Ecology of the Republic of Korea
    • /
    • v.2 no.2
    • /
    • pp.139-143
    • /
    • 2021
  • In this study, data repository information registered in re3data (re3data.org), a research data registry, was collected. Based on collected data, the current status was analyzed for 354 repositories (approximately 14% of total repositories) in the field using keywords in the ecological field suggested by two experts. Major metadata formats used to describe data in ecological research data repositories include Federal Geographic Data Committee Content Standard for Digital Geospatial Metadata (FGDC/CSDGM), Dublin Core, ISO 19115, Ecological Metadata Language (EML), Directory Interchange Format (DIF), Darwin Core, Data Documentation Initiative (DDI), and DataCite Metadata Schema. The number of ecological repositories according to country is 102 in the US, 34 in Germany, 31 in Canada, and one in Korea. A total of 771 non-profit organizations and 12 for-profit organizations are involved in the construction of the ecological field research data repository. Data version control ratio of the ecological field research data repositories registered in re3data was analyzed to be somewhat higher (86.6%) than the total ratio (83.9%). Results of this study can be used to establish policies to build and operate a research data repository in the ecological field.

An Enterprise Repository System : Architecture and ERP Repositiory Case (기업 리파지토리 시스템 : 아키텍쳐 및 ERP 리파지토리 사례)

  • 이희석;서우종;김태훈;이충석;손명호;백종명;손주찬;박성진
    • The Journal of Information Technology and Database
    • /
    • v.7 no.1
    • /
    • pp.1-15
    • /
    • 2000
  • A repository has been conceived as a critical weapon for managing organizational information resources. The system can help control the heterogeneous data in a variety of CASE (Computer-Aided Software Engineering) tools. However, current repository systems have limitation in creating a synergetic effect by integrating information resources. Therefore, it is important to develop an integrative repository system, called Enterprise Repository System (ERS). This paper (i) defines ERS on the basis of a framework for repository systems, and (ii) suggests an ERS architecture and its detailed components. Finally, a real-life case of developing ERP repository system is illustrated according to the proposed architecture and components. This illustration may demonstrate the usefulness of this research for help developing an advanced repository system.

  • PDF

Functional Requirements for Research Data Repositories

  • Kim, Suntae
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.8 no.1
    • /
    • pp.25-36
    • /
    • 2018
  • Research data must be testable. Science is all about verification and testing. To make data testable, tools used to produce, collect, and examine data during the research must be available. Quite often, however, these data become inaccessible once the work is over and the results being published. Hence, information and the related context must be provided on how research data are preserved and how they can be reproduced. Open Science is the international movement for making scientific research data properly accessible for research community. One of its major goals is building data repositories to foster wide dissemination of open data. The objectives of this research are to examine the features of research data, common repository platforms, and community requests for the purpose of designing functional requirements for research data repositories. To analyze the features of the research data, we use data curation profiles available from the Data Curation Center of the Purdue University, USA. For common repository platforms we examine Fedora Commons, iRODS, DataONE, Dataverse, Open Science Data Cloud (OSDC), and Figshare. We also analyze the requests from research community. To design a technical solution that would meet public needs for data accessibility and sharing, we take the requirements of RDA Repository Interest Group and the requests for the DataNest Community Platform developed by the Korea Institute of Science and Technology Information (KISTI). As a result, we particularize 75 requirement items grouped into 13 categories (metadata; identifiers; authentication and permission management; data access, policy support; publication; submission/ingest/management, data configuration, location; integration, preservation and sustainability, user interface; data and product quality). We hope that functional requirements set down in this study will be of help to organizations that consider deploying or designing data repositories.

A Study on Strategies to Promote the Activation of Institutional Research Data Repositories in the Field of Science and Technology (과학기술분야 기관 연구데이터 리포지터리 운영 활성화 방안 연구)

  • Ye Hyeon Kim;Jihyun Kim
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.34 no.3
    • /
    • pp.109-134
    • /
    • 2023
  • The purpose of this study is to identify the current status of institutional research data repositories in the field of science and technology and to suggest ways to activate them. The study conducted literature research, case analysis, and interviews with repository managers both domestically and internationally. The study suggested strategies with a focus on establishing repository regulations and policies, improving awareness of research data sharing, and enhancing research data quality management. First, in terms of repository regulations and policy establishment, it was considered necessary to promote the status of the National R&D Information Processing Standards, a regulation related to research data, and clarify repository basis regulations. Second, to enhance awareness of research data sharing, the need for comprehensive research data education and the identification of exemplary cases were suggested. Third, in terms of strengthening research data quality management, the need for preparation for interaction between researchers-persons in charge-committees, standardization work, and long-term preservation was suggested.

An Integrated Repository System with the Change Detection Functionality for XML Documents (XML 문서 변경 탐지 기능을 갖는 통합 리파지토리 시스템)

  • Park, Seong-Jin
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.10 no.10
    • /
    • pp.2696-2707
    • /
    • 2009
  • Although, a number of DBMS vendors are scrambling to extend their products to handle XML, there is a need for a lightweight, DBMS and platform-independent XML repository as well. In this paper, we describe such an XML integrated repository system, that solves the following functions : generating relational schema from XML DTDs for storage of XML documents, importing data from XML documents into relational tables, creating XML documents according to a XMLQL(XML Query Language) from data extracted from a database and synchronizing the replicated XML documents. In the XML repository systems, the efficient change detection techniques for XML documents is required to maintain the consistency of replicated XML data because the same data in the repository can be replicated between so many different XML documents. In this paper, we propose a message digest based change detection technique to maintain the consistency of replicated data between client XML documents and a XML data in XML repository systems.

A Study on the Operation of a Collaborative Repository of the Regional Central Library: Focused on the Busan Metropolitan Library (지역대표도서관 공동보존서고 운영에 관한 연구 - 부산도서관을 중심으로 -)

  • Kang, Eun-Yeong
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.33 no.3
    • /
    • pp.55-76
    • /
    • 2022
  • The 3rd Library Development Plan raises the need to secure space through the establishment of a regional repository library as the issue of holding books is highlighted as a common problem in public libraries. The Korean Library Law Act also impose the responsibility of integrated management of local library materials on the regional representative library. Accordingly, this study aimed at Busan Metropolitan Library, which is operating a collaborative repository in earnest among regional representative libraries, and investigated the operation status of the collaborative repository and the perception of public librarians about the collaborative repository. The data necessary for the study were obtained through surveys, interviews, field surveys, and internal data analysis. Through this, the purpose of this study was to provide basic data that will help the Busan Metropolitan Library to operate the collaborative repository efficiently in the future, and at the same time, to present basic data that can be used as a reference for the operation of the collaborative repository of the representative libraries of other regions.

Database Modeling and Environmental Information for a Radioactive Waste Repository Site

  • Park S. M.;Rhee C. G.;Park J. B.;Lee H. J.;Kim Chang Lak
    • Nuclear Engineering and Technology
    • /
    • v.36 no.3
    • /
    • pp.263-275
    • /
    • 2004
  • For the safe management of nuclear facilities, including a radioactive waste repository, data about the facility site and the surrounding environment must be collected and managed systematically. This is particularly true for a radwaste repository, which has to be institutionally controlled for a long period after closure. The objectives of this study are (1) to establish a systematical management plan for information about a radwaste repository site and its environment, and (2) to design a database management program for this information, based on the Relative Database Management System (RDBMS). The spatial data are designed by the geodatabase, which is a new object, based on the RDBMS, to manage spatial information related to the database. To meet this requirement, a new program called 'Site Information and Total Environmental data management System (SITES)' is being developed. The scope that produced from the first step of the present study for development of the SITES is introduced. The database is designed to combine spatial and attribute data, and is designed for the establishment of the Geographic Information System (GIS). The hardware and software systems are designed with consideration given to the total data management of the items within the radioactive environment.