• Title/Summary/Keyword: research data repository


Global Data Repository Status and Analysis: Based on Korea, China and Japan Data in re3data.org

  • Kim, Suntae
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.8 no.1
    • /
    • pp.79-89
    • /
    • 2018
  • We collected and analyzed data from re3data.org, a global registry of data repository services. We analyzed the data profiles of three leading Asian economies (Korea, China, and Japan) against the reference data for other participating countries. In particular, we examined how individual countries contribute to the repository, organizational type, versioning and product quality management, and subject tagging. We conclude that all three Asian countries still fall short in terms of involvement. As for participating institutions, there are 7 from Korea, 64 from China, and 120 from Japan. Among Chinese organizations, 3 are for-profit and 61 are non-profit, and 37 organizations (1.8%) are involved in repository building. In Japan, 1 organization is commercial and 119 are non-profit, of which 57 (3.0%) are involved in repository building. All 7 organizations from Korea are non-profit, and 6 of them (0.3%) are involved in repository building. As regards versioning and product quality management, Korea, China, and Japan are on par with other countries. Subject analysis reveals that Korea contributes more to geosciences and Japan to physics and geosciences, while China, unlike Korea and Japan, is more active in life sciences. It is hoped that this study will help in planning domestic infrastructure for research data repositories with proper consideration for specific research domains and national characteristics.

Functional Requirements of Data Repository for DMP Support and CoreTrustSeal Authentication

  • Kim, Sun-Tae
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.10 no.1
    • /
    • pp.7-20
    • /
    • 2020
  • For research data to be shared without legal, financial, and technical barriers in the Open Science era, data repositories must meet the functional requirements set by DMP and CoreTrustSeal. To derive functional requirements for the data repository, this study analyzed the Data Management Plan (DMP) and CoreTrustSeal, the criteria for certification of research data repositories. Deposit, Ethics, License, Discovery, Identification, Reuse, Security, Preservation, Accessibility, Availability, and (Meta) Data Quality, commonly required by DMP and CoreTrustSeal, were derived as functional requirements that should be implemented first when building data repositories. Confidentiality, Integrity, Reliability, Archiving, Technical Infrastructure, Documented Storage Procedure, Organizational Infrastructure, (Meta) Data Evaluation, and Policy functions were further derived from CoreTrustSeal. The functional requirements derived in this study may serve as key functions when developing a repository. They could also be used as key items for introducing repository functions to researchers who deposit data.

Analysis of the Current Status of Data Repositories in the Field of Ecological Research

  • Kim, Suntae
    • Proceedings of the National Institute of Ecology of the Republic of Korea
    • /
    • v.2 no.2
    • /
    • pp.139-143
    • /
    • 2021
  • In this study, data repository information registered in re3data (re3data.org), a research data registry, was collected. Based on the collected data, the current status was analyzed for 354 repositories (approximately 14% of total repositories) identified using keywords in the ecological field suggested by two experts. Major metadata formats used to describe data in ecological research data repositories include Federal Geographic Data Committee Content Standard for Digital Geospatial Metadata (FGDC/CSDGM), Dublin Core, ISO 19115, Ecological Metadata Language (EML), Directory Interchange Format (DIF), Darwin Core, Data Documentation Initiative (DDI), and DataCite Metadata Schema. The number of ecological repositories by country is 102 in the US, 34 in Germany, 31 in Canada, and one in Korea. A total of 771 non-profit organizations and 12 for-profit organizations are involved in the construction of ecological research data repositories. The data version control ratio of the ecological research data repositories registered in re3data was found to be somewhat higher (86.6%) than the overall ratio (83.9%). Results of this study can be used to establish policies to build and operate a research data repository in the ecological field.

Functional Requirements for Research Data Repositories

  • Kim, Suntae
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.8 no.1
    • /
    • pp.25-36
    • /
    • 2018
  • Research data must be testable. Science is all about verification and testing. To make data testable, tools used to produce, collect, and examine data during the research must be available. Quite often, however, these data become inaccessible once the work is over and the results are published. Hence, information and the related context must be provided on how research data are preserved and how they can be reproduced. Open Science is the international movement for making scientific research data properly accessible to the research community. One of its major goals is building data repositories to foster wide dissemination of open data. The objectives of this research are to examine the features of research data, common repository platforms, and community requests for the purpose of designing functional requirements for research data repositories. To analyze the features of the research data, we use data curation profiles available from the Data Curation Center of Purdue University, USA. For common repository platforms, we examine Fedora Commons, iRODS, DataONE, Dataverse, Open Science Data Cloud (OSDC), and Figshare. We also analyze the requests from the research community. To design a technical solution that would meet public needs for data accessibility and sharing, we take the requirements of the RDA Repository Interest Group and the requests for the DataNest Community Platform developed by the Korea Institute of Science and Technology Information (KISTI). As a result, we specify 75 requirement items grouped into 13 categories (metadata; identifiers; authentication and permission management; data access; policy support; publication; submission/ingest/management; data configuration; location; integration; preservation and sustainability; user interface; data and product quality). We hope that the functional requirements set down in this study will be of help to organizations that consider deploying or designing data repositories.

A Study on Strategies to Promote the Activation of Institutional Research Data Repositories in the Field of Science and Technology (과학기술분야 기관 연구데이터 리포지터리 운영 활성화 방안 연구)

  • Ye Hyeon Kim;Jihyun Kim
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.34 no.3
    • /
    • pp.109-134
    • /
    • 2023
  • The purpose of this study is to identify the current status of institutional research data repositories in the field of science and technology and to suggest ways to activate them. The study conducted literature research, case analysis, and interviews with repository managers both domestically and internationally. The study suggested strategies with a focus on establishing repository regulations and policies, improving awareness of research data sharing, and enhancing research data quality management. First, in terms of establishing repository regulations and policies, it was considered necessary to raise the status of the National R&D Information Processing Standards, a regulation related to research data, and to clarify the regulatory basis for repositories. Second, to enhance awareness of research data sharing, the need for comprehensive research data education and the identification of exemplary cases were suggested. Third, in terms of strengthening research data quality management, the need to prepare for interaction among researchers, persons in charge, and committees, as well as for standardization work and long-term preservation, was suggested.

Research data repository requirements: A case study from universities in North Macedonia

  • Fidan Limani;Arben Hajra;Mexhid Ferati;Vladimir Radevski
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.13 no.1
    • /
    • pp.75-100
    • /
    • 2023
  • With research data generation on the rise, Institutional Repositories (IR) are one of the tools to manage it. However, the variety of data practices across institutions, domains, communities, etc., often requires dedicated studies in order to identify the research data management (RDM) requirements and map them to the IR features that support them. In this study, we investigated the data practices of a few national universities in North Macedonia, including 110 participants from different departments. The methodology we adopted to this end enabled us to derive some of the key RDM requirements for a variety of data-related activities. Finally, we mapped these requirements to 6 features that our participants asked for in an IR solution: (1) create (meta)data and documentation, (2) distribute, share, and promote data, (3) provide access control, (4) store, (5) backup, and (6) archive. This list of IR features could prove useful for any university that has not yet established an IR solution.

An Enterprise Repository System: Architecture and ERP Repository Case (기업 리파지토리 시스템 : 아키텍쳐 및 ERP 리파지토리 사례)

  • 이희석;서우종;김태훈;이충석;손명호;백종명;손주찬;박성진
    • The Journal of Information Technology and Database
    • /
    • v.7 no.1
    • /
    • pp.1-15
    • /
    • 2000
  • A repository has been conceived as a critical weapon for managing organizational information resources. The system can help control the heterogeneous data in a variety of CASE (Computer-Aided Software Engineering) tools. However, current repository systems have limitations in creating a synergetic effect by integrating information resources. Therefore, it is important to develop an integrative repository system, called an Enterprise Repository System (ERS). This paper (i) defines ERS on the basis of a framework for repository systems, and (ii) suggests an ERS architecture and its detailed components. Finally, a real-life case of developing an ERP repository system is illustrated according to the proposed architecture and components. This illustration may demonstrate the usefulness of this research for developing an advanced repository system.

Registry Metadata Quality Assessment by the Example of re3data.org Schema

  • Kim, Suntae;Choi, Myung-Seok
    • International Journal of Knowledge Content Development & Technology
    • /
    • v.7 no.2
    • /
    • pp.41-51
    • /
    • 2017
  • Nowadays, research data repositories (RDR) have become increasingly widespread all over the world. To expand repository services and build up an inbound linking strategy, organizations list their repositories with so-called Global Registries. Accordingly, such registries should be carefully described by the related data. In this study, I explore the metadata schema of re3data.org. I collect and analyze descriptions from the listed repositories, and come up with some suggestions concerning possible improvements to the metadata schema. To accomplish this, I develop a crawler program, which collects the necessary data from re3data.org. Based on the analysis results, I have identified two issues in which required elements are missing, one issue in which a required element value is missing when the corresponding property is applied, five inconsistency issues with the re3data controlled vocabulary, six issues with undescribed optional elements, and two inconsistency issues between elements and their attributes, which do not pair with each other. I believe this discussion can facilitate improvements to the existing re3data.org schema and further help researchers who analyze data repository trends.

Analysis of Ecological Data Repository Operation Status and EcoBank Service Proposal (생태 분야 데이터 리포지터리 운영 현황 분석 및 EcoBank 서비스 제안)

  • Juseop Kim;Hyosuk Kang;Suntae Kim
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.57 no.4
    • /
    • pp.289-310
    • /
    • 2023
  • Sharing and reusing data has become essential, and data repositories are a key tool for doing so. The purpose of this study is to propose services for EcoBank, which is being built and operated by the National Institute of Ecology. To achieve this purpose, 10 out of 123 foreign data repositories in the field of ecology registered on re3data.org were selected, investigated, and analyzed. The analysis identified three services that the repositories have in common: first, research data policy; second, research data quality review; and third, research data management training and workshops. In addition, in order to share EcoBank's data globally, it is necessary to register with a data repository registry such as re3data.org, and it is proposed that certification be pursued to ensure the reliability and quality of the repository.

The Research for Ontology Repository Management (온톨로지 저장소 관리에 관한 연구)

  • Lee D.H.;Yang J.J.
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2005.10a
    • /
    • pp.124-127
    • /
    • 2005
  • The increased use of ontologies for knowledge sharing is emerging in many applications where knowledge applicability plays a critical role. This trend calls for an infrastructure that allows management tools, such as ontology editors, storage, integration, and inference engines, to use ontologies more easily toward comprehensive ontology-based solutions. We call such an infrastructure an ontology repository. This paper designed an ontology repository for scalable ontology data