• Title/Summary/Keyword: Data Archive

Search Result 301, Processing Time 0.026 seconds

Massive Electronic Record Management System using iRODS (iRODS를 이용한 대용량 전자기록물 관리 시스템)

  • Han, Yong-Koo;Kim, Jin-Seung;Lee, Seung-Hyun;Lee, Young-Koo
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.8
    • /
    • pp.825-836
    • /
    • 2010
  • The advancement of electronic records brought great changes of the records management system. One of the biggest changes is the transition from passive to automatic management system, which manages massive records more efficiently. The integrated Rule-Oriented Data System (iRODS) is a rule-oriented grid system S/W which provides an infrastructure for building massive archive through virtualization. It also allows to define rules for data distribution and back-up. Therefore, iRODS is an ideal tool to build an electronic record management system that manages electronic records automatically. In this paper we describe the issues related to design and implementation of the electronic record management system using iRODS. We also propose a system that serves automatic processing of distribution and back-up of records according to their types by defining iRODS rules. It also provides functions to store and retrieve metadata using iRODS Catalog (iCAT) Database.

Research on Analytical Technique for Satellite Observstion of the Arctic Sea Ice (극지 해빙 위성관측을 위한 분석 기술 개발)

  • Kim, Hyun-cheol;Han, Hyangsun;Hyun, Chang-Uk;Chi, Junhwa;Son, Young-sun;Lee, Sungjae
    • Korean Journal of Remote Sensing
    • /
    • v.34 no.6_2
    • /
    • pp.1283-1298
    • /
    • 2018
  • KOPRI(Korea Polar Research Institute) have researhed Arctic sea ice by using satellite remote sensing data since 2017 as a mission of KOPRI. The title of the reseach is "Development of Satellite Observation and Analysis for Arctc sea-ice". This project has three major aims; 1) development of prototype satellite data archive/manage system for Arctic sea ice monitoring, 2) development of sea ice remote sensing data processing and analysis technique, and 3) development of international satellite observing network for Arcitc. This reseach will give us that 1) deveolpment of sea ice observing system for northern sea route, 2) development of optimal remote sensing data processing technique for sea ice and selected satelite sensors, 3) development of international satellite onbservation network. I hope that this letter of introducton KOPRI satellite program for Arctic will help to understand Arctic remote sensing and will introduce you to step into the Arctic remote sensing, which Iis like a blue ocean of remote sensing.

A Knowledge Graph on Japanese "Comfort Women": Interlinking Fragmented Digital Archival Resources (일본군 '위안부' 지식그래프: 파편화된 디지털 기록의 연결)

  • Park, Haram;Kim, Haklae
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.21 no.3
    • /
    • pp.61-78
    • /
    • 2021
  • Records on Japanese "Comfort Women" have been individually managed by private sectors or institutions, and some are provided as digital archives on the Internet. However, records of digital archives differ in the composition and representation of metadata by individual institutions. Meanwhile, there is a lack of a consistent structure to describe the relationships between and among these records, leading to their fragmentation and disconnectedness. This paper proposes a knowledge model for interlinking the digital archival resources and builds a knowledge graph by integrating the records from distributed digital archives. It derives common elements by analyzing metadata from the diverse digital archives and expresses them in standard vocabularies to semantically describe multiple entities and relationships of the digital archival resources. In particular, the study includes the refinement of collected data to search and thread dispersed records and the enrichment of external data to provide significant contextual information of records. An evaluation of the knowledge graph is performed via a query measuring the (dis)connectivity between the distributed records. As a result, the knowledge graph is capable of interlinking and retrieving fragmented records, providing substantial contextual information on the records with external data enrichment, and searching accurately to match the user's intentions through semantic-based queries.

Research Trends in Record Management Using Unstructured Text Data Analysis (비정형 텍스트 데이터 분석을 활용한 기록관리 분야 연구동향)

  • Deokyong Hong;Junseok Heo
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.23 no.4
    • /
    • pp.73-89
    • /
    • 2023
  • This study aims to analyze the frequency of keywords used in Korean abstracts, which are unstructured text data in the domestic record management research field, using text mining techniques to identify domestic record management research trends through distance analysis between keywords. To this end, 1,157 keywords of 77,578 journals were visualized by extracting 1,157 articles from 7 journal types (28 types) searched by major category (complex study) and middle category (literature informatics) from the institutional statistics (registered site, candidate site) of the Korean Citation Index (KCI). Analysis of t-Distributed Stochastic Neighbor Embedding (t-SNE) and Scattertext using Word2vec was performed. As a result of the analysis, first, it was confirmed that keywords such as "record management" (889 times), "analysis" (888 times), "archive" (742 times), "record" (562 times), and "utilization" (449 times) were treated as significant topics by researchers. Second, Word2vec analysis generated vector representations between keywords, and similarity distances were investigated and visualized using t-SNE and Scattertext. In the visualization results, the research area for record management was divided into two groups, with keywords such as "archiving," "national record management," "standardization," "official documents," and "record management systems" occurring frequently in the first group (past). On the other hand, keywords such as "community," "data," "record information service," "online," and "digital archives" in the second group (current) were garnering substantial focus.

Application and Analysis of Ocean Remote-Sensing Reflectance Quality Assurance Algorithm for GOCI-II (천리안해양위성 2호(GOCI-II) 원격반사도 품질 검증 시스템 적용 및 결과)

  • Sujung Bae;Eunkyung Lee;Jianwei Wei;Kyeong-sang Lee;Minsang Kim;Jong-kuk Choi;Jae Hyun Ahn
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.6_2
    • /
    • pp.1565-1576
    • /
    • 2023
  • An atmospheric correction algorithm based on the radiative transfer model is required to obtain remote-sensing reflectance (Rrs) from the Geostationary Ocean Color Imager-II (GOCI-II) observed at the top-of-atmosphere. This Rrs derived from the atmospheric correction is utilized to estimate various marine environmental parameters such as chlorophyll-a concentration, total suspended materials concentration, and absorption of dissolved organic matter. Therefore, an atmospheric correction is a fundamental algorithm as it significantly impacts the reliability of all other color products. However, in clear waters, for example, atmospheric path radiance exceeds more than ten times higher than the water-leaving radiance in the blue wavelengths. This implies atmospheric correction is a highly error-sensitive process with a 1% error in estimating atmospheric radiance in the atmospheric correction process can cause more than 10% errors. Therefore, the quality assessment of Rrs after the atmospheric correction is essential for ensuring reliable ocean environment analysis using ocean color satellite data. In this study, a Quality Assurance (QA) algorithm based on in-situ Rrs data, which has been archived into a database using Sea-viewing Wide Field-of-view Sensor (SeaWiFS) Bio-optical Archive and Storage System (SeaBASS), was applied and modified to consider the different spectral characteristics of GOCI-II. This method is officially employed in the National Oceanic and Atmospheric Administration (NOAA)'s ocean color satellite data processing system. It provides quality analysis scores for Rrs ranging from 0 to 1 and classifies the water types into 23 categories. When the QA algorithm is applied to the initial phase of GOCI-II data with less calibration, it shows the highest frequency at a relatively low score of 0.625. However, when the algorithm is applied to the improved GOCI-II atmospheric correction results with updated calibrations, it shows the highest frequency at a higher score of 0.875 compared to the previous results. The water types analysis using the QA algorithm indicated that parts of the East Sea, South Sea, and the Northwest Pacific Ocean are primarily characterized as relatively clear case-I waters, while the coastal areas of the Yellow Sea and the East China Sea are mainly classified as highly turbid case-II waters. We expect that the QA algorithm will support GOCI-II users in terms of not only statistically identifying Rrs resulted with significant errors but also more reliable calibration with quality assured data. The algorithm will be included in the level-2 flag data provided with GOCI-II atmospheric correction.

A Study on Designing the Metadata for Integrated Management of Individually Managed Presidential Records (개별관리 대통령기록물의 연계관리를 위한 통합 메타데이터 설계 방안 연구)

  • Cho, Hyun-Yang;Jang, Bo-Seong
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.47 no.1
    • /
    • pp.105-124
    • /
    • 2013
  • Metadata standardization of resources, having a heterogeneous metadata structure for each presidential archive and presidential library and museum is preferentially required for utilizing and sharing presidential records. An integrated operation model of metadata to manage various types of presidential records is then needed. The purpose of this study is to create a design principle of integrated metadata, and to suggest relationships and attributes of metadata, needed for developing integrated metadata operation system on presidential records. The design principle includes "creation of relationship among presidential records", "design of each entity, applicable multiple entity data model", "design to describe various types of presidential records", "design to reflect lifelong management on records of holding institutes", and "designing hybrid metadata for long term preservation". Metadata element set consists of elements for common attributes with all types of presidential records for a unique attribute for a specific presidential record and for reference information among different records related to the production of presidential records.

Development of SNP marker set for marker-assisted backcrossing (MABC) in cultivating tomato varieties

  • Park, GiRim;Jang, Hyun A;Jo, Sung-Hwan;Park, Younghoon;Oh, Sang-Keun;Nam, Moon
    • Korean Journal of Agricultural Science
    • /
    • v.45 no.3
    • /
    • pp.385-400
    • /
    • 2018
  • Marker-assisted backcrossing (MABC) is useful for selecting offspring with a highly recovered genetic background for a recurrent parent at early generation unlike rice and other field crops. Molecular marker sets applicable to practical MABC are scarce in vegetable crops including tomatoes. In this study, we used the National Center for Biotechnology Information- short read archive (NCBI-SRA) database that provided the whole genome sequences of 234 tomato accessions and selected 27,680 tag-single nucleotide polymorphisms (tag-SNPs) that can identify haplotypes in the tomato genome. From this SNP dataset, a total of 143 tag-SNPs that have a high polymorphism information content (PIC) value (> 0.3) and are physically evenly distributed on each chromosome were selected as a MABC marker set. This marker set was tested for its polymorphism in each pairwise cross combination constructed with 124 of the 234 tomato accessions, and a relatively high number of SNP markers polymorphic for the cross combination was observed. The reliability of the MABC SNP set was assessed by converting 18 SNPs into Luna probe-based high-resolution melting (HRM) markers and genotyping nine tomato accessions. The results show that the SNP information and HRM marker genotype matched in 98.6% of the experiment data points, indicating that our sequence analysis pipeline for SNP mining worked successfully. The tag-SNP set for the MABC developed in this study can be useful for not only a practical backcrossing program but also for cultivar identification and F1 seed purity test in tomatoes.

Study on the Spatial Standard for Reading Rooms in University Libraries (대학도서관 열람실 공간기준에 관한 연구)

  • Lim, Ho-Kyun
    • Korean Institute of Interior Design Journal
    • /
    • v.25 no.5
    • /
    • pp.140-147
    • /
    • 2016
  • This research aims to establish the size standard of university library's user space, and present the standard and method to calculate total area required in the planning of new building construction and remodeling. Nine university libraries newly constructed or remodeled since 2000 were selected among the libraries of large scale universities with more than 10,000 enrolled students as the target libraries in this research. The target libraries were classified into A group (five cases partially remodeled) and B group (four cases newly constructed or fully remodeled) on the basis of the change of times. A university library can be divided into three spaces (user space, administration space and public space). This research classified the reading room in the user space into bookshelf zone, reading zone, information/office zone and hall/other zone, and analyzed area ratio according to each zone. B group's bookshelf zone decreased 12% more than A group, and B group's reading zone increased 10% more than A group. However, there was no big change in the area ratio of information/office zone and hall/other zone. This can be interpreted that university library changes from book and archive preservation-oriented space to user-oriented space. This research presented a proper reading room area calculation method, based on the capacity of books, by reflecting such a change. Each zone's standard was set up through classification of domestic and international standards, based on which, the calculation method of university library's total floor area required was presented. The reason why there was difference in university library's total floor area required according to domestic standard and international standard was that the number of enrolled students per seat in the reading room was different. The area calculation methods presented in this research can be utilized as useful data upon planning university library construction or remodeling.

Changes of Morphological and Growth Characteristics Collected Miscanthus Germplasm in Korea (국내 억새 유전자원 수집 후 형태 및 생육 특성 변화)

  • Song, Yeon-Sang;Lee, Ji-Eun;Moon, Youn-Ho;Yu, Gyeong-Dan;Choi, In-Seong;Cha, Young-Lok;Kim, Kwang-Soo
    • Weed & Turfgrass Science
    • /
    • v.7 no.1
    • /
    • pp.22-34
    • /
    • 2018
  • Miscanthus has been considered as the most promising bioenergy crop for lignocellulosic biomass production. In Korea, M. sacchariflorus and M. sinensis can be found easily in all regions. It is a great advantage to utilize as important species with respect to genetic and cross-breeding programs materials for creation of novel hybrids. For successful breeding programs, it is important to precisely understand the variability of morphological and growth characteristics among Miscanthus species as breeding parent materials. In this study, morphological and growth characteristics were observed in 960 germplasms of two Miscanthus species (M. sacchariflorus and M. sinensis) for growing seasons over three years. Due to the inherent characteristics of these species, the germplasm of M. sacchariflorus among the collected germplasm were reduced in plant height than in the collection area. In M. sinensis, the plant height of germplasm collected mainly from Jeju-do increased more than those collected from collection area. Sixty-one of the collected 960 germplasms were selected and investigated to the morphological characteristics. Based on the investigated morphological data, the phylogenic tree was developed. As the results, it was confirmed that there exist germplasm in which the characteristics of M. sacchariflorus and M. sinensis are mixed. This study of Miscanthus may provide an important information in order to expedite the introduction as breeding materials for creation of new hybrid.

Design of Line Scratch Detection and Restoration Algorithm using GPU (GPU를 이용한 선형 스크래치 탐지와 복원 알고리즘의 설계)

  • Lee, Joon-Goo;Shim, She-Yong;You, Byoung-Moon;Hwang, Doo-Sung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.19 no.4
    • /
    • pp.9-16
    • /
    • 2014
  • This paper proposes a linear scratch detection and restoration algorithm using pixel data comparison in a single frame or consecutive frames. There exists a high parallelism in that a scratch detection and restoration algorithm needs a large amount of comparison operations. The proposed scratch detection and restoration algorithm is designed with a GPU for fast computation. We test the proposed algorithm in sequential and parallel processing with the set of digital videos in National Archive of Korea. In the experiments, the scratch detection rate of consecutive frames is as fast as about 20% for that of a single frame. The detection and restoration rates of a GPU-based algorithm are similar to those of a CPU-based algorithm, but the parallel implementation speeds up to about 50 times.