• Title/Summary/Keyword: Standard Metadata

Search Result 303, Processing Time 0.028 seconds

Compression Conversion and Storing of Large RDF datasets based on MapReduce (맵리듀스 기반 대량 RDF 데이터셋 압축 변환 및 저장 방법)

  • Kim, InA;Lee, Kyong-Ha;Lee, Kyu-Chul
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.4
    • /
    • pp.487-494
    • /
    • 2022
  • With the recent demand for analysis using data, the size of the knowledge graph, which is the data to be analyzed, gradually increased, reaching about 82 billion edges when extracted from the web as a knowledge graph. A lot of knowledge graphs are represented in the form of Resource Description Framework (RDF), which is a standard of W3C for representing metadata for web resources. Because of the characteristics of RDF, existing RDF storages have the limitations of processing time overhead when converting and storing large amounts of RDF data. To resolve these limitations, in this paper, we propose a method of compressing and converting large amounts of RDF data into integer IDs using MapReduce, and vertically partitioning and storing them. Our proposed method demonstrated a high performance improvement of up to 25.2 times compared to RDF-3X and up to 3.7 times compared to H2RDF+.

A Study on Data Linkage Between Public Data Portals and Individual Portals (공공데이터 포털과 개별 포털 간의 데이터 연계방안 연구)

  • Jin Ho, Park;Sang Woo, Han
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.33 no.4
    • /
    • pp.249-269
    • /
    • 2022
  • The Public Data Portal(data.go.kr) is a gateway for searching and using public data in South Korea. In 2021, the Ministry of Public Administration and Security established individual portal maintenance plans. Individual portals refer to portals built by public institutions in Korea other than the public data portal. According to the maintenance plan, the Korea Intelligence Information Society, the operator of the public data portal, needs to establish operating and data integration plans to link the public data portal and individual portals. In this study, we investigated the current operating status and data integration methods of the public data portal in South Korea, the United States, the United Kingdom, and France, and proposed that the adoption of a top-down approach is efficient when integrating data. In addition, we divided the specific procedures that should be pursued when integrating data into five stages: determination of data integration standard methods, analysis of metadata status, expansion of operating infrastructure, confirmation of data import, and launch of services.

A Study on Gathering & Connecting Online Reference Resources for Improving the Quality of Online Knowledge Service (온라인지식정보서비스 품질 향상을 위한 온라인지식정보원 확보 및 연계전략에 관한 연구)

  • Noh, Young-Hee
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.20 no.2
    • /
    • pp.17-30
    • /
    • 2009
  • This study is to improve the quality of online reference service which is serviced in the world. For this it suggests the strategic methods for collecting and connecting of internet information resources. It makes information service circumstance for professional librarians to retrieve and service. To achieve the purpose, this study have drawn implications by analysing the theoretical discussions and the cases of online reference resources system. This study suggests some strategies as follows; 1) Collaborative constructing and using of online reference resources, 2) Developing classification scheme and subdividing the subjects of it, 3) Developing the standard format of the data as like metadata, 4) Developing the guidelines to select the proper resources from many internet information resources, 5) Subdividing the 'knowledge DB' by subjects, 6) Connecting as much as possible the DBs as like the National DB and open access resources.

An Efficient Query-based XML Access Control Enforcement Mechanism (효율적인 질의 기반 XML 접근제어 수행 메커니즘)

  • Byun, Chang-Woo;Park, Seog
    • Journal of KIISE:Databases
    • /
    • v.34 no.1
    • /
    • pp.1-17
    • /
    • 2007
  • As XML is becoming a de facto standard for distribution and sharing of information, the need for an efficient yet secure access of XML data has become very important. To enforce the fine-level granularity requirement, authorization models for regulating access to XML documents use XPath which is a standard for specifying parts of XML data and a suitable language for both query processing. An access control environment for XML documents and some techniques to deal with authorization priorities and conflict resolution issues are proposed. Despite this, relatively little work has been done to enforce access controls particularly for XML databases in the case of query access. Developing an efficient mechanism for XML databases to control query-based access is therefore the central theme of this paper. This work is a proposal for an efficient yet secure XML access control system. The basic idea utilized is that a user query interaction with only necessary access control rules is modified to an alternative form which is guaranteed to have no access violations using tree-aware metadata of XML schemes and set operators supported by XPath 2.0. The scheme can be applied to any XML database management system and has several advantages over other suggested schemes. These include implementation easiness, small execution time overhead, fine-grained controls, and safe and correct query modification. The experimental results clearly demonstrate the efficiency of the approach.

A Study on the Development of the National Assembly Archives and Records Integrated Management System (국회기록정보 통합관리시스템 개발 방향에 관한 연구)

  • Kim, Jang-Hwan;Lee, Eun Byol
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.15 no.2
    • /
    • pp.103-136
    • /
    • 2015
  • The National Assembly Archives of the Republic of Korea has been using the National Assembly Archives and Records Management System, which added some archival function to the Standard Records Management System that they had previously developed. However, the Standard Records Management System has limits in order to reflect all the business functions of the National Assembly Archives, which also acts as an archival institution, because the system had been developed and distributed to perform the function of a records center. Moreover, the current National Assembly Archives and Records Management System focuses on the management of official records transferred in accordance with the regulations. For this reason, it is difficult to register and manage various record types such as records of the members of the National Assembly (related to legislative activities), oral history collected from the National Assembly leader, audiovisual records of proceedings, and so on. As such, this study analyzed the problems of the current National Assembly Archives and Records Management System and conducted case studies of the systems in the National Archives, the Presidential Archives, Changwon City, and the Cultural Heritage Administration. Through this research, it proposed that system functions, metadata, the target system of the National Assembly Archives, and the Records Integrated Management System need a development plan.

Improving Archival Descriptive Standard Based on the Analysis of the Reviews by Archival Communities on RiC-CM Draft (RiC에 대한 기록공동체의 리뷰를 통해 본 기록물 기술표준 개선을 위한 제안)

  • Park, Ziyoung
    • The Korean Journal of Archival Studies
    • /
    • no.54
    • /
    • pp.81-109
    • /
    • 2017
  • This study suggests an analysis of the reviews provided by international archival professionals on the RiC-CM draft published by ICA EGAD. Some implications for the Korean archival management environment were also suggested. Some professional reviews were accessible through the internet. Italian archival professionals held workshops at various levels for the analysis and discussion of the draft. Duranti, the project director of InterPARES, also gave opinions about the draft in cluding the perspective of digital preservation. In the review of Artefactual, the draft was discussed in terms of system implementation. Reed, the director of Recordkeeping Innovation, also gave a feedback based on the record management experiences in Australia. Some implications can be suggested based on these professional opinions. First, we should try to build a test bed for the adoption of RiC to archival description in the Korean environment. Second, a minimum level of data elements that can secure authenticity and integrity will also be needed. Third and lastly, rich authority data for agents and functions related to archival records and records groups are essential to take full advantage of the standard.

A Study on the Establishment Case of Technical Standard for Electronic Record Information Package (전자문서 정보패키지 구축 사례 연구 - '공인전자문서보관소 전자문서 정보패키지 기술규격 개발 연구'를 중심으로-)

  • Kim, Sung-Kyum
    • The Korean Journal of Archival Studies
    • /
    • no.16
    • /
    • pp.97-146
    • /
    • 2007
  • Those days when people used paper to make up and manage all kinds of documents in the process of their jobs are gone now. Today electronic types of documents have replaced paper. Unlike paper documents, electronic ones contribute to the maximum job efficiency with their convenience in production and storage. But they too have some disadvantages; it's difficult to distinguish originals and copies like paper documents; it's not easy to examine if there is a change or damage to the documents; they are also prone to alteration and damage by the external influences in the electronic environment; and electronic documents require enormous amounts of workforce and costs for immediate measures to be taken according to the changes to the S/W and H/W environment. Despite all those weaknesses, however, electronic documents increasingly account for more percentage in the current job environment thanks to their job convenience and efficiency of production costs. Both the government and private sector have made efforts to come up with plans to maximize their advantages and minimize their risks at the same time. One of the methods is the Authorized Retention Center which is described in the study. There are a couple of prerequisites for its smooth operation; they should guarantee the legal validity of electronic documents in the administrative aspects and first secure the reliability and authenticity of electronic documents in the technological aspects. Responding to those needs, the Ministry of Commerce, Industry and Energy and the Korea Institute for Electronic Commerce, which were the two main bodies to drive the Authorized Retention Center project, revised the Electronic Commerce Act and supplemented the provisions to guarantee the legal validity of electronic documents in 2005 and conducted researches on the ways to preserve electronic documents for a long term and secure their reliability, which had been demanded by the users of the center, in 2006. In an attempt to fulfill those goals of the Authorized Retention Center, this study researched technical standard for electronic record information package of the center and applied the ISO 14721 information package model that's the standard for the long-term preservation of digital data. It also suggested a process to produce and manage information package so that there would be the SIP, AIP and DIP metadata features for the production, preservation, and utilization by users points of electronic documents and they could be implemented according to the center's policies. Based on the previous study, the study introduced the flow charts among the production and progress process, application methods and packages of technical standard for electronic record information package at the center and suggested some issues that should be consistently researched in the field of records management based on the results.

XML Web Services for Learning ContentsBased on a Pedagogical Design Model (교수법적 설계 모델링에 기반한 학습 컨텐츠의 XML 웹 서비스 구축)

  • Shin, Haeng-Ja;Park, Kyung-Hwan
    • Journal of Korea Multimedia Society
    • /
    • v.7 no.8
    • /
    • pp.1131-1144
    • /
    • 2004
  • In this paper, we investigate a problem with an e-learning system for e-business environments and introduce the solving method of the problem. To be more accurate, existing Web-hosted and ASP (Application Service Provider)-oriented service model is difficult to cooperate and integrate among the different kinds of systems. So we have produced sharable and reusable learning object, they have extracted a principle from pedagogical designs for units of reuse. We call LIO (Learning Item Object). This modeling makes use of a constructing for XML Web Services. So to speak, units of reuse from pedagogical designs are test tutorial, resource, case example, simulation, problem, test, discovery and discussion and then map introduction, fact, try, quiz, test, link-more, tell-more LIO learning object. These typed LIOs are stored in metadata along with the information for a content location. Each one of LIOs is designed with components and exposed in an interface for XML Web services. These services are module applications, which are used a standard SOAP (Simple Object Access Protocol) and locate any computer over Internet and publish, find and bind to services. This guarantees the interoperation and integration of the different kinds of systems. As a result, the problem of e-learning systems for e-business environments was resolved and then the power of understanding about learning objects based on pedagogical design was increased for learner and instruction designers. And organizations of education hope for particular decreased costs in constructing e-learning systems.

  • PDF

Research on Text Classification of Research Reports using Korea National Science and Technology Standards Classification Codes (국가 과학기술 표준분류 체계 기반 연구보고서 문서의 자동 분류 연구)

  • Choi, Jong-Yun;Hahn, Hyuk;Jung, Yuchul
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.1
    • /
    • pp.169-177
    • /
    • 2020
  • In South Korea, the results of R&D in science and technology are submitted to the National Science and Technology Information Service (NTIS) in reports that have Korea national science and technology standard classification codes (K-NSCC). However, considering there are more than 2000 sub-categories, it is non-trivial to choose correct classification codes without a clear understanding of the K-NSCC. In addition, there are few cases of automatic document classification research based on the K-NSCC, and there are no training data in the public domain. To the best of our knowledge, this study is the first attempt to build a highly performing K-NSCC classification system based on NTIS report meta-information from the last five years (2013-2017). To this end, about 210 mid-level categories were selected, and we conducted preprocessing considering the characteristics of research report metadata. More specifically, we propose a convolutional neural network (CNN) technique using only task names and keywords, which are the most influential fields. The proposed model is compared with several machine learning methods (e.g., the linear support vector classifier, CNN, gated recurrent unit, etc.) that show good performance in text classification, and that have a performance advantage of 1% to 7% based on a top-three F1 score.

A Study on the Application of Blockchain Technology to the Record Management Model (블록체인기술을 적용한 기록관리 모델 구축 방법 연구)

  • Hong, Deok-Yong
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.19 no.3
    • /
    • pp.223-245
    • /
    • 2019
  • As the foundation for the Fourth Industrial Revolution, blockchain is becoming an essential core infrastructure and technology that creates new growth engines in various industries and is rapidly spreading to the environment of businesses and institutions worldwide. In this study, the characteristics and trends of blockchain technology were investigated and arranged, its application to the records management section of public institutions was required, and the procedures and methods of construction in the records management field of public institutions were studied in literature. Finally, blockchain technology was applied to the records management to propose an archive chain model and describe possible expectations. When the transactions that record the records management process of electronic documents are loaded into the blockchain, all the step information can be checked at once in the activity of processing the records management standard tasks that were fragmentarily nonlinked. If a blockchain function is installed in the electronic records management system, the person who produces the document by acquiring and registering the document enters the metadata and information, as well as stores and classifies all contents. This would simplify the process of reporting the production status and provide real-time information through the original text information disclosure service. Archivechain is a model that applies a cloud infrastructure as a backend as a service (BaaS) by applying a hyperledger platform based on the assumption that an electronic document production system and a records management system are integrated. Creating a smart, electronic system of the records management is the solution to bringing scattered information together by placing all life cycles of public records management in a blockchain.