• Title/Summary/Keyword: metadata quality (메타데이터 품질)

Development of Data Profiling Software Supporting a Microservice Architecture (마이크로 서비스 아키텍처를 지원하는 데이터 프로파일링 소프트웨어의 개발)

  • Chang, Jae-Young;Kim, Jihoon;Jee, Seowoo
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.21 no.5 / pp.127-134 / 2021
  • With the expansion of the big data industry, acquiring high-quality data has become an important issue. Acquiring high-quality data requires accurate evaluation of data quality first. Data quality can be evaluated through meta-information such as statistics on the data, and the task of extracting such meta-information is called data profiling. Until now, data profiling software has typically been provided as a component or an additional service of traditional data quality or visualization tools, so it has not been suitable for direct use in diverse environments. To address this problem, this paper presents data profiling software based on a microservice architecture that can be deployed in various environments. The data profiler provides an easy-to-use interface through which requests for meta-information are served via a RESTful API. Because the proposed profiler is independent of any specific environment, it can be integrated efficiently with various big data platforms and data analysis tools.
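
As a rough illustration of what "extracting meta-information such as statistics on data" can mean, the sketch below profiles a single column in Python. It is not the software described in the paper: the function name `profile_column`, the choice of statistics, and the sample values are all assumptions made for this example.

```python
from collections import Counter
from statistics import mean


def profile_column(values):
    """Return simple meta-information for one column; None marks a missing value."""
    present = [v for v in values if v is not None]
    profile = {
        "count": len(values),
        "missing": len(values) - len(present),
        "distinct": len(set(present)),
        "most_common": Counter(present).most_common(1)[0][0] if present else None,
    }
    # Add numeric statistics only when every present value is a number.
    numeric = [v for v in present
               if isinstance(v, (int, float)) and not isinstance(v, bool)]
    if numeric and len(numeric) == len(present):
        profile.update(min=min(numeric), max=max(numeric), mean=mean(numeric))
    return profile


# Hypothetical sample column with one missing value.
ages = [34, 29, None, 41, 29]
profile = profile_column(ages)
print(profile)
```

A service like the one in the abstract would presumably expose results of this kind through an HTTP endpoint. The abstract does not describe the endpoint design, so it is left out here.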

A Study on Construction Method of Foreign Scientific Database by Utilizing Available Information Resources (가용자원을 활용한 해외학술정보 데이터베이스제작방법에 관한 연구)

  • 노경란;권오진
    • Proceedings of the Korea Contents Association Conference / 2003.05a / pp.323-326 / 2003
  • The conventional construction of databases of foreign scientific journals has many problems, from material acquisition to DB loading. This paper designs a database construction model that utilizes available information resources scattered across several locations and uses agent technology to gather essential metadata efficiently. It describes the component information of a foreign scientific database and the related available resources, and then describes a DB construction process that includes a metadata gathering method, an automatic metadata classification method, and a metadata quality monitoring method.

Constructing a Knowledge Graph for Improving Quality and Interlinking Basic Information of Cultural and Artistic Institutions (문화예술기관 기본정보의 품질개선과 연계를 위한 지식그래프 구축)

  • Euntaek Seon;Haklae Kim
    • Journal of the Korean Society for Information Management / v.40 no.4 / pp.329-349 / 2023
  • With the rapid development of information and communication technology, the speed of data production has increased sharply, a trend represented by the concept of big data. Discussions on quality and reliability are also underway for big data, whose scale has grown rapidly in a short period of time. Small data, by contrast, is minimal data of excellent quality: the data necessary for a specific problem situation. In the field of culture and arts, data of various types and topics exist, and research using big data technology is being conducted. However, there is little research on whether basic information about culture and arts institutions is accurately provided and utilized. An institution's basic information is an essential basis for most big data analysis and a starting point for identifying the institution. This study collected data on the basic information of culture and arts institutions to define common metadata, and constructed small data in the form of a knowledge graph that links institutions around the common metadata. This offers a way to explore the types and characteristics of culture and arts institutions in an integrated manner.

A Reference Model Design for Management of Educational Internet Multimedia Contents (교육정보 인터넷 동영상 관리를 위한 참조 모델 설계)

  • Kang, Yun-Hee
    • Proceedings of the KAIS Fall Conference / 2007.05a / pp.162-164 / 2007
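  • The 16 metropolitan and provincial offices of education across Korea currently serve a large volume of Internet videos that they have produced. Increasing the reuse and shared use of this content, however, requires efficient content development methods for the video development process and content management that includes quality control. Because multiple repositories are used for sharing Internet videos, metadata registration work is duplicated, and resolving this requires system-level linkage for shared use. This paper examines the role and requirements of Internet video metadata for distributing Internet video content, as well as the characteristics of educational information metadata, and derives principles for applying Internet video metadata and the required metadata elements. It also presents a model for effectively linking Internet video metadata with the National Education Information Sharing System and the Cyber Home Learning Service.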

Quality Evaluation of the Open Standard Data (공공데이터 개방표준 데이터의 품질평가)

  • Kim, Haklae
    • The Journal of the Korea Contents Association / v.20 no.9 / pp.439-447 / 2020
  • Public data refers to all data or information created by public institutions, and to public information that leads to communication and cooperation among all people. Public data is an important means of leading next-generation industries such as artificial intelligence and smart cities, and Korea consistently ranks high in international evaluations of public data. However, despite continuous efforts, the use of public data and its industrial influence remain insufficient. Quality issues are continually raised in the use of public data, but criteria for quantitatively evaluating data are lacking. This paper reviews indicators for public data quality evaluation and performs a quantitative evaluation of selected public data. In particular, it examines the quality of open standard data, which is constructed and released according to public data management guidelines, to determine whether the government guidelines are appropriate. The data quality assessment covers the metadata and data values of open standard data and is based on completeness and accuracy indicators. Based on the analysis results, this paper proposes policy and technical measures for quality improvement.
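
To make the idea of a completeness indicator concrete, the following Python sketch scores a dataset by the share of required fields that are actually filled in. The abstract does not give the paper's exact formula, so this definition, the function name `completeness`, and the sample records are illustrative assumptions only.

```python
def completeness(records, required_fields):
    """Share of required field slots holding a non-empty value (0.0 to 1.0)."""
    total = len(records) * len(required_fields)
    if total == 0:
        return 1.0  # assumption: nothing required means nothing is missing
    filled = sum(
        1
        for record in records
        for field in required_fields
        if record.get(field) not in (None, "")  # absent, None, or "" count as missing
    )
    return filled / total


# Hypothetical records; the field names are made up for this example.
records = [
    {"name": "Park A", "address": "Seoul", "phone": ""},
    {"name": "Park B", "address": None, "phone": "02-000-0000"},
]
score = completeness(records, ["name", "address", "phone"])
print(f"completeness = {score:.2f}")
```

An accuracy indicator would additionally need a rule or reference source to compare each value against, which is why it is not sketched here.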

Suggestions on how to convert official documents to Machine Readable (공문서의 기계가독형(Machine Readable) 전환 방법 제언)

  • Yim, Jin Hee
    • The Korean Journal of Archival Studies / no.67 / pp.99-138 / 2021
  • In the era of big data, analyzing unstructured data as well as structured data is emerging as an important task. Official documents produced by government agencies, as large volumes of text-based unstructured data, are also subject to big data analysis. From the perspective of internal work efficiency, knowledge management, records management, etc., it is necessary to analyze public documents as big data to derive useful implications. However, since many of the public documents currently held by public institutions are not in an open format, big data analysis requires a preprocessing step that extracts text from a bitstream. In addition, since contextual metadata is not sufficiently stored in the document file, separate efforts to secure metadata are required for high-quality analysis. In conclusion, current official documents have a low level of machine readability, which makes big data analysis expensive.

Extending Sensor Registry System Using Network Coverage Information (네트워크 커버리지를 이용한 센서 레지스트리 시스템 확장)

  • Jung, Hyunjun;Jeong, Dongwon;Lee, Sukhoon;Baik, Doo-Kwon
    • KIPS Transactions on Software and Data Engineering / v.4 no.9 / pp.425-430 / 2015
  • The Sensor Registry System (SRS) provides sensor metadata to users for instant use and seamless interpretation of sensor data in a heterogeneous sensor network environment. The existing SRS cannot provide sensor metadata when the network connection is unavailable or unstable. To resolve this problem, this paper proposes an extension of the SRS that uses network coverage information. The extended system sends a set of sensor metadata to the user by using open data on network coverage (mobile vendors, signal strength, communication type). The extended SRS proposed in this paper supports safer sensor metadata provision than the existing SRS, and thus improves the quality of application services.

A Study on the Model of Internet Public Library in Korea (IPL-Korea) (인터넷 공공도서관 구축 모형 연구)

  • 고영만;오삼균
    • Journal of the Korean Society for Information Management / v.16 no.4 / pp.109-123 / 1999
  • We are faced with a paradox in the age of information: finding quality information on the Internet becomes more challenging because of information overload. This paper describes the prototype for the “IPL-Korea” (Internet Public Library in Korea) project, which is an attempt to provide the public with quality information in the form of a metadata system. The system involves cataloging of resources, i.e. websites, that are filtered by library and information science majors as well as information professionals. The user focus of this system is on children, youth, women, and seniors; various classification schemes and resource descriptions relevant to each user group are incorporated into the system to allow efficient browsing of the resources. A thesaurus for “IPL-Korea”, based on the ERIC thesaurus, is being constructed for easy manipulation of the breadth of searching. The “IPL-Korea” metadata system employs the entity-relationship model in the design of its conceptual schema. Metadata is stored in the Oracle database system, and Web interfaces to this database are provided through ASP, ColdFusion, and Java technology.

A Study of the Workflow and the Metadata for Web Records Archiving (웹 기록물 아카이빙을 위한 워크플로우 및 메타데이터 연구)

  • Seung-Jun Cha;Dong-Suk Chun;Kyu-Chul Lee
    • Proceedings of the Korea Information Processing Society Conference / 2008.11a / pp.1379-1382 / 2008
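  • In today's rapidly changing society, the Web has become a major communication channel between government and citizens. As the volume of information circulating on the Web has surged, reliance on the Web as an information source has grown considerably, and the number of information resources that exist only on the Web is also increasing. Websites of significant value are disappearing because of their short life cycles and the lack of measures for collecting, preserving, and using them. Addressing this problem requires workflow and metadata definitions as foundational technologies for web records archiving. This paper therefore defines a workflow for archiving web records that consists of selection, collection, quality control and cataloging, preservation, and storage, together with the metadata essential for long-term preservation and retrieval. Through this research, development, and application, web records, an important but vanishing resource, can be stored and managed as valuable records for future generations.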

A Study on Book Metadata Creation and Distribution on Supply Chain (공급사슬상의 도서메타데이터 생성.유통에 관한 고찰)

  • Cho, Jane
    • Journal of the Korean Society for Library and Information Science / v.44 no.3 / pp.61-80 / 2010
  • The publishing community has recently come to recognize the importance of metadata in customers' buying decisions. As a result, publishers are more interested in effective metadata creation and quality maintenance, as well as in standardizing the exchange system in the supply chain. The library community, which is also investigating the economic effectiveness of creating metadata, is trying to find the best model for simplifying metadata creation by using sources close to the original. This study analyzes metadata workflows that share the same source but are used in different fields, each with its own types and standards. It also discusses the issues common to each sector and the possibility of interoperation. Finally, this paper seeks an effective creation and distribution model for book metadata that can be used by the domestic publishing and library communities.