• Title/Summary/Keyword: Data quality evaluation model (데이터품질 평가모델)

Quality Evaluation of Chest X-ray Open Dataset through Pixel Value Analysis by Region (영역별 화소값 분석을 통한 흉부 X선 오픈 데이터셋 품질 평가)

  • Choi, Hyeon-Jin;Bea, Su-Bin;Sun, Joo-Sung;Lee, Jung-Won
    • Annual Conference of KIPS / 2022.05a / pp.614-617 / 2022
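  • With advances in artificial intelligence, deep-learning-based disease diagnosis is being actively studied in medical imaging. The quantity and quality of training data are critical when developing a model, yet few datasets are accessible in the medical field, and open datasets are distributed by different institutions or collected from the web, so quality suitable for diagnosis is hard to expect. Moreover, existing studies use datasets without verifying whether their quality is suitable for training. This paper therefore proposes a chest X-ray image quality evaluation method based on region-wise pixel value analysis, grounded in the image quality criteria used in clinical practice. Applying the proposed method to the open datasets JSRT and Chest14 and to AUH, a dataset from domestic hospital A, yielded strong performance, with a sensitivity of 91.5% and a specificity of 96.1%.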
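The sensitivity and specificity figures quoted in this abstract are standard confusion-matrix ratios. The following is a minimal sketch of how such figures are computed; the labels are invented for illustration, and treating "low-quality image" as the positive class is an assumption, since the abstract does not say which class the authors treat as positive.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels, where 1 = positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Hypothetical labels (assumption): 1 = low-quality image, 0 = acceptable image.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
sens, spec = sensitivity_specificity(y_true, y_pred)
print(f"sensitivity={sens:.3f}, specificity={spec:.3f}")
```

Sensitivity is the share of truly positive cases that are flagged, and specificity is the share of truly negative cases that are left unflagged.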

The Software Quality Testing on the basis of the International Standard ISO/IEC 25023 (국제표준 ISO/IEC 25023 을 기반으로 한 소프트웨어 품질평가)

  • Jung, Hye-Jung
    • Journal of the Korea Convergence Society / v.7 no.6 / pp.35-41 / 2016
  • Because software is so important, people today are interested in software quality testing. In this paper, we analyze the international standard and test data, and propose a testing method based on that analysis. We compare the ISO/IEC 9126-2 testing model with the ISO/IEC 25023 testing model. On the basis of ISO/IEC 25023, we classify the test data and analyze how the international standards differ with respect to functionality, reliability, usability, efficiency, maintainability, portability, compatibility, and security. Using 331 real test records, we classify the test data and analyze differences according to sex. We fit a regression model on functionality, usability, and testing date, and we demonstrate differences in testing date and in the number of errors by tester. We also demonstrate differences in the number of errors by software type.

Verification of the Suitability of Fine Dust and Air Quality Management Systems Based on Artificial Intelligence Evaluation Models

  • Heungsup Sim
    • Journal of the Korea Society of Computer and Information / v.29 no.8 / pp.165-170 / 2024
  • This study aims to verify the accuracy of the air quality management system in Yangju City using an artificial intelligence (AI) evaluation model. The consistency and reliability of fine dust data were assessed by comparing public data from the Ministry of Environment with data from Yangju City's air quality management system. To this end, we analyzed the completeness, uniqueness, validity, consistency, accuracy, and integrity of the data. Exploratory statistical analysis was employed to compare data consistency. The results of the AI-based data quality index evaluation revealed no statistically significant differences between the two datasets. Among AI-based algorithms, the random forest model demonstrated the highest predictive accuracy, with its performance evaluated through ROC curves and AUC. Notably, the random forest model was identified as a valuable tool for optimizing the air quality management system. This study confirms that the reliability and suitability of fine dust data can be effectively assessed using AI-based model performance evaluation, contributing to the advancement of air quality management strategies.
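Two of the data quality dimensions named in this abstract, completeness and uniqueness, can be expressed as simple ratios. The sketch below shows one common way to define them; the record layout, field names, and readings are hypothetical, and the definitions are assumptions rather than the index used in the study.

```python
def completeness(records, fields):
    """Fraction of expected field values that are present (not None or empty)."""
    total = len(records) * len(fields)
    if total == 0:
        return float("nan")
    filled = sum(1 for r in records for f in fields if r.get(f) not in (None, ""))
    return filled / total


def uniqueness(records, key_fields):
    """Fraction of records that remain after collapsing duplicate keys."""
    if not records:
        return float("nan")
    keys = {tuple(r.get(f) for f in key_fields) for r in records}
    return len(keys) / len(records)


# Hypothetical fine dust readings (assumption): one missing value, one duplicate key.
readings = [
    {"station": "A", "time": "2024-01-01T00:00", "pm25": 21.0},
    {"station": "A", "time": "2024-01-01T01:00", "pm25": None},
    {"station": "A", "time": "2024-01-01T01:00", "pm25": 19.5},
    {"station": "B", "time": "2024-01-01T00:00", "pm25": 30.2},
]
c = completeness(readings, ["station", "time", "pm25"])
u = uniqueness(readings, ["station", "time"])
print(f"completeness={c:.3f}, uniqueness={u:.3f}")
```

Computing the same ratios on two sources of the same measurements gives per-dimension scores that can then be compared statistically.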

Model for Quality Assessment of Data Analytics Software in Manufacturing-Based IIoT Environments (제조 기반 IIoT 환경에서 데이터 분석 소프트웨어의 품질 평가를 위한 모델)

  • Choi, Jongseok;Shin, Yongtae
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.14 no.4 / pp.292-299 / 2021
  • With the development of IT, data mining software for manufacturing-based IIoT environments is increasingly common. However, the quality of such software is difficult to evaluate in the same way as general software, because a manufacturing company's software must handle large amounts of data using big data and data mining techniques. In addition, in a manufacturing environment where heterogeneous equipment and software are mixed, it is difficult to judge the quality of the software in use by applying existing quality characteristics. Therefore, this paper investigates the characteristics of the manufacturing environment and develops and evaluates a software quality evaluation model suited to it.

Developing an Assessment Model of Library Open Data Quality (도서관의 오픈 데이터 품질측정모델 개발)

  • Park, Jin Ho
    • Journal of the Korean Society for Information Management / v.35 no.1 / pp.33-59 / 2018
  • This study draws on the current momentum to diversify open government data research through multidimensional scaling and model development. It formulates a quality assessment model applicable to library open data, taking into consideration the paucity of such research in the field. The model was developed using the Delphi method and verified for validity and reliability on the basis of a survey administered to library open data users. The results of the fourth round exhibited an average of 4.00 for all measured elements and a minimum validity of .75, rendering the model appropriate for use in quality assessments of library open data. The convergence and stability results provided by the expert panel fell below .50, confirming that there was no need to conduct further surveys in order to establish the validity of the Delphi method. The model's reliability likewise reached .60 and above in all three dimensions. The model completed with the input of the Delphi panel was then put through a verification process in which library open data users, such as domestic and international librarians, developers, and open data activists, reviewed it for validity and reliability. The model scored low on validity on account of its failure to load all measured factors and elements pertaining to the three dimensions. Reliability results, on the other hand, were at 0.6 and above for all dimensions and measured elements.

A Study on the Derivation of Items for Development of Data Quality Standard for 3D Building Data in National Digital Twin (디지털 트윈국토 건물 데이터 품질 표준 개발을 위한 항목 도출에 관한 연구)

  • Kim, Byeongsun;Lee, Heeseok;Hong, Sangki
    • Journal of Cadastre & Land InformatiX / v.52 no.1 / pp.37-55 / 2022
  • This study presents plans for deriving quality items to develop a data quality standard that ensures the quality of 3D building geospatial data in the National Digital Twin (NDT). The paper is organized as follows. The first section briefly examines factors that affect the quality of 3D geospatial data and argues for the role and necessity of a data quality standard as a means of properly addressing data errors and meeting stakeholders' minimum requirements. The second section analyzes the relationship between two directly relevant standards, the building data model for NDT and ISO 19157 (Geospatial data quality). Finally, we suggest three plans for developing the NDT data quality standard: (1) the scope for evaluating data quality, (2) additional quality elements (geometric integrity, geometric fidelity, positional accuracy, and semantic classification accuracy), and (3) an NDT data quality items model based on ISO 19157. The plans revealed through this study should help establish a path toward a national standard on NDT data quality, as well as other NDT-related standards, over the coming years.

An Analysis of Vertical Position Accuracy for the Three-Dimensional Spatial Data Object Utilizing the Public Information (공공데이터를 활용한 3차원 공간정보 객체의 수직위치 정확도 분석)

  • Kim, Jeong Taek;Yi, Su Hyun;Kim, Jong Il;Bae, Sang Won
    • Journal of Korean Society for Geospatial Information Science / v.22 no.3 / pp.137-143 / 2014
  • Recently, under the new government operating paradigm called Government 3.0, the government has been actively pursuing policies to open and share public data. In addition, the Ministry of Land operates an open-platform integrated map service (VWorld) that provides the public with a variety of video contents, such as national spatial information, traffic information, and three-dimensional buildings. According to the W3C Foundation's Open Data Status Report (2013), Korea was rated well on government policy support and planning but weak on data management, so the quality of its data management needs to improve. In addition, the digital aerial photograph data underlying three-dimensional spatial object data must be kept up to date. In this paper, we present a method for improving the accuracy of vertical position and for keeping vertical position up to date. Our method evaluates data quality and analyzes the causes of measurement error using the national standard quality assessment method. The results show that vertical position accuracy improves if the height of the building captain is adjusted by the quality assessment values, and that a three-dimensional model stays up to date if the reconstruction and extension information in the construction register is used.

Quality Evaluation Model about Efficiency for Fingerprint Recognition System (지문인식 시스템의 효율성에 관한 품질평가 모델)

  • Lee, Ha-Young;Kim, Jung-Gyu
    • Journal of Digital Convergence / v.12 no.6 / pp.215-221 / 2014
  • A fingerprint recognition system verifies a user's identity by comparing the user's fingerprint with prepared data. Its performance depends on factors such as fingerprint recognition time and fingerprint recognition accuracy. In this paper, we develop an efficiency evaluation model, based on the ISO quality evaluation standard, for assessing the quality level of fingerprint recognition systems. We expect this study to contribute to the construction and use of evaluation criteria based on quality evaluation standards.

Data Asset Valuation Model Review (데이터 자산 가치 평가 모델 리뷰)

  • Kim, Ok-ki;Park, Jung;Park, Cheon-woong;Cho, Wan-Sup
    • The Journal of Bigdata / v.6 no.1 / pp.153-160 / 2021
  • This study examines previous studies on the income (profit) model, the approach most often used to value data held by companies or institutions, and discusses the model's key factors and the considerations in the data asset valuation process. The review confirmed that the shareability and utilization period of data assets are different from those of other companies. In addition, the value of data should be reviewed from various perspectives, such as timeliness and accuracy. For data asset valuation, the study concludes that the user's use, ability to use, and value chain should be reviewed as a whole. As future research directions, it proposes continued research and development of models applicable to actual business, along with revision of accounting law.

A Study on the Domain Discrimination Model of CSV Format Public Open Data

  • Ha-Na Jeong;Jae-Woong Kim;Young-Suk Chung
    • Journal of the Korea Society of Computer and Information / v.28 no.12 / pp.129-136 / 2023
  • The government of the Republic of Korea manages the quality of public open data through a public data quality management level evaluation. Public open data is provided in various open formats such as XML, JSON, and CSV, with CSV accounting for the majority. When diagnosing the quality of public open data in CSV format, the quality diagnosis manager determines the domain of each field from the field name and the data within the field, then diagnoses it. However, this takes a lot of time because quality diagnosis is performed on large numbers of open data files. Additionally, for fields whose meaning is difficult to understand, the accuracy of the diagnosis depends on the manager's ability to understand the data. This paper proposes a domain discrimination model for public open data in CSV format that uses field names and data distribution statistics, so that diagnosis results are consistent and accurate regardless of the capabilities of the person in charge, and so that diagnosis time is shortened. Applying the model yielded a correct answer rate of about 77%, which is 2.8% higher than the file-format open data diagnostic tool provided by the Ministry of Public Administration and Security. We therefore expect the proposed model to improve accuracy when it is applied to diagnosing and evaluating the quality management level of public data.
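To illustrate what "data distribution statistics" for a CSV field might look like, the sketch below computes a few per-column features that a domain classifier could take as input. The three features, the function name `column_features`, and the sample data are all assumptions made for illustration; the abstract does not specify which statistics or which classifier the authors use.

```python
import csv
import io


def is_number(text):
    """True if the string parses as a float."""
    try:
        float(text)
        return True
    except ValueError:
        return False


def column_features(values):
    """Simple distribution statistics for one CSV column (illustrative only)."""
    non_empty = [v for v in values if v.strip()]
    n = len(non_empty)
    if n == 0:
        return {"numeric_ratio": 0.0, "mean_length": 0.0, "distinct_ratio": 0.0}
    return {
        "numeric_ratio": sum(is_number(v) for v in non_empty) / n,
        "mean_length": sum(len(v) for v in non_empty) / n,
        "distinct_ratio": len(set(non_empty)) / n,
    }


# Hypothetical CSV content (assumption), read from memory instead of a file.
sample = "name,phone,count\nLibrary A,031-123-4567,12\nLibrary B,031-765-4321,7\n"
rows = list(csv.DictReader(io.StringIO(sample)))
features = {
    field: column_features([row[field] for row in rows]) for field in rows[0]
}
for field, stats in features.items():
    print(field, stats)
```

In this sample, the `count` column is fully numeric while `phone` is not, which is the kind of signal that could help separate a quantity domain from a phone-number domain when the field name alone is ambiguous.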