• Title/Summary/Keyword: 데이터 기준

Search Result 4,293, Processing Time 0.036 seconds

한의학에서의 사상체질판별함수 개발에 관한 연구 (II) - 도수분석에 의한 변수선택 -

  • Kim, Gyu-Gon;Jo, Min-Hyeong
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2004.04a
    • /
    • pp.69-77
    • /
    • 2004
  • 본 논문에서는 한방병원에서 사상체질분류검사설문지를 이용하여 사상체질을 진단할 때 진단의 정확도를 향상시키기 위한 사상체질분류함수를 개발하기 위하여 데이터마이닝에서의 판별분석모형을 이용한다. 데이터 정제 과정에서 양질의 데이터를 확보하기 위한 기준은 상반되는 설문의 응답 패턴과 체질별 설문의 응답 비율을 이용하며, 변수선택의 기준은 도수분석의 비율차이검정과 선형판별함수의 계수를 이용한다.

  • PDF

Ananlyzing Customer Management Data by Datamining (Focused on Apartment Customer Classification) (데이터마이닝을 통한 고객관리데이터의 분석 (아파트고객 세분화를 중심으로))

  • Baek, Shin Jung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2004.05a
    • /
    • pp.69-72
    • /
    • 2004
  • 기업간의 경쟁이 심화되고 정보의 중요성에 대한 인식이 확대되어 가는 상황에서 다량의 데이터로부터 가치 있는 데이터를 추출하는 CRM 데이터 마이닝은 중대한 관심사가 아닐 수 없다. 본 연구는 데이터마이닝의 여러 활용 분야 중 고객세분화를 위해 최근 많이 사용되고 있는 데이터마이닝 기법인 로지스틱 회귀분석, 의사결정나무, 신경망 알고리즘 기법들을 비교하며, 이를 실제 아파트 고객의 데이터를 이용하여 검증하고자 한다. 따라서, 아파트 고객 세분화를 위한 데이터마이닝 수행시 기법 선택의 기준과 비교 평가의 기준을 제시하는 데 연구목적 있다.

  • PDF

항로표지 데이터 품질지수 산출에 관한 연구

  • 정제한;한윤석;이예경;다이리;탕멍위엔;장준혁;신상문
    • Proceedings of the Korean Institute of Navigation and Port Research Conference
    • /
    • 2022.06a
    • /
    • pp.100-102
    • /
    • 2022
  • 데이터의 품질을 파악하고 그 기준을 선정하는 것은 해양 항로 표지와 같은 분석에 있어서 중요한 역할을 한다. 본 연구에서는 해양 분야에서 디지털 항로표지 데이터의 품질 진단을 위해 공정능력지수를 이용하여 데이터의 품질을 정량적으로 산출하고 그 결과에 대한 판정 기준을 명확히 하여 데이터에 대한 품질을 판단할 수 있는 척도를 제시하였다.

  • PDF

Standardization of Data Quality and Management Regulation for Korean CORS (국내 GNSS 상시관측소 데이터 품질 및 관리규정 표준화에 관한 연구)

  • Jin Sang, Hwang;Hyuk Gil, Kim;Hong Sik, Yun;Jae Myoung, Cho
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.33 no.4
    • /
    • pp.245-258
    • /
    • 2015
  • This study aimed to conduct the standardization of various specifications for determining the proper construction and operation of domestic CORS (Continuously Operating Reference Station). To achieve the plan, the standardization was proposed for various compositions of CORS, such as the data quality, structure, and equipment. Also, we have studied the method for empirically determining the reference values of QC (Quality Check) of CORS data. Those large amounts of samples for each QC index values were built to approach in empirical and statistical methods. In fact, those general and recommended reference values were determined from analyzing the sample distributions, using the empirical and statistical approaches. The result is expected to be utilized for a variety of research fields for standardization, accurate data acquisitions and service operations for the domestic CORS

A Study on Significant Properties for Dataset Type Preservation Format (데이터세트 유형 전자기록의 필수보존속성 연구)

  • Jung-eun Lee;Dongmin Yang
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.34 no.4
    • /
    • pp.259-283
    • /
    • 2023
  • This study acknowledges that prevailing regulation concerning for the long-term preservation of electronic records focus mainly on document types, neglecting the preservation of electronic records from various administrative information systems. With the growing interest in data management in the era of big data, it is imperative to establish clear standards for the long-term preservation of datasets. The choice of preservation format for electronic records is based on the specific standards for each type of electronic record. These standards are formulated according to the significant properties relevant to the electronic record type. This study aims to identify the significant properties of electronic records of each record type, before creating specific preservation format selection criteria for these record types. To achieve this, we reviewed and analyzed R&D studies by the National Archives of Korea and the NARA in the United States. As a result of the research, 9 significant properties were identified for database-type entities, and 7 significant properties were identified for structured data-type entities.

A Study of Criterion for Efficient Clustering Estimation of Temporal Data (Temporal 데이터의 효율적 군집 추정을 위한 기준 연구)

  • Jeon, Jin-Ho;Kim, Min-Soo
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.11 no.5
    • /
    • pp.139-144
    • /
    • 2011
  • Most real world system such as world economy, management, medical and engineering applications contain a series of complex phenomena. One of common methods to understand these system is to build a model and analyze the behavior of the system. As a first step, Determining the best clusters on data. As a second step, Determining the model of the cluster. In this paper, we investigated heuristic search methods for efficient clustering. It is also confirmed that the Bayesian Information Criterion more reliable than Cheeseman-Stutz ones.

Improving data quality through Data Owners management (데이터 오너 관리를 통한 데이터 품질 향상)

  • Park, Ji-Soo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2007.11a
    • /
    • pp.278-281
    • /
    • 2007
  • 데이터 품질 기준은 반드시 현업의 입장에서 바라봐야 하며, 현업의 마인드가 데이터 품질에 가장 결정적인 영향을 미친다. 이에 따라 데이터 품질을 향상시키기 위해서는 현업이 데이터 품질 관리에 직접 참여할 수 있는 연구가 필요하다. 본 연구에서는 데이터 값(Data Value)에 대한 데이터 오너 (Owner)를 부여하여 데이터 품질 오류 시 현업이 직접 데이터 품질 관리 프로세스에 참여 할 수 있는 방안을 제시하였다. 데이터 품질 관리 프로세스는 데이터 품질 대상 및 기준을 정의하고 측정, 분석, 개선하는 방법이다. 본 연구에서 제시한 데이터 오너 관리 방안은 보다 효율적인 데이터 품질 관리 프로세스를 개선 시킬 수 있을 것이다.

Proposition of causally confirmed measures in association rule mining (인과적 확인 측도에 의한 연관성 규칙 탐색)

  • Park, Hee Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.4
    • /
    • pp.857-868
    • /
    • 2014
  • Data mining is the representative analysis methodology in the era of big data, and is the process to analyze a massive volume database and summarize it into meaningful information. Association rule technique finds the relationship among several items in huge database using the interestingness measures such as support, confidence, lift, etc. But these interestingness measures cannot be used to establish a causality relationship between antecedent and consequent item sets. Moreover, we can not know association direction by them. This paper propose causally confirmed association thresholds to compensate for these problems, and then check the three conditions of interestingness measures. The comparative studies with basic association thresholds, causal association thresholds, and causally confirmed association thresholds are shown by simulation studies. The results show that causally confirmed association thresholds are better than basic and causal association thresholds.

MMS Data Accuracy Evaluation by Distance of Reference Point for Construction of Road Geospatial Information (도로공간정보 구축을 위한 기준점 거리 별 MMS 성과물의 정확도 평가)

  • Lee, Keun Wang;Park, Joon Kyu
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.39 no.6
    • /
    • pp.549-554
    • /
    • 2021
  • Precise 3D road geospatial information is the basic infrastructure for autonomous driving and is essential data for safe autonomous driving. MMS (Mobile Mapping System) is being used as equipment for road spatial information construction, and related research is being conducted. However, there are insufficient studies to analyze the effect of the baseline reference point distance, which is an important factor in the accuracy of the MMS outcome, on the accuracy of the outcome. Therefore, in this study, the accuracy of the data acquired using MMS by reference point distance was analyzed. Point cloud data was constructed using MMS for the road in the study site. For data processing, 4 data were constructed considering the distance from the reference point for MMS data, and the accuracy was analyzed by comparing the results of 12 checkpoints for accuracy evaluation. The accuracy of the MMS data showed a difference of -0.09 m to 0.11 m in the horizontal direction and 0.04 m to 0.19 m in the height direction. The error in the vertical direction was larger than that in the horizontal direction, and it was found that the accuracy decreased as the distance from the reference point increased. In addition, as the length of the road increases, the distance from the reference point may vary, so additional research is needed. If the accuracy evaluation of the method using multiple reference points is made in the future, it will be possible to present an effective method of using reference points for the construction of precise road spatial information.