• Title/Summary/Keyword: 결측 보정

Search Result 80, Processing Time 0.032 seconds

A Novel on Auto Imputation and Analysis Prediction Model of Data Missing Scope based on Machine Learning (머신러닝기반의 데이터 결측 구간의 자동 보정 및 분석 예측 모델에 대한 연구)

  • Jung, Se-Hoon;Lee, Han-Sung;Kim, Jun-Yeong;Sim, Chun-Bo
    • Journal of Korea Multimedia Society
    • /
    • v.25 no.2
    • /
    • pp.257-268
    • /
    • 2022
  • When there is a missing value in the raw data, if ignore the missing values and proceed with the analysis, the accuracy decrease due to the decrease in the number of sample. The method of imputation and analyzing patterns and significant values can compensate for the problem of lower analysis quality and analysis accuracy as a result of bias rather than simply removing missing values. In this study, we proposed to study irregular data patterns and missing processing methods of data using machine learning techniques for the study of correction of missing values. we would like to propose a plan to replace the missing with data from a similar past point in time by finding the situation at the time when the missing data occurred. Unlike previous studies, data correction techniques present new algorithms using DNN and KNN-MLE techniques. As a result of the performance evaluation, the ANAE measurement value compared to the existing missing section correction algorithm confirmed a performance improvement of about 0.041 to 0.321.

A Real-time Correction of the Underestimation Noise for GK2A Daily NDVI (GK2A 일단위 NDVI의 과소추정 노이즈 실시간 보정)

  • Lee, Soo-Jin;Youn, Youjeong;Sohn, Eunha;Kim, Mija;Lee, Yangwon
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_1
    • /
    • pp.1301-1314
    • /
    • 2022
  • Normalized Difference Vegetation Index (NDVI) is utilized as an indicator to represent the vegetation condition on the land surface in various applications such as land cover, crop yield, agricultural drought, soil moisture, and forest disaster. However, satellite optical sensors for visible and infrared rays cannot see through the clouds, so the NDVI of the cloud pixel is not a valid value for the land surface. This study proposed a real-time correction of the underestimation noise for GEO-KOMPSAT-2A (GK2A) daily NDVI and made sure its feasibility through the quantitative comparisons with Moderate Resolution Imaging Spectroradiometer (MODIS) NDVI and the qualitative interpretation of time-series changes. The underestimation noise was effectively corrected by the procedures such as the time-series correction considering vegetation phenology, the outlier removal using long-term climatology, and the gap filling using rigorous statistical methods. The correlation with MODIS NDVI was higher, and the difference was lower, showing a 32.7% improvement compared to the original NDVI product. The proposed method has an extensibility for use in other satellite products with some modification.

A point-scale gap filling of the flux-tower data using the artificial neural network (인공신경망 기법을 이용한 청미천 유역 Flux tower 결측치 보정)

  • Jeon, Hyunho;Baik, Jongjin;Lee, Seulchan;Choi, Minha
    • Journal of Korea Water Resources Association
    • /
    • v.53 no.11
    • /
    • pp.929-938
    • /
    • 2020
  • In this study, we estimated missing evapotranspiration (ET) data at a eddy-covariance flux tower in the Cheongmicheon farmland site using the Artificial Neural Network (ANN). The ANN showed excellent performance in numerical analysis and is expanding in various fields. To evaluate the performance the ANN-based gap-filling, ET was calculated using the existing gap-filling methods of Mean Diagnostic Variation (MDV) and Food and Aggregation Organization Penman-Monteith (FAO-PM). Then ET was evaluated by time series method and statistical analysis (coefficient of determination, index of agreement (IOA), root mean squared error (RMSE) and mean absolute error (MAE). For the validation of each gap-filling model, we used 30 minutes of data in 2015. Of the 121 missing values, the ANN method showed the best performance by supplementing 70, 53 and 84 missing values, respectively, in the order of MDV, FAO-PM, and ANN methods. Analysis of the coefficient of determination (MDV, FAO-PM, and ANN methods followed by 0.673, 0.784, and 0.841, respectively.) and the IOA (The MDV, FAO-PM, and ANN methods followed by 0.899, 0.890, and 0.951 respectively.) indicated that, all three methods were highly correlated and considered to be fully utilized, and among them, ANN models showed the highest performance and suitability. Based on this study, it could be used more appropriately in the study of gap-filling method of flux tower data using machine learning method.

Development of Hydrologic Data Aquisition and Management System(HDAMS) in Anyangcheon watershed (안양천 유역의 실시간 수문모니터링 자료관리시스템 개발)

  • Lee, Kyoung-Do;Kim, You-Jin;Kim, Nam-Il;Lee, Kil-Seoung
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2007.05a
    • /
    • pp.2029-2033
    • /
    • 2007
  • 오늘날 특정 유역에서의 수문현상 및 수문순환에 대한 분석을 위한 연구가 활발히 진행되고 있다. 이를 위해서는 수문자료의 관측은 반드시 수반되어야 하며, 관측자료의 품질관리 및 원시자료의 관리 등이 요구되고 있는 실정이다. 관측된 수문자료의 품질관리라 함은 자료의 신뢰도 분석과 자료의 보완의 두 과정을 포함한다. 여기서, 신뢰도 분석이라 함은 자료 속에 포함된 불확실성을 판별하는 작업을 의미하며, 자료의 불확실성은 위에서 언급된 자료의 불충분 및 불안정을 제외한 부정확, 불일관성에서 비롯된다. 자료의 보완이라 함은 자료의 신뢰도 분석을 통하여 자료 속에 포함된 불확실한 성분들을 찾아내고, 이를 제거한 후 완전한 자료로 대체하고, 자료가 결측된 경우 공백을 연결함으로써 자료의 완전성을 유지하거나 또는 불충분한 자료를 확장하는 일련의 보완작업이라고 정의한다. 자료의 품질을 결정하는 주요 인자는 크게 관측소 관리의 하드웨어적인 측면과 자료 분석의 소프트웨어적인 측면이 있다. 하드웨어적인 측면에서의 수문자료 품질관리를 위해서 본 과제에서는 현장에 설치된 수위계, 강우량계의 센서 등에 대한 장비를 점검하고, 현장실측을 통해 지속적으로 측정값을 보정해주는 역할을 수행하고 있으며, 소프트웨어적인 측면에서 수문자료의 품질관리를 위해서는 수문자료의 수집 단계부터 시작하여 데이터베이스 저장, 필터링, 통계분석, 웹 및 C/S(Client Server)를 통한 배포 등의 일련의 자료 처리 과정을 수행할 수 있는 수문자료관리 프로그램을 웹 시스템과 C/S로 분류하여 정의내릴 수 있다. 본 연구에서는 수문자료의 관리자 입장에서의 보다 효율적이고 체계적으로 자료를 관리하고 분석하기 위한 방안으로 수문자료관리시스템(Hydrologic Data Aquisition and Management System, HDAMS)을 개발하였다. 이 시스템은 안양천 유역에서 시범 적용하고 있으며, 범용성을 전제로 개발되었다. 또한 수문자료 관리 프로그램의 DB 구조 및 DB 자료를 활용한 다양한 분석기능은 갖도록 설계하였으며 계획된 데이터베이스 구조를 바탕으로 계측기 인터페이스와 사용자 인터페이스, 데이터베이스 간의 연동이 원활히 이루어지도록 개발하고자 한다.

  • PDF

An Evaluation System For Freeway Traffic Data Processing Techniques (고속도로 교통자료 처리기법 통합평가 시스템 개발)

  • Oh, Dong-Wook;Oh, Cheol;NamKoong, Sung;Jeon, Se-Kil
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.7 no.4
    • /
    • pp.13-24
    • /
    • 2008
  • Real-time traffic data are readily obtainable by traffic surveillance systems of intelligent transportation systems (ITS). Such data greatly support further applications in the field of traffic operations, planning, and safety. However, traffic data should be appropriately processed to fully exploit the benefits of data collection capability. Rather than developing individual data processing techniques, which is major concern of existing studies, this study proposes a novel methodology for evaluating data processing techniques in an integrated manner. Also, a tool for implementing the proposed methodology is developed. Users can extract useful and more reliable traffic data based upon their ultimate purpose of data usage by the evaluation tool developed in this study. Actual freeway traffic data are, as an example, fed into the evaluation tool, and results are discussed.

  • PDF

Design and development of GWB (Global Water Bank) system (GWB (Global Water Bank) 시스템 설계 및 개발)

  • Kim, Min Kuk;Kim, Jeong Bae;Park, Jong-Pyo;Jeong, Ui-Seok;Bae, Deg Hyo
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2017.05a
    • /
    • pp.468-468
    • /
    • 2017
  • 최근 기후변화로 인해 수자원 위험요소인 가뭄 및 홍수피해가 증가하고 있으며, 인구증가에 따른 수량감소로 수자원관리가 더욱 어려워지고 있는 실정이다. 효율적인 수자원관리 및 기후변화 대응을 위해 세계 물시장은 점점 증가하고 있으며, 이에 따라 국내기업 또한 해외사업 진출을 추진하고 있으나, 사업에 필수적인 기상, 수문 등 기초자료의 부재로 어려움을 겪고 있다. 본 연구에서는 국내기업의 해외사업 진출 시 필요한 기상, 지형, 수문, 인문 사회 등 기초자료를 제공하는 글로벌 수자원정보제공시스템(Global Water Bank, GWB)을 설계 및 개발하고자 한다. 국내 외 예비타당성보고서 및 국내에서 수집 가능한 국외 정보현황을 분석하여 자료 제공인자를 도출하였으며, 이를 토대로 시스템 내 제공항목을 기상, 지형, 수문해석, 인문 사회, 기후변화 자료로 구분하였다. 해외시장 진출범위를 고려하여 자료의 공간적인 범위를 전지구로 설정하였으며, 전지구 자료의 가용성을 검토하여 제공자료를 구축하였다. 기상자료는 NCDC (National Climate Data Center)의 관측 지점자료와 APHRODITE (Asian Precipitation - Highly-Resolved Observational Data Integration Towards Evaluation) 격자자료를 수집하였으며, 오 결측 자료는 품질검토를 수행하여 보정하였다. 지형자료의 경우 USGS (U.S. Geological Survey)의 DEM, FAO (Food and Agriculture Organization of the United Nations)의 토양도, UMD (University of Maryland)의 토지피복도를 구축하였다. 수문자료는 GRDC(Global Runoff Data Centre)의 관측 지점자료를 수집하였으며, 미계측 지역의 수문자료 구축을 위해 VIC(Variable Infiltration Capacity) 수문모형을 활용하여 $0.5^{\circ}$ 공간해상도의 격자 유출량 자료를 생산하였다. 인문 사회자료로 World Bank의 국가별 통계자료를 수집하였으며, 구축된 각 자료는 GWB 시스템을 통해 제공된다. 시스템의 시범운영을 위해 아시아 지역을 대상으로 GWB- 버전을 개발하였으며, 시범지역 내 관측자료와 비교분석하여 자료의 활용성을 검증하였다. 추후 GWB 시스템은 해외진출 사업 우선지역 선정 근거로 활용될 수 있는 가상수 및 물산업지수 등의 추가정보를 제공하고 타 지역으로 확대적용 예정이다.

  • PDF

Activity Type Detection Of Random Forest Model Using UWB Radar And Indoor Environmental Measurement Sensor (UWB 레이더와 실내 환경 측정 센서를 이용한 랜덤 포레스트 모델의 재실활동 유형 감지)

  • Park, Jin Su;Jeong, Ji Seong;Yang, Chul Seung;Lee, Jeong Gi
    • The Journal of the Convergence on Culture Technology
    • /
    • v.8 no.6
    • /
    • pp.899-904
    • /
    • 2022
  • As the world becomes an aging society due to a decrease in the birth rate and an increase in life expectancy, a system for health management of the elderly population is needed. Among them, various studies on occupancy and activity types are being conducted for smart home care services for indoor health management. In this paper, we propose a random forest model that classifies activity type as well as occupancy status through indoor temperature and humidity, CO2, fine dust values and UWB radar positioning for smart home care service. The experiment measures indoor environment and occupant positioning data at 2-second intervals using three sensors that measure indoor temperature and humidity, CO2, and fine dust and two UWB radars. The measured data is divided into 80% training set data and 20% test set data after correcting outliers and missing values, and the random forest model is applied to evaluate the list of important variables, accuracy, sensitivity, and specificity.

Discussion of Soil Respiration for Understanding Ecosystem Carbon Cycle in Korea (생태계 탄소순환 이해를 위한 국내 토양호흡 연구의 고찰)

  • Lee, Jae-Ho;Yi, Jun-Seok;Chun, Young-Moon;Chae, Nam-Yi;Lee, Jae-Seok
    • Korean Journal of Ecology and Environment
    • /
    • v.46 no.2
    • /
    • pp.310-318
    • /
    • 2013
  • In territorial ecosystem, soil has stored considerable amount of carbon, and it is vulnerable to weakness release much of the carbon to atmosphere. In this study, we have been effort realization and discussion to the error between inter-instruments and measurement methods, time and special variations, gap filling and separation from each source included in soil respiration, used to collect soil respiration data in various ecosystems in Korea. In conclusion, it have to collect calibration data throughout comparison test between methods and instruments because accumulated data from past and accumulating data in present did not calibrated. In predicting change of soil carbon dynamic using the model method, it needs important data such as longterm and short-term data, artificial handling data of major factor, data from various ecosystem, soil texture, soil depth etc. In company with, we should collect highly qualified data through deep consideration of present problems.

Hydrologic Analysis of the September 1990 Extreme Flood Occurred on the Chungju Dam Basin (충주(忠州)댐 유역(流域) 1990년(年) 9월(月) 대홍수(大洪水)의 수문학적(水文學的) 분석(分析))

  • Ko, Seok Ku;Lee, Hee Sung;Jeong, Dong Kug;Jung, Jae Sung
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.12 no.4_1
    • /
    • pp.107-119
    • /
    • 1992
  • A heavy storm hit the central part of the Korean Peninsula especially on the Chungju Dam Basin from the 9th to 12th of September 1990. The Chungju multipurpose dam is the largest water project in Korea completed in 1986. The storm recorded a peak inflow of about $21,000m^3/sec$ at the dam site which is equivalent to 500 to 1000 years recurring frequency according to the designed concept. Extensive hydrological analyses including field investigation were performed to identify the storm. The result of the field investigation showed that 6 gages among the 22 telemetering rain-gages located in the basin were proved to be out-of-normal operation during the storm. The corrected basin average rainfall was estimated to be 458.6 mm ranging from 206 to 665 mm. The correction of the rainfall depth included the adjustment of the rainfall depths of the 6 gages using the Kriging interpolation technique, and adjustment according to the heights of the gage mouths. For the maintenance and operation of the Chungju Dam, new design floods were suggested from the trend analysis which showed that the design flood have to be increased because of the increasing tendency of the annual flood peaks.

  • PDF

Estimation of TROPOMI-derived Ground-level SO2 Concentrations Using Machine Learning Over East Asia (기계학습을 활용한 동아시아 지역의 TROPOMI 기반 SO2 지상농도 추정)

  • Choi, Hyunyoung;Kang, Yoojin;Im, Jungho
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.2
    • /
    • pp.275-290
    • /
    • 2021
  • Sulfur dioxide (SO2) in the atmosphere is mainly generated from anthropogenic emission sources. It forms ultra-fine particulate matter through chemical reaction and has harmful effect on both the environment and human health. In particular, ground-level SO2 concentrations are closely related to human activities. Satellite observations such as TROPOMI (TROPOspheric Monitoring Instrument)-derived column density data can provide spatially continuous monitoring of ground-level SO2 concentrations. This study aims to propose a 2-step residual corrected model to estimate ground-level SO2 concentrations through the synergistic use of satellite data and numerical model output. Random forest machine learning was adopted in the 2-step residual corrected model. The proposed model was evaluated through three cross-validations (i.e., random, spatial and temporal). The results showed that the model produced slopes of 1.14-1.25, R values of 0.55-0.65, and relative root-mean-square-error of 58-63%, which were improved by 10% for slopes and 3% for R and rRMSE when compared to the model without residual correction. The model performance by country was slightly reduced in Japan, often resulting in overestimation, where the sample size was small, and the concentration level was relatively low. The spatial and temporal distributions of SO2 produced by the model agreed with those of the in-situ measurements, especially over Yangtze River Delta in China and Seoul Metropolitan Area in South Korea, which are highly dependent on the characteristics of anthropogenic emission sources. The model proposed in this study can be used for long-term monitoring of ground-level SO2 concentrations on both the spatial and temporal domains.