• Title/Summary/Keyword: missing data (결측자료)

Search Result 302, Processing Time 0.028 seconds

A Comprehensive Method to Impute Vehicle Trajectory Data Collected in Wireless Traffic Surveillance Environments (무선통신기반 교통정보수집체계하에서의 차량주행궤적정보 결측치 보정방안)

  • Yeon, Ji-Yun;Kim, Hyeon-Mi;O, Cheol;Kim, Won-Gyu
    • Journal of Korean Society of Transportation / v.27 no.4 / pp.175-181 / 2009
  • Intelligent Transportation Systems (ITS) enable road users to make their trips more efficient under a variety of traffic conditions. As a significant part of ITS, vehicle-to-vehicle and vehicle-to-infrastructure communication technology is being developed to upgrade current traffic data collection through location-based traffic surveillance systems. The technology makes a wider and more detailed range of traffic data easy to acquire. However, its performance degrades under environmental impediments such as large vehicles, buildings, and harsh weather, which often cause wireless communication failures. To impute vehicle trajectory data interrupted by such failures, several existing methods were reviewed and a new method that complements them was devised. The AIMSUN API (Application Programming Interface) was used to simulate vehicle trajectory data, and missing trajectory data were generated at random to verify the method. The method was shown to yield more accurate and reliable traffic data than the existing ones.

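Time Series Analysis of Groundwater Level Fluctuations Using Groundwater Data with Irregular Observation Intervals (불규칙한 관측주기를 갖는 지하수자료를 이용한 지하수위 변동의 시계열 분석)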

  • 이명재;이강근
    • Proceedings of the Korean Society of Soil and Groundwater Environment Conference / 2000.11a / pp.64-68 / 2000
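  • Long-term groundwater level observations were analyzed with a transfer function-noise (TFN) model, one of the methods of time series analysis. In general, a transfer function model is applicable when the relationship between the input and output variables is linear, and it requires the data to be continuous in time. Because precipitation and groundwater level fluctuations are nonlinearly related, it is difficult to apply the transfer function model directly. To reduce the degree of nonlinearity, infiltration was computed with a physical model (HYDRUS) and used as the input variable for the transfer function model. The model estimated with infiltration as the input gave relatively smaller ME (mean error), RMSE (root-mean-square error), and MAE (mean absolute error) values than the model that used precipitation directly as input. The Kalman filter algorithm and maximum likelihood estimation were used to estimate the parameters of the TFN model. With the Kalman filter algorithm, transfer function models were also obtained for time series with irregular observation intervals or missing values, which made it possible to estimate the missing values.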
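
How a Kalman filter copes with gaps and uneven sampling can be sketched with a much simpler model than the one in the abstract. The example below assumes a local-level (random walk plus noise) model rather than a TFN model, and the function name, variance values, and sample series are all hypothetical. When an observation is missing, only the prediction step runs, and the process variance is scaled by the elapsed time so that irregular intervals are handled the same way.

```python
def kalman_local_level(series, process_var=0.01, obs_var=0.1):
    """Filter (time, value) pairs with a local-level model.

    A value of None marks a missing observation: the update step is
    skipped and the prediction is kept as the estimate for that time.
    """
    estimates = []
    level = variance = prev_t = None
    for t, y in series:
        if level is None:
            # Initialise the state from the first available observation.
            if y is not None:
                level, variance, prev_t = y, obs_var, t
            estimates.append(level)
            continue
        # Prediction: the level is carried forward and its uncertainty
        # grows in proportion to the time elapsed since the last step.
        variance += process_var * (t - prev_t)
        prev_t = t
        if y is not None:
            # Update: blend prediction and observation via the Kalman gain.
            gain = variance / (variance + obs_var)
            level += gain * (y - level)
            variance *= 1.0 - gain
        estimates.append(level)
    return estimates


# Hypothetical groundwater levels (m) at irregular times, one value missing.
series = [(0, 10.0), (1, 10.2), (2, None), (5, 10.8), (6, 11.0)]
estimates = kalman_local_level(series)
print(estimates)
```

In this sketch the estimate at the missing time equals the last filtered level; a model with an input term, as in the abstract, would instead move the prediction with the infiltration series.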

The study on error, missing data and imputation of the smart card data for the transit OD construction (대중교통 OD구축을 위한 대중교통카드 데이터의 오류와 결측 분석 및 보정에 관한 연구)

  • Park, Jun-Hwan;Kim, Soon-Gwan;Cho, Chong-Suk;Heo, Min-Wook
    • Journal of Korean Society of Transportation / v.26 no.2 / pp.109-119 / 2008
  • The number of card users has grown steadily since the adoption of the smart card. Given the diverse information contained in smart card data, the rising card usage rate has many useful implications for travel pattern analysis and transportation policy. One of the most important is that the data make it possible to generate transit O/D tables easily. Generating transit O/D tables from smart card data requires filtering out erroneous and missing records, and correcting the missing data is also an important step. This study computes the level of data error and missing data and corrects the missing data for transit O/D generation.

Undecided inference using logistic regression for credit evaluation (신용평가에서 로지스틱 회귀를 이용한 미결정자 추론)

  • Hong, Chong-Sun;Jung, Min-Sub
    • Journal of the Korean Data and Information Science Society / v.22 no.2 / pp.149-157 / 2011
  • Undecided inference can be regarded as a missing data problem under assumptions such as MAR (missing at random) and MNAR (missing not at random). Under the MAR assumption, undecided inference uses a logistic regression model: the probability of default for the undecided group is obtained with the regression coefficient vector estimated from the decided group and compared with the probability of default for the decided group. Under the MNAR assumption, undecided inference uses a logistic regression model with an additional feature random vector. Simulation results based on two kinds of real data are obtained and compared. Under the MAR assumption, the misclassification rates do not differ much from the rate for the raw data. Under the MNAR assumption, however, the misclassification rates are lower than under MAR, and they decrease as the proportion of the undecided group increases.
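
The MAR step described above (fit on the decided group, then score the undecided group with the same coefficients) can be illustrated with a minimal sketch. It assumes a single hypothetical feature, a plain gradient-descent fit, and made-up data; it is not the authors' implementation and does not cover the MNAR variant.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_logistic(xs, ys, lr=0.1, epochs=2000):
    """Fit p(default | x) = sigmoid(w * x + b) by batch gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for x, y in zip(xs, ys):
            err = sigmoid(w * x + b) - y  # gradient of the log-loss
            grad_w += err * x
            grad_b += err
        w -= lr * grad_w / n
        b -= lr * grad_b / n
    return w, b


# Decided group: hypothetical standardized risk feature, label 1 = default.
decided_x = [-2.0, -1.5, -1.0, -0.5, 0.5, 1.0, 1.5, 2.0]
decided_y = [0, 0, 0, 1, 0, 1, 1, 1]
w, b = fit_logistic(decided_x, decided_y)

# Undecided group: outcomes are unobserved. Under MAR, the coefficients
# estimated from the decided group are applied to them unchanged.
undecided_x = [-1.2, 0.1, 1.8]
undecided_prob = [sigmoid(w * x + b) for x in undecided_x]
print(undecided_prob)
```

The design choice here is the MAR one: nothing about being undecided enters the model, so the undecided probabilities depend only on the observed feature.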

Outlier Filtering and Missing Data Imputation Algorithm using TCS Data (TCS데이터를 이용한 이상치제거 및 결측보정 알고리즘 개발)

  • Do, Myung-Sik;Lee, Hyang-Mee;NamKoong, Seong
    • Journal of Korean Society of Transportation / v.26 no.4 / pp.241-250 / 2008
  • With the ever-growing amount of traffic, there is an increasing need for good quality travel time information. Various outlier filtering and missing data imputation algorithms using AVI data have been proposed for interrupted and uninterrupted traffic flow. This paper develops an outlier filtering and missing data imputation algorithm using Toll Collection System (TCS) data. TCS travel time data collected from August to September 2007 were employed. Travel time data from TCS consist of records of every passing vehicle, so they have the potential to provide real-time travel time information. However, the authors found that as the distance between entry and exit tollgates increases, the variance of travel time also increases. Time gaps also appeared when the distance between tollgates was long. Finally, after analyzing existing methods, the authors propose a new method for computing representative values once abnormal and "noise" data have been removed. The proposed algorithm is effective.
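
The abstract does not spell out its filtering rule, so the sketch below swaps in a common generic technique: a median absolute deviation (MAD) filter followed by the median of the remaining values as the representative travel time. The function name, the threshold `k`, and the sample travel times are assumptions for illustration only.

```python
from statistics import median


def filter_outliers(travel_times, k=3.0):
    """Keep values within k median absolute deviations of the median."""
    med = median(travel_times)
    mad = median([abs(t - med) for t in travel_times])
    if mad == 0:
        # More than half of the values are identical; filter nothing.
        return list(travel_times)
    return [t for t in travel_times if abs(t - med) <= k * mad]


# Hypothetical travel times (seconds) between two tollgates in one interval.
# 1800 could be a vehicle that stopped at a rest area, 90 a bad record.
times = [610, 625, 598, 640, 605, 1800, 615, 90]
kept = filter_outliers(times)
representative = median(kept)
print(kept, representative)
```

A median-based rule is used here because a mean and standard deviation would themselves be distorted by values such as 1800.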

Analysis of flow relationship for replacement to IRDIMS continuous data (자동유량측정시설 연속유량자료 보완을 위한 상하류관계 검토)

  • Kwon, Young Bin;Kim, Dong Su;Cha, Jun Ho;Jung, Sung Won
    • Proceedings of the Korea Water Resources Association Conference / 2019.05a / pp.359-359 / 2019
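  • Since the multifunctional weirs of the Four Major Rivers project were installed in 2011, discharge in reaches affected by backwater has been difficult to measure with conventional methods, so automatic discharge measurement facilities were installed in each weir reach to produce discharge data in real time. Recently, however, the operating water levels of the weirs have been lowered to improve water quality and restore the rivers to a more natural state. The lowered water levels limit the measurable range and make normal operation difficult. This study aims to supplement missing and erroneous data by using the upstream-downstream discharge relationship at the Hapcheon-gun (Yulji Bridge) and Hapcheon-gun (Jeokpo Bridge) stations, located in the reach affected by the Hapcheon-Changnyeong Weir on the Nakdong River. The stations needed data supplementation because of missing data caused by the lowered water level after the 2018 gate opening and erroneous velocity data caused by entangled floating debris. To supplement the data, trends were examined using the converted discharge at each station, and discharge was estimated with a relation developed from the upstream and downstream converted discharges and the calibration measurement results. The correlation (R2) between the estimated discharge and the calibration measurements was 0.95 or higher, which is judged to be very reasonable, although some deviation appeared during gate operation. Short-term data supplementation is possible by various methods through quality control, but the method based on the upstream-downstream discharge relationship is judged appropriate for supplementing long-term data. In the future, the supplementation method will be applied to stations in other weir reaches so that real-time discharge data can be provided and continuous discharge data produced while facility improvement works necessitated by the lowered weir water levels are under way.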
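
As a rough illustration of filling gaps from an upstream-downstream relationship, the sketch below fits an ordinary least-squares line to paired discharges and uses it where the downstream value is missing. The linear form, the function names, and the discharge values are assumptions; the abstract does not give the form of its relation.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def r_squared(xs, ys, slope, intercept):
    """Coefficient of determination of the fitted line on (xs, ys)."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1.0 - ss_res / ss_tot


# Hypothetical discharges (m^3/s); None marks a missing downstream value.
upstream = [100.0, 150.0, 200.0, 250.0, 300.0]
downstream = [110.0, 160.0, None, 265.0, 315.0]

# Fit only on time steps where both stations have data.
pairs = [(u, d) for u, d in zip(upstream, downstream) if d is not None]
xs = [u for u, _ in pairs]
ys = [d for _, d in pairs]
slope, intercept = fit_line(xs, ys)
r2 = r_squared(xs, ys, slope, intercept)

# Replace missing downstream values with the fitted estimate.
filled = [d if d is not None else slope * u + intercept
          for u, d in zip(upstream, downstream)]
print(filled, r2)
```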

The Comparison of Estimation Methods for the Missing Rainfall Data with spatio-temporal Variability (시공간적 변동성을 고려한 강우의 결측치 추정 방법의 비교)

  • Kim, Byung-Sik;Noh, Hui-Seong;Kim, Hung-Soo
    • Journal of Wetlands Research / v.13 no.2 / pp.189-197 / 2011
  • This paper reviews the application of data-driven and distance-weighted methods (IDWM, IEWM, CCWM, ANN) and a radar data method to the estimation of missing rainfall data. To evaluate these methods, statistics were compared using radar and station rainfall data from the Imjin River basin. The RMSE values for CCWM and ANN ranged from 1.4 to 1.79 mm, and the RMSE values for data estimated from radar rainfall ranged from 0.05 to 2.26 mm. Spatial characteristics are taken into account in radar rainfall data rather than in station rainfall data. The results suggest that estimates based on radar data can improve the estimation of missing rainfall data.
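
A distance-weighted estimate and the RMSE used to score it can be sketched as follows. The example reads IDWM as inverse distance weighting, which is an assumption because the abstract does not expand the acronym; the station coordinates, rainfall values, and function names are hypothetical.

```python
import math


def idw_estimate(target, stations, power=2.0):
    """Estimate rainfall at `target` from (x, y, rainfall) station tuples.

    Each station is weighted by 1 / distance**power, so nearer
    stations contribute more to the estimate.
    """
    weighted_sum = weight_total = 0.0
    for x, y, rainfall in stations:
        distance = math.hypot(x - target[0], y - target[1])
        if distance == 0.0:
            return rainfall  # a station sits exactly on the target
        weight = 1.0 / distance ** power
        weighted_sum += weight * rainfall
        weight_total += weight
    return weighted_sum / weight_total


def rmse(observed, estimated):
    """Root-mean-square error between two equal-length sequences."""
    squared = [(o - e) ** 2 for o, e in zip(observed, estimated)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical neighbours (km offsets from the gauge with the missing value).
neighbours = [(1.0, 0.0, 10.0), (0.0, 2.0, 20.0), (-2.0, 0.0, 30.0)]
estimate = idw_estimate((0.0, 0.0), neighbours)
print(estimate)  # weights 1, 0.25, 0.25 give 22.5 / 1.5 = 15.0
```

To compare methods as the abstract does, known values would be withheld, estimated, and passed to `rmse` together with the withheld observations.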

Missing Hydrological Data Estimation using Neural Network and Real Time Data Reconciliation (신경망을 이용한 결측 수문자료 추정 및 실시간 자료 보정)

  • Oh, Jae-Woo;Park, Jin-Hyeog;Kim, Young-Kuk
    • Journal of Korea Water Resources Association / v.41 no.10 / pp.1059-1065 / 2008
  • Rainfall data are the most basic input for analyzing hydrological phenomena and can be missing for various reasons. In this research, a neural network based model that estimates approximate values for missing rainfall data was developed for 12 rainfall stations in the Soyang River basin to improve on existing methods. Neural networks have proven useful in many applications that deal with complicated natural phenomena, and the approach gave better results than popular offline estimation methods such as the RDS (Reciprocal Distance Squared) method and the AMM (Arithmetic Mean Method). Additionally, we propose an automated data reconciliation system built around a neural network learning processor that is capable of real-time reconciliation, so that reliable hydrological data can be transmitted online.

Application of DINEOF to Reconstruct the Missing Data from GOCI Chlorophyll-a (GOCI Chlorophyll-a 결측 자료의 복원을 위한 DINEOF 방법 적용)

  • Hwang, Do-Hyun;Jung, Hahn Chul;Ahn, Jae-Hyun;Choi, Jong-Kuk
    • Korean Journal of Remote Sensing / v.37 no.6_1 / pp.1507-1515 / 2021
  • Estimating chlorophyll-a through ocean color remote sensing makes it possible to understand the global distribution of phytoplankton and primary production. However, ocean color observed from satellites contains missing data due to clouds or weather conditions. In this study, the missing data of the GOCI (Geostationary Ocean Color Imager) chlorophyll-a product were reconstructed using DINEOF (Data INterpolation Empirical Orthogonal Functions). DINEOF reconstructs the missing data based on spatio-temporal data, and the accuracy was cross-verified by removing part of a GOCI chlorophyll-a image and comparing it with the reconstructed image. In the study area, the optimal EOF (Empirical Orthogonal Functions) mode for DINEOF was in the range of 10 to 13. The temporally and spatially reconstructed data reflected the increase in chlorophyll-a concentration in the afternoon, and noise from outliers was filtered out. DINEOF is therefore expected to be useful for reconstructing missing images, and the reconstructed data can serve as basic data for monitoring the ocean environment.