• Title/Summary/Keyword: 결측자료 추정

Search Result 84, Processing Time 0.028 seconds

Calibration of the Hargreaves Equation for the Reference Evapotranspiration Estimation on a Nation-Wide Scale (우리나라 기준 증발산량 산정을 위한 Hargreaves 계수 산정)

  • Lee, Khil-Ha;Park, Jae-Hyeon
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.6B
    • /
    • pp.675-681
    • /
    • 2008
  • In this study, the daily-based reference evapotranspiration was evaluated with Hargreaves equation at the 23 meteorological stations for the time period of 1997-2006. The Hargreaves coefficient was self-calibrated to give the best fit with Penman-Monteith evapotranspiration, being regarded as a reference. On the basis of the estimated parameter set, a generalized regression was conducted to estimate the Hargreaves evapotranspiration by just using temperature data. This study will contribute to water resources planning, irrigation schedule, and environmental management.

Application of DINEOF to Reconstruct the Missing Data from GOCI Chlorophyll-a (GOCI Chlorophyll-a 결측 자료의 복원을 위한 DINEOF 방법 적용)

  • Hwang, Do-Hyun;Jung, Hahn Chul;Ahn, Jae-Hyun;Choi, Jong-Kuk
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.6_1
    • /
    • pp.1507-1515
    • /
    • 2021
  • If chlorophyll-a is estimated through ocean color remote sensing, it is able to understand the global distribution of phytoplankton and primary production. However, there are missing data in the ocean color observed from the satellites due to the clouds or weather conditions. In thisstudy, the missing data of the GOCI (Geostationary Ocean Color Imager) chlorophyll-a product wasreconstructed by using DINEOF (Data INterpolation Empirical Orthogonal Functions). DINEOF reconstructs the missing data based on spatio-temporal data, and the accuracy was cross-verified by removing a part of the GOCI chlorophyll-a image and comparing it with the reconstructed image. In the study area, the optimal EOF (Empirical Orthogonal Functions) mode for DINEOF wasin 10-13. The temporal and spatialreconstructed data reflected the increasing chlorophyll-a concentration in the afternoon, and the noise of outliers was filtered. Therefore, it is expected that DINEOF is useful to reconstruct the missing images, also it is considered that it is able to use as basic data for monitoring the ocean environment.

An Evaluation System For Freeway Traffic Data Processing Techniques (고속도로 교통자료 처리기법 통합평가 시스템 개발)

  • Oh, Dong-Wook;Oh, Cheol;NamKoong, Sung;Jeon, Se-Kil
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.7 no.4
    • /
    • pp.13-24
    • /
    • 2008
  • Real-time traffic data are readily obtainable by traffic surveillance systems of intelligent transportation systems (ITS). Such data greatly support further applications in the field of traffic operations, planning, and safety. However, traffic data should be appropriately processed to fully exploit the benefits of data collection capability. Rather than developing individual data processing techniques, which is major concern of existing studies, this study proposes a novel methodology for evaluating data processing techniques in an integrated manner. Also, a tool for implementing the proposed methodology is developed. Users can extract useful and more reliable traffic data based upon their ultimate purpose of data usage by the evaluation tool developed in this study. Actual freeway traffic data are, as an example, fed into the evaluation tool, and results are discussed.

  • PDF

Suggestions on the Improvement of the Hydrological Data Operation II (수문관측자료 운영 개선방안에 대한 연구 II)

  • Kim, Hwi-Rin;Cho, Hyo-Seob
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2007.05a
    • /
    • pp.879-882
    • /
    • 2007
  • 수문관측자료 운영 개선방안에 대한 연구(2006, 김휘린, 조효섭)에서 건설교통부 한강홍수통제소에서 수행하고 있는 수문관측자료를 대상으로 하여 관측, 기록, 전송, 품질관리, DB구축 및 정보화의 5단계로 임의 분류하고 각 단계별 현황을 파악하여 문제점을 검토하고 개선방안을 제안하였다. 이는 적극적으로 수용되어 수문관측자료 운영에 대해 개선을 시도하였으며 본 논문에서는 이를 간략히 소개하고자한다. 첫째, 관측소 점검 대장이 조사과와 전기통신과로 이분화되어 있고 각 관측소별로 점검대장이 비치되어 있으나 이를 수기로 작성하고 있으며 자료의 업데이트 및 과거점검대장의 DB 구축과 비전산화를 지적한 바 있다. 이에 '물관련시스템 DB연계 사업'을 통해 수기로 작성된 관측시설 점검대장을 전부 DB로 구축하였고 이를 총괄하여 관리 및 점검사항을 업데이트할 수 있는 관측시설 점검대장 관리 및 입력 프로그램을 구축하였고 현재 한강홍수통제소에서 시험 운영 중에 있다. 향후 보완이 끝난 후에는 낙동강, 금강, 영산강 홍수통제소에도 확대 설치 및 운영을 실시할 예정이다. 둘째, 수문자료의 품질관리에 있어서 전산시스템에 의한 완전 자동화는 실현하기가 어려울 뿐 아니라 바람직하지 못한 결과를 가져올 수 있으므로 담당자의 수동 검토 및 처리과정은 필수적이라고 논한 바 있다. 그 후 수문자료품질관리T/F팀(조사과, 전기통신과, 하천정보센터)이 구성되었고, 홍수기 오 결측자료 발생 확인, 긴급대응 촉구, 이상치 발생원인 추정 및 대책 마련 등 고품질 수문자료를 생성하기 위해 노력하였다. T/F팀 활동사항은 타홍수통제소에서도 벤치마킹이 되고 있다. 보다 정확한 댐운영자료 공유방안을 위한 관련기관과의 협력회의가 개최되어 품질관리된 자료의 정보 공유시기, 공유방식, 자료형태 등이 결정되었다. 이는 유관기관간 품질관리된 댐운영자료의 효율적인 정보 공유체계 확립을 위한 체계를 마련하는 계기라고 사료된다. 또한, 유량측정사업 결과를 익년에 반영하는 기존 방식을 개선하기 위해 유량자료관리및분석시스템(프론티어사업에서 수행, 한국건설기술연구원 개발)을 통제소내에 설치 운영을 추진하고 있다.

  • PDF

Flood Frequency Analysis at Indogyo Station in Han River Basins (한강 인도교지점에서의 홍수빈도해석에 대한 고찰)

  • Lee, Young Seok;Kim, Kyung Duk;Heo, Jun-Haeng
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2004.05b
    • /
    • pp.1098-1102
    • /
    • 2004
  • 본 연구에서는 수도권을 포함하는 한강하류부에서 가장 중요한 측수지점중 하나인 인도교지점의 연 최대 홍수량 자료에 내해서 빈도해석을 시행하였다. 자료를 3개의 자료(자료 I : $1918\~1940$, 자료 II: $1952\~2002$, 자료 III: 결측치를 제외한 $1918\~2002$)로 구분하였으며, 수문자료에 일반적으로 많이 사용하는 13가지 확률 분포형을 적용하여 매개변수를 추정한 뒤 적합성여부를 판정하였으며, 적합도 검정방법 및 도시적인 방법을 통하여 적정 확률분포형을 선정하였고, 채택된 분포형(gamma-3, GEV, Gumbel, Weibull-2)에 내하여 확률홍수량을 산정하였다. 또한, 위치도시공식(plotting position formula)과 역사적 홍수정보(historic information)를 이용한 빈도해석 결과와도 비교${\cdot}$분석하였다. 그 결과 확률분포형 가운데에는 GEV와 Gumbel 분포형이 인도교지점의 홍수빈도해석에 적합한 것으로 판단된다.

  • PDF

Estimation of Missing Records in Daily Climate Data over the Korean Peninsula (한반도의 과거 기후 데이터 구축을 위한 누락된 기록 추정)

  • Noh, Gyu-Ho;Ahn, Kuk-Hyun
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2020.06a
    • /
    • pp.135-135
    • /
    • 2020
  • 우리나라의 기후 자료는 일반적으로 기상청에서 발표하는 종관기상관측(ASOS)과 방재기상관측(AWS), 그리고 북한이 세계기상기구(WMO, World Meteorogical Organization)의 기상통신망(GTS)을 통해 보낸 북한기상관측(NKO)을 사용 할 수 있다. 그러나 이 중 40년 이상의 완전한 관측 자료를 얻을 수 있는 건 ASOS가 유일하지만 공간적인 표현에 한계를 갖고 있다. AWS는 관측소가 많다는 장점이 있지만 관측 기간이 길지 않고 이용 가능한 기간에도 관측이 연속적이지 못한 경우가 많다. NKO는 비록 27개의 관측소가 있지만 많은 데이터가 누락되어 일별 기후자료의 사용에 한계를 갖고 있다. 이러한 미관측 기간이나 관측 자료의 누락은 연속적인 시계열 자료분석을 기반으로 하는 수자원 모델링에 있어서 문제를 야기한다. 본 연구는 1973년부터 2019년까지 47년의 신뢰도 높은 한반도 일일 기후 자료를 구축하기 위해 다양한 방법론을 비교하였다. 추정에 사용한 방법은 총 7개로 EM algorithm for probabilistic principal components (PPCA-EM), Inverse distance weight method (IDWM), Nearest neighbor method (NNM), Multivariate normal copulas (Copula), Elastic net model (Elastic), Ordinary kriging (OK), Regularized principal components with EM algorithm (RPCA-EM)를 살펴보았다. 다양한 형태의 결측치를 가정하여 그 결과값을 비교하였고 이는 Root mean squared error(RMSE), Kling-Gupta efficiency(KGE), Nash-Sutcliffe efficiency(NSE)를 통해 평가하였다. 최종 선택된 방법론을 통하여 한반도 전역을 그리드 기반의 강수 및 최저온도/최고온도의 일별자료로 생성하였다.

  • PDF

Missing values imputation for time course gene expression data using the pattern consistency index adaptive nearest neighbors (시간경로 유전자 발현자료에서 패턴일치지수와 적응 최근접 이웃을 활용한 결측값 대치법)

  • Shin, Heyseo;Kim, Dongjae
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.3
    • /
    • pp.269-280
    • /
    • 2020
  • Time course gene expression data is a large amount of data observed over time in microarray experiments. This data can also simultaneously identify the level of gene expression. However, the experiment process is complex, resulting in frequent missing values due to various causes. In this paper, we propose a pattern consistency index adaptive nearest neighbors as a method of missing value imputation. This method combines the adaptive nearest neighbors (ANN) method that reflects local characteristics and the pattern consistency index that considers consistent degree for gene expression between observations over time points. We conducted a Monte Carlo simulation study to evaluate the usefulness of proposed the pattern consistency index adaptive nearest neighbors (PANN) method for two yeast time course data.

A joint modeling of longitudinal zero-inflated count data and time to event data (경시적 영과잉 가산자료와 생존자료의 결합모형)

  • Kim, Donguk;Chun, Jihun
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.7
    • /
    • pp.1459-1473
    • /
    • 2016
  • Both longitudinal data and survival data are collected simultaneously in longitudinal data which are observed throughout the passage of time. In this case, the effect of the independent variable becomes biased (provided that sole use of longitudinal data analysis does not consider the relation between both data used) if the missing that occurred in the longitudinal data is non-ignorable because it is caused by a correlation with the survival data. A joint model of longitudinal data and survival data was studied as a solution for such problem in order to obtain an unbiased result by considering the survival model for the cause of missing. In this paper, a joint model of the longitudinal zero-inflated count data and survival data is studied by replacing the longitudinal part with zero-inflated count data. A hurdle model and proportional hazards model were used for each longitudinal zero inflated count data and survival data; in addition, both sub-models were linked based on the assumption that the random effect of sub-models follow the multivariate normal distribution. We used the EM algorithm for the maximum likelihood estimator of parameters and estimated standard errors of parameters were calculated using the profile likelihood method. In simulation, we observed a better performance of the joint model in bias and coverage probability compared to the separate model.

An Estimation of Link Travel Time by Using BMS Data (BMS 데이터를 활용한 링크단위 여행시간 산출방안에 관한 연구)

  • Jeon, Ok-Hee;Ahn, Gye-Hyeong;Hyun, Cheol-Seung;Hong, Kyung-Sik;Kim, Hyun-Ju;Lee, Choul-Ki
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.13 no.3
    • /
    • pp.78-88
    • /
    • 2014
  • Now, UTIS collects and provides traffic information by building RSE 1,150(unit) and OBE about 51,000(vehicle). it's inevitable to enlarge traffic information sources which use to improve quality of UTIS traffic information for Stabilizing UTIS's service. but there are missing data sections. And, In this study as a way to overcome these problems, based on BIS(Bus information system) installed and operating in the capital area to develop normal vehicle's link transit time estimation model which is used realtime collecting BMS data, we'll utilize the model to provide missing data section's information. For these problem, we selected partial section of suwon-city, anyang-city followed by drive only way or not and conducted model estimating and verification each of BMS data and UTIS traffic information. Consequently, Case2,4,6,8 presented highly credibility between UTIS communication data and estimated value but In the Case 3,5 we determined to replace communication data of UTIS' missing data section too hard for large error. So we need to apply high credibility model formula adjusting road managing condition and the situation of object section.

Efficient Outlier Detection of the Water Temperature Monitoring Data (수온 관측 자료의 효율적인 이상 자료 탐지)

  • Cho, Hongyeon;Jeong, Shin Taek;Ko, Dong Hui;Son, Kyeong-Pyo
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.26 no.5
    • /
    • pp.285-291
    • /
    • 2014
  • The statistical information of the coastal water temperature monitoring data can be biased because of outliers and missing intervals. Though a number of outlier detection methods have been developed, their applications are very limited to the in-situ monitoring data because of the assumptions of the a prior information of the outliers and no-missing condition, and the excessive computational time for some methods. In this study, the practical robust method is developed that can be efficiently and effectively detect the outliers in case of the big-data. This model is composed of these two parts, one part is the construction part of the approximate components of the monitoring data using the robust smoothing and data re-sampling method, and the other part is the main iterative outlier detection part using the detailed components of the data estimated by the approximate components. This model is tested using the two-years 5-minute interval water temperature data in Lake Saemangeum. It can be estimated that the outlier proportion of the data is about 1.6-3.7%. It shows that most of the outliers in the data are detected and removed with satisfaction by the model. In order to effectively detect and remove the outliers, the outlier detection using the long-span smoothing should be applied earlier than that using the short-span smoothing.