http://dx.doi.org/10.5351/KJAS.2016.29.3.549

A comparison of imputation methods for the consecutive missing temperature data  

Kim, Hee-Kyung (Department of Statistics, Dongguk University)
Kang, In-Kyeong (Department of Statistics, Dongguk University)
Lee, Jae-Won (KMA National Climate Data Center)
Lee, Yung-Seop (Department of Statistics, Dongguk University)
Publication Information
The Korean Journal of Applied Statistics / v.29, no.3, 2016, pp. 549-557
Abstract
Consecutive missing values are likely to occur in long climate records because of system errors or defective equipment, and such runs of missing values are difficult to impute. This problem can be overcome by imputing the missing values from reference time series, which must consist of series similar to the one that contains the missing values. We performed a simulation to compare three imputation methods (the adjusted normal ratio method, the regression method, and the IDW method) for completing the missing values of a time series. A comparison of the three methods on daily mean temperatures at 14 climatological stations indicated that the IDW method performed better than the others at the south seaside stations, while the regression method performed better than the others at most of the remaining stations.
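As an illustration of imputing from reference series, the following is a minimal sketch of inverse distance weighting (IDW) applied to a run of consecutive missing daily temperatures. It is not the authors' implementation: the abstract does not give the exact formulation, so the function name `idw_impute`, the exponent `power=2.0`, the distances, and the temperature values are all assumptions chosen for illustration.

```python
def idw_impute(target, references, distances, power=2.0):
    """Fill None entries in `target` with an inverse-distance-weighted
    mean of the reference series observed on the same day.

    target     : list of floats, with None marking missing days
    references : list of reference series, each the same length as target
    distances  : distance from the target station to each reference station
    power      : IDW exponent (assumed value, not taken from the paper)
    """
    if len(references) != len(distances):
        raise ValueError("one distance is required per reference series")

    # Closer stations get larger weights: w_i = 1 / d_i ** power
    weights = [1.0 / d ** power for d in distances]

    filled = list(target)
    for day, value in enumerate(target):
        if value is not None:
            continue  # observed value, nothing to impute
        numerator = 0.0
        denominator = 0.0
        for series, weight in zip(references, weights):
            if series[day] is not None:  # skip references that are also missing
                numerator += weight * series[day]
                denominator += weight
        if denominator > 0:
            filled[day] = numerator / denominator
    return filled


# Hypothetical data: two consecutive missing days at the target station.
target = [10.0, None, None, 13.0]
references = [
    [9.0, 10.0, 11.0, 12.0],   # reference station at an assumed distance of 10
    [12.0, 14.0, 16.0, 15.0],  # reference station at an assumed distance of 20
]
filled = idw_impute(target, references, distances=[10.0, 20.0])
```

Because the weights depend only on distance, a gap of any length is filled day by day from whatever reference stations reported on that day. A regression approach would instead fit the target series on the reference series over the period where both are observed and predict the missing days from that fit.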
Keywords
consecutive missing value; missing value imputation; adjusted normal ratio method; regression method; IDW method