• Title/Summary/Keyword: 회귀 크리깅

Search Result 34, Processing Time 0.021 seconds

Spatial Prediction of Wind Speed Data (풍속 자료의 공간예측)

  • Jeong, Seung-Hwan;Park, Man-Sik;Kim, Kee-Whan
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.2
    • /
    • pp.345-356
    • /
    • 2010
  • In this paper, we introduce the linear regression model taking the parametric spatial association structure into account and employ it to five-year averaged wind speed data measured at 460 meteorological monitoring stations in South Korea. From the prediction map obtained by the model with spatial association parameters, we can see that inland area has smaller wind speed than coastal regions. When comparing the spatial linear regression model with classical one by using one-leave-out cross-validation, the former outperforms the latter in terms of similarity between the observations and the corresponding predictions and coverage rate of 95% prediction intervals.

The Characteristics of Groundwater Quality in the Youngsan and Sumjin River Basins Using Geostatistical Methods (지구통계 기법을 이용한 영산강.섬진강 유역의 지하수 수질특성 연구)

  • 정상용;심병완;김규범;강동환;박희영
    • Journal of the Korean Society of Groundwater Environment
    • /
    • v.7 no.3
    • /
    • pp.125-132
    • /
    • 2000
  • pH, EC and TDS are basic components in the investigation of groundwater quality, and are very important to the preliminary assessment of groundwater quality. These three chemical components investigated at the Youngsan and Sumjin river basins in 1998 suggest that the groundwater quality is generally good in these basins. Linear regression analysis shows that TDS versus EC has an linear correlation, but EC versus pH, and TDS versus pH have nearly no correlation. The relation of TDS and EC is 1.0 mg/1=1.52 $mu\textrm{S}$/cm, and it is the quality of natural water. In geostatistical analysis. three kinds of data are stationary random functions and they have exponential variograms. According to the isopleth maps of the groundwater quality, the groundwater quality of the Youngsan river basin is more contaminated than that of the Sumjin river basin. The isopleth maps of TDS and EC show very similar patterns because of the strong correlation between TDS and EC. The minimum and maximum values of the groundwater quality data are not reflected on the isopleth maps because kriging produces smooth distributions with minimum estimation variances.

  • PDF

Comparative Assessment of Linear Regression and Machine Learning for Analyzing the Spatial Distribution of Ground-level NO2 Concentrations: A Case Study for Seoul, Korea (서울 지역 지상 NO2 농도 공간 분포 분석을 위한 회귀 모델 및 기계학습 기법 비교)

  • Kang, Eunjin;Yoo, Cheolhee;Shin, Yeji;Cho, Dongjin;Im, Jungho
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.6_1
    • /
    • pp.1739-1756
    • /
    • 2021
  • Atmospheric nitrogen dioxide (NO2) is mainly caused by anthropogenic emissions. It contributes to the formation of secondary pollutants and ozone through chemical reactions, and adversely affects human health. Although ground stations to monitor NO2 concentrations in real time are operated in Korea, they have a limitation that it is difficult to analyze the spatial distribution of NO2 concentrations, especially over the areas with no stations. Therefore, this study conducted a comparative experiment of spatial interpolation of NO2 concentrations based on two linear-regression methods(i.e., multi linear regression (MLR), and regression kriging (RK)), and two machine learning approaches (i.e., random forest (RF), and support vector regression (SVR)) for the year of 2020. Four approaches were compared using leave-one-out-cross validation (LOOCV). The daily LOOCV results showed that MLR, RK, and SVR produced the average daily index of agreement (IOA) of 0.57, which was higher than that of RF (0.50). The average daily normalized root mean square error of RK was 0.9483%, which was slightly lower than those of the other models. MLR, RK and SVR showed similar seasonal distribution patterns, and the dynamic range of the resultant NO2 concentrations from these three models was similar while that from RF was relatively small. The multivariate linear regression approaches are expected to be a promising method for spatial interpolation of ground-level NO2 concentrations and other parameters in urban areas.

Estimation of Representative Area-Level Concentrations of Particulate Matter(PM10) in Seoul, Korea (미세먼지(PM10)의 지역적 대푯값 산정 방법에 관한 연구 - 서울특별시를 대상으로)

  • SONG, In-Sang;KIM, Sun-Young
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.19 no.4
    • /
    • pp.118-129
    • /
    • 2016
  • Many epidemiological studies, relying on administrative air pollution monitoring data, have reported the association between particulate matter ($PM_{10}$) air pollution and human health. These monitoring data were collected at a limited number of fixed sites, whereas government-generated health data are aggregated at the area level. To link these two data types for assessing health effects, it is necessary to estimate area-level concentrations of $PM_{10}$. In this study, we estimated district (Gu)-level $PM_{10}$ concentrations using a previously developed pointwise exposure prediction model for $PM_{10}$ and three types of point locations in Seoul, Korea. These points included 16,230 centroids of the largest census output residential areas, 422 community service centers, and 610 centroids on the 1km grid. After creating three types of points, we predicted $PM_{10}$ annual average concentrations at all locations and calculated Gu averages of predicted $PM_{10}$ concentrations as representative Gu-estimates. Then, we compared estimates to each other and to measurements. Prediction-based Gu-level estimates showed higher correlations with measurement-based estimates as prediction locations became more population representative ($R^2=0.06-0.59$). Among the three estimates, grid-based estimates gave lowest correlations compared to the other two(0.35-0.47). This study provides an approach for estimating area-level air pollution concentrations and assesses air pollution health effects using national-scale administrative health data.