• Title/Summary/Keyword: RMSE (Root Mean Square Error)

Search Result 648, Processing Time 0.031 seconds

Development of a Grid-based Daily Watershed Runoff Model and the Evaluation of Its Applicability (분포형 유역 일유출 모형의 개발 및 적용성 검토)

  • Hong, Woo-Yong;Park, Geun-Ae;Jeong, In-Kyun;Kim, Seong-Joon
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.30 no.5B
    • /
    • pp.459-469
    • /
    • 2010
  • This study is to develop a grid-based daily runoff model considering seasonal vegetation canopy condition. The model simulates the temporal and spatial variation of runoff components (surface, interflow, and baseflow), evapotranspiration (ET) and soil moisture contents of each grid element. The model is composed of three main modules of runoff, ET, and soil moisture. The total runoff was simulated by using soil water storage capacity of the day, and was allocated by introducing recession curves of each runoff component. The ET was calculated by Penman-Monteith method considering MODIS leaf area index (LAI). The daily soil moisture was routed by soil water balance equation. The model was evaluated for 930 $km^2$ Yongdam watershed. The model uses 1 km spatial data on landuse, soil, boundary, MODIS LAI. The daily weather data was built using IDW method (2000-2008). Model calibration was carried out to compare with the observed streamflow at the watershed outlet. The Nash-Sutcliffe model efficiency was 0.78~0.93. The watershed soil moisture was sensitive to precipitation and soil texture, consequently affected the streamflow, and the evapotranspiration responded to landuse type.

Development and application of cellular automata-based urban inundation and water cycle model CAW (셀룰러 오토마타 기반 도시침수 및 물순환 해석 모형 CAW의 개발 및 적용)

  • Lee, Songhee;Choi, Hyeonjin;Woo, Hyuna;Kim, Minyoung;Lee, Eunhyung;Kim, Sanghyun;Noh, Seong Jin
    • Journal of Korea Water Resources Association
    • /
    • v.57 no.3
    • /
    • pp.165-179
    • /
    • 2024
  • It is crucial to have a comprehensive understanding of inundation and water cycle in urban areas for mitigating flood risks and sustainable water resources management. In this study, we developed a Cellular Automata-based integrated Water cycle model (CAW). A comparative analysis with physics-based and conventional cellular automata-based models was performed in an urban watershed in Portland, USA, to evaluate the adequacy of spatiotemporal inundation simulation in the context of a high-resolution setup. A high similarity was found in the maximum inundation maps by CAW and Weighted Cellular Automata 2 Dimension (WCA2D) model presumably due to the same diffuse wave assumption, showing an average Root-Mean-Square-Error (RMSE) value of 1.3 cm and high scores of binary pattern indices (HR 0.91, FAR 0.02, CSI 0.90). Furthermore, through multiple simulation experiments estimating the effects of land cover and soil conditions on inundation and infiltration, as the impermeability rate increased by 41%, the infiltration decreased by 54% (4.16 mm/m2) while the maximum inundation depth increased by 10% (2.19 mm/m2). It was expected that high-resolution integrated inundation and water cycle analysis considering various land cover and soil conditions in urban areas would be feasible using CAW.

Monitoring Ground-level SO2 Concentrations Based on a Stacking Ensemble Approach Using Satellite Data and Numerical Models (위성 자료와 수치모델 자료를 활용한 스태킹 앙상블 기반 SO2 지상농도 추정)

  • Choi, Hyunyoung;Kang, Yoojin;Im, Jungho;Shin, Minso;Park, Seohui;Kim, Sang-Min
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.5_3
    • /
    • pp.1053-1066
    • /
    • 2020
  • Sulfur dioxide (SO2) is primarily released through industrial, residential, and transportation activities, and creates secondary air pollutants through chemical reactions in the atmosphere. Long-term exposure to SO2 can result in a negative effect on the human body causing respiratory or cardiovascular disease, which makes the effective and continuous monitoring of SO2 crucial. In South Korea, SO2 monitoring at ground stations has been performed, but this does not provide spatially continuous information of SO2 concentrations. Thus, this research estimated spatially continuous ground-level SO2 concentrations at 1 km resolution over South Korea through the synergistic use of satellite data and numerical models. A stacking ensemble approach, fusing multiple machine learning algorithms at two levels (i.e., base and meta), was adopted for ground-level SO2 estimation using data from January 2015 to April 2019. Random forest and extreme gradient boosting were used as based models and multiple linear regression was adopted for the meta-model. The cross-validation results showed that the meta-model produced the improved performance by 25% compared to the base models, resulting in the correlation coefficient of 0.48 and root-mean-square-error of 0.0032 ppm. In addition, the temporal transferability of the approach was evaluated for one-year data which were not used in the model development. The spatial distribution of ground-level SO2 concentrations based on the proposed model agreed with the general seasonality of SO2 and the temporal patterns of emission sources.

The PRISM-based Rainfall Mapping at an Enhanced Grid Cell Resolution in Complex Terrain (복잡지형 고해상도 격자망에서의 PRISM 기반 강수추정법)

  • Chung, U-Ran;Yun, Kyung-Dahm;Cho, Kyung-Sook;Yi, Jae-Hyun;Yun, Jin-I.
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.11 no.2
    • /
    • pp.72-78
    • /
    • 2009
  • The demand for rainfall data in gridded digital formats has increased in recent years due to the close linkage between hydrological models and decision support systems using the geographic information system. One of the most widely used tools for digital rainfall mapping is the PRISM (parameter-elevation regressions on independent slopes model) which uses point data (rain gauge stations), a digital elevation model (DEM), and other spatial datasets to generate repeatable estimates of monthly and annual precipitation. In the PRISM, rain gauge stations are assigned with weights that account for other climatically important factors besides elevation, and aspects and the topographic exposure are simulated by dividing the terrain into topographic facets. The size of facet or grid cell resolution is determined by the density of rain gauge stations and a $5{\times}5km$ grid cell is considered as the lowest limit under the situation in Korea. The PRISM algorithms using a 270m DEM for South Korea were implemented in a script language environment (Python) and relevant weights for each 270m grid cell were derived from the monthly data from 432 official rain gauge stations. Weighted monthly precipitation data from at least 5 nearby stations for each grid cell were regressed to the elevation and the selected linear regression equations with the 270m DEM were used to generate a digital precipitation map of South Korea at 270m resolution. Among 1.25 million grid cells, precipitation estimates at 166 cells, where the measurements were made by the Korea Water Corporation rain gauge network, were extracted and the monthly estimation errors were evaluated. An average of 10% reduction in the root mean square error (RMSE) was found for any months with more than 100mm monthly precipitation compared to the RMSE associated with the original 5km PRISM estimates. This modified PRISM may be used for rainfall mapping in rainy season (May to September) at much higher spatial resolution than the original PRISM without losing the data accuracy.

Estimation of TROPOMI-derived Ground-level SO2 Concentrations Using Machine Learning Over East Asia (기계학습을 활용한 동아시아 지역의 TROPOMI 기반 SO2 지상농도 추정)

  • Choi, Hyunyoung;Kang, Yoojin;Im, Jungho
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.2
    • /
    • pp.275-290
    • /
    • 2021
  • Sulfur dioxide (SO2) in the atmosphere is mainly generated from anthropogenic emission sources. It forms ultra-fine particulate matter through chemical reaction and has harmful effect on both the environment and human health. In particular, ground-level SO2 concentrations are closely related to human activities. Satellite observations such as TROPOMI (TROPOspheric Monitoring Instrument)-derived column density data can provide spatially continuous monitoring of ground-level SO2 concentrations. This study aims to propose a 2-step residual corrected model to estimate ground-level SO2 concentrations through the synergistic use of satellite data and numerical model output. Random forest machine learning was adopted in the 2-step residual corrected model. The proposed model was evaluated through three cross-validations (i.e., random, spatial and temporal). The results showed that the model produced slopes of 1.14-1.25, R values of 0.55-0.65, and relative root-mean-square-error of 58-63%, which were improved by 10% for slopes and 3% for R and rRMSE when compared to the model without residual correction. The model performance by country was slightly reduced in Japan, often resulting in overestimation, where the sample size was small, and the concentration level was relatively low. The spatial and temporal distributions of SO2 produced by the model agreed with those of the in-situ measurements, especially over Yangtze River Delta in China and Seoul Metropolitan Area in South Korea, which are highly dependent on the characteristics of anthropogenic emission sources. The model proposed in this study can be used for long-term monitoring of ground-level SO2 concentrations on both the spatial and temporal domains.

Characteristics of the Differences between Significant Wave Height at Ieodo Ocean Research Station and Satellite Altimeter-measured Data over a Decade (2004~2016) (이어도 해양과학기지 관측 파고와 인공위성 관측 유의파고 차이의 특성 연구 (2004~2016))

  • WOO, HYE-JIN;PARK, KYUNG-AE;BYUN, DO-SEONG;LEE, JOOYOUNG;LEE, EUNIL
    • The Sea:JOURNAL OF THE KOREAN SOCIETY OF OCEANOGRAPHY
    • /
    • v.23 no.1
    • /
    • pp.1-19
    • /
    • 2018
  • In order to compare significant wave height (SWH) data from multi-satellites (GFO, Jason-1, Envisat, Jason-2, Cryosat-2, SARAL) and SWH measurements from Ieodo Ocean Research Station (IORS), we constructed a 12 year matchup database between satellite and IORS measurements from December 2004 to May 2016. The satellite SWH showed a root mean square error (RMSE) of about 0.34 m and a positive bias of 0.17 m with respect to the IORS wave height. The satellite data and IORS wave height data did not show any specific seasonal variations or interannual variability, which confirmed the consistency of satellite data. The effect of the wind field on the difference of the SWH data between satellite and IORS was investigated. As a result, a similar result was observed in which a positive biases of about 0.17 m occurred on all satellites. In order to understand the effects of topography and the influence of the construction structures of IORS on the SWH differences, we investigated the directional dependency of differences of wave height, however, no statistically significant characteristics of the differences were revealed. As a result of analyzing the characteristics of the error as a function of the distance between the satellite and the IORS, the biases are almost constant about 0.14 m regardless of the distance. By contrast, the amplitude of the SWH differences, the maximum value minus the minimum value at a given distance range, was found to increase linearly as the distance was increased. On the other hand, as a result of the accuracy evaluation of the satellite SWH from the Donghae marine meteorological buoy of Korea Meteorological Administration, the satellite SWH presented a relatively small RMSE of about 0.27 m and no specific characteristics of bias such as the validation results at IORS. In this paper, we propose a conversion formula to correct the significant wave data of IORS with the satellite SWH data. In addition, this study emphasizes that the reliability of data should be prioritized to be extensively utilized and presents specific methods and strategies in order to upgrade the IORS as an international world-wide marine observation site.

Growth and Predictive Model of Wild-type Salmonella spp. on Temperature and Time during Cut and Package Processing in Cold Pork Meats (냉장돈육 가공공정 온도와 시간에서의 Wild-type Salmonella spp.의 성장특성 및 예측모델)

  • Song, Ju Yeon;Kim, Yong Soo;Hong, Chong Hae;Bahk, Gyung Jin
    • Journal of Food Hygiene and Safety
    • /
    • v.28 no.1
    • /
    • pp.7-12
    • /
    • 2013
  • This study presents the influence on growth properties determined using a novel predictive growth model of wild-type Salmonella spp. KSC 101 by variations in the temperature and time during cut packaging in cold, uncooked pork meat. The experiment performed for model development included an arrangement of different temperatures ($0^{\circ}C$, $5^{\circ}C$, $10^{\circ}C$, $15^{\circ}C$, and $20^{\circ}C$) and time durations (0, 1, 2, and 3 hours) that reflect actual pork-cut and packaging processes. No growth was observed at $0^{\circ}C$ and $5^{\circ}C$, whereas some growth was observed at $10^{\circ}C$, $15^{\circ}C$, and $20^{\circ}C$, with a mean increase of only 0.34 log CFU/g. The growth observed at $20^{\circ}C$ was more robust than that observed at $15^{\circ}C$, but the difference was not statistically significant (p > 0.05). However, compared with PMP (Pathogen Modeling Program), the wild-type Salmonella spp. KSC 101 showed a more rapid growth. We used the Gompertz 4 parameter equation as the primary model, and the exponential decay formula as the secondary model. The estimated $R^2$ values were 0.99 or higher. The developed model was evaluated by comparison of the experimental and predictive values, and the values were in agreement with the ${\pm}0.5$ log CFU/g, although the RMSE (Root mean square error) value was 0.103, which indicates a slight overestimation. Therefore, we suggest that the developed predictive growth model would be useful as a tool for evaluating sanitation criteria in pork cut-packaging processes.

A Study on Estimating Rice Yield in DPRK Using MODIS NDVI and Rainfall Data (MODIS NDVI와 강수량 자료를 이용한 북한의 벼 수량 추정 연구)

  • Hong, Suk Young;Na, Sang-Il;Lee, Kyung-Do;Kim, Yong-Seok;Baek, Shin-Chul
    • Korean Journal of Remote Sensing
    • /
    • v.31 no.5
    • /
    • pp.441-448
    • /
    • 2015
  • Lack of agricultural information for food supply and demand in Democratic People's republic Korea(DPRK) make people sometimes confused for right and timely decision for policy support. We carried out a study to estimate paddy rice yield in DPRK using MODIS NDVI reflecting rice growth and climate data. Mean of MODIS $NDVI_{max}$ in paddy rice over the country acquired and processed from 2002 to 2014 and accumulated rainfall collected from 27 weather stations in September from 2002 to 2014 were used to estimated paddy rice yield in DPRK. Coefficient of determination of the multiple regression model was 0.44 and Root Mean Square Error(RMSE) was 0.27 ton/ha. Two-way analysis of variance resulted in 3.0983 of F ratio and 0.1008 of p value. Estimated milled rice yield showed the lowest value as 2.71 ton/ha in 2007, which was consistent with RDA rice yield statistics and the highest value as 3.54 ton/ha in 2006, which was not consistent with the statistics. Scatter plot of estimated rice yield and the rice yield statistics implied that estimated rice yield was higher when the rice yield statistics was less than 3.3 ton/ha and lower when the rice yield statistics was greater than 3.3 ton/ha. Limitation of rice yield model was due to lower quality of climate and statistics data, possible cloud contamination of time-series NDVI data, and crop mask for rice paddy, and coarse spatial resolution of MODIS satellite data. Selection of representative areas for paddy rice consisting of homogeneous pixels and utilization of satellite-based weather information can improve the input parameters for rice yield model in DPRK in the future.

Validation of Satellite SMAP Sea Surface Salinity using Ieodo Ocean Research Station Data (이어도 해양과학기지 자료를 활용한 SMAP 인공위성 염분 검증)

  • Park, Jae-Jin;Park, Kyung-Ae;Kim, Hee-Young;Lee, Eunil;Byun, Do-Seong;Jeong, Kwang-Yeong
    • Journal of the Korean earth science society
    • /
    • v.41 no.5
    • /
    • pp.469-477
    • /
    • 2020
  • Salinity is not only an important variable that determines the density of the ocean but also one of the main parameters representing the global water cycle. Ocean salinity observations have been mainly conducted using ships, Argo floats, and buoys. Since the first satellite salinity was launched in 2009, it is also possible to observe sea surface salinity in the global ocean using satellite salinity data. However, the satellite salinity data contain various errors, it is necessary to validate its accuracy before applying it as research data. In this study, the salinity accuracy between the Soil Moisture Active Passive (SMAP) satellite salinity data and the in-situ salinity data provided by the Ieodo ocean research station was evaluated, and the error characteristics were analyzed from April 2015 to August 2020. As a result, a total of 314 match-up points were produced, and the root mean square error (RMSE) and mean bias of salinity were 1.79 and 0.91 psu, respectively. Overall, the satellite salinity was overestimated compare to the in-situ salinity. Satellite salinity is dependent on various marine environmental factors such as season, sea surface temperature (SST), and wind speed. In summer, the difference between the satellite salinity and the in-situ salinity was less than 0.18 psu. This means that the accuracy of satellite salinity increases at high SST rather than at low SST. This accuracy was affected by the sensitivity of the sensor. Likewise, the error was reduced at wind speeds greater than 5 m s-1. This study suggests that satellite-derived salinity data should be used in coastal areas for limited use by checking if they are suitable for specific research purposes.

Application of Machine Learning Algorithm and Remote-sensed Data to Estimate Forest Gross Primary Production at Multi-sites Level (산림 총일차생산량 예측의 공간적 확장을 위한 인공위성 자료와 기계학습 알고리즘의 활용)

  • Lee, Bora;Kim, Eunsook;Lim, Jong-Hwan;Kang, Minseok;Kim, Joon
    • Korean Journal of Remote Sensing
    • /
    • v.35 no.6_2
    • /
    • pp.1117-1132
    • /
    • 2019
  • Forest covers 30% of the Earth's land area and plays an important role in global carbon flux through its ability to store much greater amounts of carbon than other terrestrial ecosystems. The Gross Primary Production (GPP) represents the productivity of forest ecosystems according to climate change and its effect on the phenology, health, and carbon cycle. In this study, we estimated the daily GPP for a forest ecosystem using remote-sensed data from Moderate Resolution Imaging Spectroradiometer (MODIS) and machine learning algorithms Support Vector Machine (SVM). MODIS products were employed to train the SVM model from 75% to 80% data of the total study period and validated using eddy covariance measurement (EC) data at the six flux tower sites. We also compare the GPP derived from EC and MODIS (MYD17). The MODIS products made use of two data sets: one for Processed MODIS that included calculated by combined products (e.g., Vapor Pressure Deficit), another one for Unprocessed MODIS that used MODIS products without any combined calculation. Statistical analyses, including Pearson correlation coefficient (R), mean squared error (MSE), and root mean square error (RMSE) were used to evaluate the outcomes of the model. In general, the SVM model trained by the Unprocessed MODIS (R = 0.77 - 0.94, p < 0.001) derived from the multi-sites outperformed those trained at a single-site (R = 0.75 - 0.95, p < 0.001). These results show better performance trained by the data including various events and suggest the possibility of using remote-sensed data without complex processes to estimate GPP such as non-stationary ecological processes.