• Title/Summary/Keyword: RMSE (Root Mean Square Error)

Search Result 648, Processing Time 0.032 seconds

Predicting Forest Gross Primary Production Using Machine Learning Algorithms (머신러닝 기법의 산림 총일차생산성 예측 모델 비교)

  • Lee, Bora;Jang, Keunchang;Kim, Eunsook;Kang, Minseok;Chun, Jung-Hwa;Lim, Jong-Hwan
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.21 no.1
    • /
    • pp.29-41
    • /
    • 2019
  • Terrestrial Gross Primary Production (GPP) is the largest global carbon flux, and forest ecosystems are important because of the ability to store much more significant amounts of carbon than other terrestrial ecosystems. There have been several attempts to estimate GPP using mechanism-based models. However, mechanism-based models including biological, chemical, and physical processes are limited due to a lack of flexibility in predicting non-stationary ecological processes, which are caused by a local and global change. Instead mechanism-free methods are strongly recommended to estimate nonlinear dynamics that occur in nature like GPP. Therefore, we used the mechanism-free machine learning techniques to estimate the daily GPP. In this study, support vector machine (SVM), random forest (RF) and artificial neural network (ANN) were used and compared with the traditional multiple linear regression model (LM). MODIS products and meteorological parameters from eddy covariance data were employed to train the machine learning and LM models from 2006 to 2013. GPP prediction models were compared with daily GPP from eddy covariance measurement in a deciduous forest in South Korea in 2014 and 2015. Statistical analysis including correlation coefficient (R), root mean square error (RMSE) and mean squared error (MSE) were used to evaluate the performance of models. In general, the models from machine-learning algorithms (R = 0.85 - 0.93, MSE = 1.00 - 2.05, p < 0.001) showed better performance than linear regression model (R = 0.82 - 0.92, MSE = 1.24 - 2.45, p < 0.001). These results provide insight into high predictability and the possibility of expansion through the use of the mechanism-free machine-learning models and remote sensing for predicting non-stationary ecological processes such as seasonal GPP.

Analysis of Empirical Multiple Linear Regression Models for the Production of PM2.5 Concentrations (PM2.5농도 산출을 위한 경험적 다중선형 모델 분석)

  • Choo, Gyo-Hwang;Lee, Kyu-Tae;Jeong, Myeong-Jae
    • Journal of the Korean earth science society
    • /
    • v.38 no.4
    • /
    • pp.283-292
    • /
    • 2017
  • In this study, the empirical models were established to estimate the concentrations of surface-level $PM_{2.5}$ over Seoul, Korea from 1 January 2012 to 31 December 2013. We used six different multiple linear regression models with aerosol optical thickness (AOT), ${\AA}ngstr{\ddot{o}}m$ exponents (AE) data from Moderate Resolution Imaging Spectroradiometer (MODIS) aboard Terra and Aqua satellites, meteorological data, and planetary boundary layer depth (PBLD) data. The results showed that $M_6$ was the best empirical model and AOT, AE, relative humidity (RH), wind speed, wind direction, PBLD, and air temperature data were used as input data. Statistical analysis showed that the result between the observed $PM_{2.5}$ and the estimated $PM_{2.5}$ concentrations using $M_6$ model were correlations (R=0.62) and root square mean error ($RMSE=10.70{\mu}gm^{-3}$). In addition, our study show that the relation strongly depends on the seasons due to seasonal observation characteristics of AOT, with a relatively better correlation in spring (R=0.66) and autumntime (R=0.75) than summer and wintertime (R was about 0.38 and 0.56). These results were due to cloud contamination of summertime and the influence of snow/ice surface of wintertime, compared with those of other seasons. Therefore, the empirical multiple linear regression model used in this study showed that the AOT data retrieved from the satellite was important a dominant variable and we will need to use additional weather variables to improve the results of $PM_{2.5}$. Also, the result calculated for $PM_{2.5}$ using empirical multi linear regression model will be useful as a method to enable monitoring of atmospheric environment from satellite and ground meteorological data.

Estimation of Chlorophyll-a Concentrations in the Nakdong River Using High-Resolution Satellite Image (고해상도 위성영상을 이용한 낙동강 유역의 클로로필-a 농도 추정)

  • Choe, Eun-Young;Lee, Jae-Woon;Lee, Jae-Kwan
    • Korean Journal of Remote Sensing
    • /
    • v.27 no.5
    • /
    • pp.613-623
    • /
    • 2011
  • This study assessed the feasibility to apply Two-band and Three-band reflectance models for chlorophyll-a estimation in turbid productive waters whose scale is smaller and narrower than ocean using a high spatial resolution image. Those band ratio models were successfully applied to analyzing chlorophyll-a concentrations of ocean or coastal water using Moderate Imaging Spectroradiometer(MODIS), Sea-viewing Wide Field-fo-view Sensor(SeaWiFS), Medium Resolution Imaging Spectrometer(MERIS), etc. Two-band and Three-band models based on band ratio such as Red and NIR band were generally used for the Chl-a in turbid waters. Two-band modes using Red and NIR bands of RapidEye image showed no significant results with $R^2$ 0.38. To enhance a band ratio between absorption and reflection peak, We used red-edge band(710 nm) of RapidEye image for Twoband and Three-band models. Red-RE Two-band and Red-RE-NIR Three-band reflectance model (with cubic equation) for the RapidEye image provided significance performances with $R^2$ 0.66 and 0.73, respectively. Their performance showed the 'Approximate Prediction' with RPD, 1.39 and 1.29 and RMSE, 24.8, 22.4, respectively. Another three-band model with quadratic equation showed similar performances to Red-RE two-band model. The findings in this study demonstrated that Two-band and Three-band reflectance models using a red-edge band can approximately estimate chlorophyll-a concentrations in a turbid river water using high-resolution satellite image. In the distribution map of estimated Chl-a concentrations, three-band model with cubic equation showed lower values than twoband model. In the further works, quantification and correction of spectral interferences caused by suspended sediments and colored dissolved organic matters will improve the accuracy of chlorophyll-a estimation in turbid waters.

Validation of Satellite Scatterometer Sea-Surface Wind Vectors (MetOp-A/B ASCAT) in the Korean Coastal Region (한반도 연안해역에서 인공위성 산란계(MetOp-A/B ASCAT) 해상풍 검증)

  • Kwak, Byeong-Dae;Park, Kyung-Ae;Woo, Hye-Jin;Kim, Hee-Young;Hong, Sung-Eun;Sohn, Eun-Ha
    • Journal of the Korean earth science society
    • /
    • v.42 no.5
    • /
    • pp.536-555
    • /
    • 2021
  • Sea-surface wind is an important variable in ocean-atmosphere interactions, leading to the changes in ocean surface currents and circulation, mixed layers, and heat flux. With the development of satellite technology, sea-surface winds data retrieved from scatterometer observation data have been used for various purposes. In a complex marine environment such as the Korean Peninsula coast, scatterometer-observed sea-surface wind is an important factor for analyzing ocean and atmospheric phenomena. Therefore, the validation results of wind accuracy can be used for diverse applications. In this study, the sea-surface winds derived from ASCAT (Advanced SCATterometer) mounted on MetOp-A/B (METeorological Operational Satellite-A/B) were validated compared to in-situ wind measurements at 16 marine buoy stations around the Korean Peninsula from January to December 2020. The buoy winds measured at a height of 4-5 m from the sea surface were converted to 10-m neutral winds using the LKB (Liu-Katsaros-Businger) model. The matchup procedure produced 5,544 and 10,051 collocation points for MetOp-A and MetOp-B, respectively. The root mean square errors (RMSE) were 1.36 and 1.28 m s-1, and bias errors amounted to 0.44 and 0.65 m s-1 for MetOp-A and MetOp-B, respectively. The wind directions of both scatterometers exhibited negative biases of -8.03° and -6.97° and RMSE values of 32.46° and 36.06° for MetOp-A and MetOp-B, respectively. These errors were likely associated with the stratification and dynamics of the marine-atmospheric boundary layer. In the seas around the Korean Peninsula, the sea-surface winds of the ASCAT tended to be more overestimated than the in-situ wind speeds, particularly at weak wind speeds. In addition, the closer the distance from the coast, the more the amplification of error. The present results could contribute to the development of a prediction model as improved input data and the understanding of air-sea interaction and impact of typhoons in the coastal regions around the Korean Peninsula.

Study on the Concentration Estimation Equation of Nitrogen Dioxide using Hyperspectral Sensor (초분광센서를 활용한 이산화질소 농도 추정식에 관한 연구)

  • Jeon, Eui-Ik;Park, Jin-Woo;Lim, Seong-Ha;Kim, Dong-Woo;Yu, Jae-Jin;Son, Seung-Woo;Jeon, Hyung-Jin;Yoon, Jeong-Ho
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.6
    • /
    • pp.19-25
    • /
    • 2019
  • The CleanSYS(Clean SYStem) is operated to monitor air pollutants emitted from specific industrial complexes in Korea. So the industrial complexes without the system are directly monitored by the control officers. For efficient monitoring, studies using various sensors have been conducted to monitor air pollutants emitted from industrial complex. In this study, hyperspectral sensors were used to model and verify the equations for estimating the concentration of $NO_2$(nitrogen dioxide) in air pollutants emitted. For development of the equations, spectral radiance were observed for $NO_2$ at various concentrations with different SZA(Solar Zenith Angle), VZA(Viewing Zenith Angle), and RAA(Relative Azimuth Angle). From the observed spectral radiance, the calculated value of the difference between the values of the specific wavelengths was taken as an absorption depth, and the equations were developed using the relationship between the depth and the $NO_2$ concentration. The spectral radiance mixed gas of $NO_2$ and $SO_2$(sulfur dioxide) was used to verify the equations. As a result, the $R^2$(coefficient of determination) and RMSE(Root Mean Square Error) were different from 0.71~0.88 and 72~23 ppm according to the form of the equation, and $R^2$ of the exponential form was the highest among the equations. Depending on the type of the equations, the accuracy of the estimated concentration with varying concentrations is not constant. However, if the equations are advanced in the future, hyperspectral sensors can be used to monitor the $NO_2$ emitted from the industrial complex.

Predicting Crime Risky Area Using Machine Learning (머신러닝기반 범죄발생 위험지역 예측)

  • HEO, Sun-Young;KIM, Ju-Young;MOON, Tae-Heon
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.21 no.4
    • /
    • pp.64-80
    • /
    • 2018
  • In Korea, citizens can only know general information about crime. Thus it is difficult to know how much they are exposed to crime. If the police can predict the crime risky area, it will be possible to cope with the crime efficiently even though insufficient police and enforcement resources. However, there is no prediction system in Korea and the related researches are very much poor. From these backgrounds, the final goal of this study is to develop an automated crime prediction system. However, for the first step, we build a big data set which consists of local real crime information and urban physical or non-physical data. Then, we developed a crime prediction model through machine learning method. Finally, we assumed several possible scenarios and calculated the probability of crime and visualized the results in a map so as to increase the people's understanding. Among the factors affecting the crime occurrence revealed in previous and case studies, data was processed in the form of a big data for machine learning: real crime information, weather information (temperature, rainfall, wind speed, humidity, sunshine, insolation, snowfall, cloud cover) and local information (average building coverage, average floor area ratio, average building height, number of buildings, average appraised land value, average area of residential building, average number of ground floor). Among the supervised machine learning algorithms, the decision tree model, the random forest model, and the SVM model, which are known to be powerful and accurate in various fields were utilized to construct crime prevention model. As a result, decision tree model with the lowest RMSE was selected as an optimal prediction model. Based on this model, several scenarios were set for theft and violence cases which are the most frequent in the case city J, and the probability of crime was estimated by $250{\times}250m$ grid. As a result, we could find that the high crime risky area is occurring in three patterns in case city J. The probability of crime was divided into three classes and visualized in map by $250{\times}250m$ grid. Finally, we could develop a crime prediction model using machine learning algorithm and visualized the crime risky areas in a map which can recalculate the model and visualize the result simultaneously as time and urban conditions change.

Evaluation of stream flow and water quality changes of Yeongsan river basin by inter-basin water transfer using SWAT (SWAT을 이용한 유역간 물이동량에 따른 영산강유역의 하천 유량 및 수질 변동 분석)

  • Kim, Yong Won;Lee, Ji Wan;Woo, So Young;Kim, Seong Joon
    • Journal of Korea Water Resources Association
    • /
    • v.53 no.12
    • /
    • pp.1081-1095
    • /
    • 2020
  • This study is to evaluate stream flow and water quality changes of Yeongsan river basin (3,371.4 km2) by inter-basin water transfer (IBWT) from Juam dam of Seomjin river basin using SWAT (Soil and Water Assessment Tool). The SWAT was established using inlet function for IBWT between donor and receiving basins. The SWAT was calibrated and validated with 14 years (2005 ~ 2018) data of 1 stream (MR) and 2 multi-functional weir (SCW, JSW) water level gauging stations, and 3 water quality stations (GJ2, NJ, and HP) including data of IBWT and effluent from wastewater treatment plants of Yeongsan river basin. For streamflow and weir inflows (MR, SCW, and JSW), the coefficient of determination (R2), Nash-Sutcliffe efficiency (NSE), root mean square error (RMSE), and percent bias (PBIAS) were 0.69 ~ 0.81, 0.61 ~ 0.70, 1.34 ~ 2.60 mm/day, and -8.3% ~ +7.6% respectively. In case of water quality, the R2 of SS, T-N, and T-P were 0.69 ~ 0.81, 0.61 ~ 0.70, and 0.54 ~ 0.63 respectively. The Yeongsan river basin average streamflow was 12.0 m3/sec and the average SS, T-N, and T-P were 110.5 mg/L, 4.4 mg/L, 0.18 mg/L respectively. Under the 130% scenario of IBWT amount, the streamflow, SS increased to 12.94 m3/sec (+7.8%), 111.26 mg/L (+0.7%) and the T-N, T-P decreased to 4.17 mg/L (-5.2%), 0.165 mg/L (-8.3%) respectively. Under the 70% scenario of IBWT amount, the streamflow, SS decreased to 11.07 m3/sec (-7.8%), 109.74 mg/L (-0.7%) and the T-N, T-P increased to 4.68 mg/L (+6.4%), 0.199 mg/L (+10.6%) respectively.

Physical Offset of UAVs Calibration Method for Multi-sensor Fusion (다중 센서 융합을 위한 무인항공기 물리 오프셋 검보정 방법)

  • Kim, Cheolwook;Lim, Pyeong-chae;Chi, Junhwa;Kim, Taejung;Rhee, Sooahm
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_1
    • /
    • pp.1125-1139
    • /
    • 2022
  • In an unmanned aerial vehicles (UAVs) system, a physical offset can be existed between the global positioning system/inertial measurement unit (GPS/IMU) sensor and the observation sensor such as a hyperspectral sensor, and a lidar sensor. As a result of the physical offset, a misalignment between each image can be occurred along with a flight direction. In particular, in a case of multi-sensor system, an observation sensor has to be replaced regularly to equip another observation sensor, and then, a high cost should be paid to acquire a calibration parameter. In this study, we establish a precise sensor model equation to apply for a multiple sensor in common and propose an independent physical offset estimation method. The proposed method consists of 3 steps. Firstly, we define an appropriate rotation matrix for our system, and an initial sensor model equation for direct-georeferencing. Next, an observation equation for the physical offset estimation is established by extracting a corresponding point between a ground control point and the observed data from a sensor. Finally, the physical offset is estimated based on the observed data, and the precise sensor model equation is established by applying the estimated parameters to the initial sensor model equation. 4 region's datasets(Jeon-ju, Incheon, Alaska, Norway) with a different latitude, longitude were compared to analyze the effects of the calibration parameter. We confirmed that a misalignment between images were adjusted after applying for the physical offset in the sensor model equation. An absolute position accuracy was analyzed in the Incheon dataset, compared to a ground control point. For the hyperspectral image, root mean square error (RMSE) for X, Y direction was calculated for 0.12 m, and for the point cloud, RMSE was calculated for 0.03 m. Furthermore, a relative position accuracy for a specific point between the adjusted point cloud and the hyperspectral images were also analyzed for 0.07 m, so we confirmed that a precise data mapping is available for an observation without a ground control point through the proposed estimation method, and we also confirmed a possibility of multi-sensor fusion. From this study, we expect that a flexible multi-sensor platform system can be operated through the independent parameter estimation method with an economic cost saving.

Performance evaluation of hyperspectral bathymetry method for morphological mapping in a large river confluence (초분광수심법 기반 대하천 합류부 하상측정 성능 평가)

  • Kim, Dongsu;Seo, Youngcheol;You, Hojun;Gwon, Yeonghwa
    • Journal of Korea Water Resources Association
    • /
    • v.56 no.3
    • /
    • pp.195-210
    • /
    • 2023
  • Additional deposition and erosion in large rivers in South Korea have continued to occur toward morphological stabilization after massive dredging through the four major river restoration project, subsequently requiring precise bathymetry monitoring. Hyperspectral bathymetry method has increasingly been highlighted as an alternative way to estimate bathymetry with high spatial resolution in shallow depth for replacing classical intrusive direct measurement techniques. This study introduced the conventional Optimal Band Ratio Analysis (OBRA) of hyperspectral bathymetry method, and evaluated the performance in a domestic large river in normal turbid and flow condition. Maximum measurable depth was estimated by applying correlation coefficient and root mean square error (RMSE) produced during OBRA with cascadedly applying cut-off depth, where the consequent hyperspectral bathymetry map excluded the region over the derived maximum measurable depth. Also non-linearity was considered in building relation between optimal band and depth. We applied the method to the Nakdong and Hwang River confluence as a large river case and obtained the following features. First, the hyperspectal method showed acceptable performance in morphological mapping for shallow regions, where the maximum measurable depth was 2.5 m and 1.25 m in the Nakdong and Hwang river, respectively. Second, RMSE was more feasible to derive the maximum measurable depth rather than the conventional correlation coefficient whereby considering various scenario of excluding range of in situ depths for OBRA. Third, highly turbid region in Hwang River did not allow hyperspectral bathymetry mapping compared with the case of adjacent Nakdong River, where maximum measurable depth was down to half in Hwang River.

Validation of Extreme Rainfall Estimation in an Urban Area derived from Satellite Data : A Case Study on the Heavy Rainfall Event in July, 2011 (위성 자료를 이용한 도시지역 극치강우 모니터링: 2011년 7월 집중호우를 중심으로)

  • Yoon, Sun-Kwon;Park, Kyung-Won;Kim, Jong Pil;Jung, Il-Won
    • Journal of Korea Water Resources Association
    • /
    • v.47 no.4
    • /
    • pp.371-384
    • /
    • 2014
  • This study developed a new algorithm of extreme rainfall extraction based on the Communication, Ocean and Meteorological Satellite (COMS) and the Tropical Rainfall Measurement Mission (TRMM) Satellite image data and evaluated its applicability for the heavy rainfall event in July-2011 in Seoul, South Korea. The power-series-regression-based Z-R relationship was employed for taking into account for empirical relationships between TRMM/PR, TRMM/VIRS, COMS, and Automatic Weather System(AWS) at each elevation. The estimated Z-R relationship ($Z=303R^{0.72}$) agreed well with observation from AWS (correlation coefficient=0.57). The estimated 10-minute rainfall intensities from the COMS satellite using the Z-R relationship generated underestimated rainfall intensities. For a small rainfall event the Z-R relationship tended to overestimated rainfall intensities. However, the overall patterns of estimated rainfall were very comparable with the observed data. The correlation coefficients and the Root Mean Square Error (RMSE) of 10-minute rainfall series from COMS and AWS gave 0.517, and 3.146, respectively. In addition, the averaged error value of the spatial correlation matrix ranged from -0.530 to -0.228, indicating negative correlation. To reduce the error by extreme rainfall estimation using satellite datasets it is required to take into more extreme factors and improve the algorithm through further study. This study showed the potential utility of multi-geostationary satellite data for building up sub-daily rainfall and establishing the real-time flood alert system in ungauged watersheds.