• Title/Summary/Keyword: R-RMSE

Search Result 502, Processing Time 0.036 seconds

Study on Water Quality Predictability through Machine Learning Techniques in Non-point Pollutant Management Area (비점오염원관리지역의 머신러닝 기법을 통한 수질 예측 가능성 연구)

  • Yeong Na Yu;Min Hwan Shin;Dong Hyuk Kum;Kyoung Jae Lim;Jong Gun Kim
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.467-467
    • /
    • 2023
  • 강우에 의해 발생하는 비점오염물질의 수질 데이터가 충분하지 않아 비점오염원이 문제가 되고 있는 유역의 수질개선을 위한 대책마련이 어려운 실정이다. 기존에 환경부에서 운영하고 있는 자동측정망은 1시간 간격으로 데이터를 축적하고 있으나, 비점오염원이 문제가 되는 유역에 설치되어 있지 않거나 수온, DO, pH 등 현장항목만을 측정하고 있어 하천의 수질오염을 대표할 수 있는 T-P나 SS 등의 수질분석 항목의 부재하다. 이로인해 유역의 수질개선 대책을 수립하기 위한 오염원의 현황을 파악하기 어려운 실정이다. 따라서, 본 연구에서는 비점오염원관리지역 중 골지천 유역을 대상으로 수질항목별 상관성을 분석하고, 실측자료를 기반으로 DT, MLP, SVM, RF, GB, XGB 등의 머신러닝 기법을 통해 수질 예측 가능성을 연구하였다. 상관관계 분석결과 입력변수인 탁도 항목이 예측 수질과 뚜렷한 상관관계를 보이는 것으로 나타났으나, 그 외 항목에서는 약한 상관관계를 보이거나 상관관계가 없는 것으로 나타났다. 머신러닝 기법을 활용한 수질 예측 분석 결과, 검무교와 태봉2교, 제1여량교는 RF 기법에서 결정계수(R2) 0.57~0.86, RMSE 16.49~175.60으로 예측성이 우수한 것으로 나타났다. 관말교는 SVM 기법에서 R2 0.65, RMSE 57.69로, 송계교는 XGB 기법에서 R2 0.74, RMSE 282.86으로 가장 예측성이 우수한 것으로 나타났다. 분석결과와 같이 머신러닝 기법을 활용한 수질 예측은 가능하나, 예측성이 우수한 머신러닝 기법의 R2 비교 결과, 유역면적이 큰 제1여량교와 작은 관말교에서 0.57과 0.65로 다른 지점에 비해 낮은 것으로 나타났다. RMSE 비교 결과, 상류 산간지역에 발생한 국지성 호우의 영향으로 흙탕물이 가장 자주 발생하는 태봉2교 지점과 우선관리지역이 합류되는 송계교 지점에서 175.60과 282.86으로 예측값과 실측값의 오차가 큰 것으로 나타났다. 연구결과와 같이 하천 수질을 예측하기 위해서는 유역면적 혹은 유역특성과 관련한 기초자료를 추가로 적용하여 머신러닝 기법을 적용 해야할 것으로 판단된다. 또한, 본 연구에서 예측한 수질 항목 이외에 입력변수를 추가로 확보하여 수질의 예측 가능성을 검토해야 할 것으로 보여진다.

  • PDF

Analysis of Livestock Nonpoint Source Pollutant Load Ratio for Each Sub-watershed in Sancheong Watershed using HSPF Model (HSPF 모형을 이용한 산청 유역의 소유역별 축산비점오염부하량 비중 분석)

  • Kim, So Rae;Kim, Sang Min
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.62 no.1
    • /
    • pp.39-50
    • /
    • 2020
  • The objective of this study was to assess the livestock nonpoint source pollutant impact on water quality in Namgang dam watershed using the HSPF (Hydrological Simulation Program-Fortran) model. The input data for the HSPF model was established using the landcover, digital elevation, and watershed and river maps. In order to apply the pollutant load to the HSPF model, the delivery load of the livestock nonpoint source in the Namgang dam watershed was calculated and used as a point pollutant input data for the HSPF model. The hydrologic and water quality parameters of HSPF model were calibrated and validated using the observed runoff data from 2007 to 2015 at Sancheong station. The R2 (Determination Coefficient), RMSE (Root Mean Square Error), NSE (Nash-Sutcliffe efficiency coefficient), and RMAE (Relative Mean Absolute Error) were used to evaluate the model performance. The simulation results for annual mean runoff showed that R2 ranged 0.79~0.81, RMSE 1.91~2.73 mm/day, NSE 0.7~0.71 and RMAE 0.37~0.49 mm/day for daily runoff. The simulation results for annual mean BOD for RMSE ranged 0.99~1.13 mg/L and RMAE 0.49~0.55 mg/L, annual mean TN for RMSE ranged 1.65~1.72 mg/L and RMAE 0.55 mg/L, and annual mean TP for RMSE ranged 0.043~0.055 mg/L and RMAE 0.552~0.570 mg/L. As a result of livestock nonpoint pollutant loading simulation for each sub-watersehd using the HSPF model, the BOD ranged 16.6~163 kg/day, TN ranged 27.5~337 kg/day, TP ranged 1.22~14.1 kg/day.

Machine Learning Algorithms Evaluation and CombML Development for Dam Inflow Prediction (댐 유입량 예측을 위한 머신러닝 알고리즘 평가 및 CombML 개발)

  • Hong, Jiyeong;Bae, Juhyeon;Jeong, Yeonseok;Lim, Kyoung Jae
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.317-317
    • /
    • 2021
  • 효율적인 물관리를 위한 댐 유입량 대한 연구는 필수적이다. 본 연구에서는 다양한 머신러닝 알고리즘을 통해 40년동안의 기상 및 댐 유입량 데이터를 이용하여 소양강댐 유입량을 예측하였으며, 그 중 고유량과 저유량예측에 적합한 알고리즘을 각각 선정하여 머신러닝 알고리즘을 결합한 CombML을 개발하였다. 의사 결정 트리 (DT), 멀티 레이어 퍼셉트론 (MLP), 랜덤 포레스트(RF), 그래디언트 부스팅 (GB), RNN-LSTM 및 CNN-LSTM 알고리즘이 사용되었으며, 그 중 가장 정확도가 높은 모형과 고유량이 아닌 경우에서 특별히 예측 정확도가 높은 모형을 결합하여 결합 머신러닝 알고리즘 (CombML)을 개발 및 평가하였다. 사용된 알고리즘 중 MLP가 NSE 0.812, RMSE 77.218 m3/s, MAE 29.034 m3/s, R 0.924, R2 0.817로 댐 유입량 예측에서 최상의 결과를 보여주었으며, 댐 유입량이 100 m3/s 이하인 경우 앙상블 모델 (RF, GB) 이 댐 유입 예측에서 MLP보다 더 나은 성능을 보였다. 따라서, 유입량이 100 m3/s 이상 시의 평균 일일 강수량인 16 mm를 기준으로 강수가 16mm 이하인 경우 앙상블 방법 (RF 및 GB)을 사용하고 강수가 16 mm 이상인 경우 MLP를 사용하여 댐 유입을 예측하기 위해 두 가지 복합 머신러닝(CombML) 모델 (RF_MLP 및 GB_MLP)을 개발하였다. 그 결과 RF_MLP에서 NSE 0.857, RMSE 68.417 m3/s, MAE 18.063 m3/s, R 0.927, R2 0.859, GB_MLP의 경우 NSE 0.829, RMSE 73.918 m3/s, MAE 18.093 m3/s, R 0.912, R2 0.831로 CombML이 댐 유입을 가장 정확하게 예측하는 것으로 평가되었다. 본 연구를 통해 하천 유황을 고려한 여러 머신러닝 알고리즘의 결합을 통한 유입량 예측 결과, 알고리즘 결합 시 예측 모형의 정확도가 개선되는 것이 확인되었으며, 이는 추후 효율적인 물관리에 이용될 수 있을 것으로 판단된다.

  • PDF

Rainfall Estimation Using TRMM-PR/VIRS and GMS Data (TRMM-PR/VIRS와 GMS 자료를 이용한 강수량 추정에 관한 연구)

  • 김영섭;박경원
    • Korean Journal of Remote Sensing
    • /
    • v.18 no.6
    • /
    • pp.319-326
    • /
    • 2002
  • Rainfall estimation was conducted based on TRMM-PR/VIES and GMS data. AWS rainfall data were used for various validation. General procedure is as follows; 1) Z-R relationship was made by the comparison of TRMM-PR and AWS data. 2) new algorithm was developed by the estimates from Z-R equation and TBB of VIRS. 3) rainfall was estimated through the substitution of GMS data for TBB of VIRS in the newly developed algorithm. Z-R relationship based on TRMM is $Z=303R^{0.72}$ with correlation coefficient 0.57. The newly developed algorithm is shown as correlation coefficient 0.67 and RMSE 17mm/hr. New algorithm shows the underestimating tendency in case of heavy rainfall event.

Evaluation of Factors Used in AAPM TG-43 Formalism Using Segmented Sources Integration Method and Monte Carlo Simulation: Implementation of microSelectron HDR Ir-192 Source (미소선원 적분법과 몬테칼로 방법을 이용한 AAPM TG-43 선량계산 인자 평가: microSelectron HDR Ir-192 선원에 대한 적용)

  • Ahn, Woo-Sang;Jang, Won-Woo;Park, Sung-Ho;Jung, Sang-Hoon;Cho, Woon-Kap;Kim, Young-Seok;Ahn, Seung-Do
    • Progress in Medical Physics
    • /
    • v.22 no.4
    • /
    • pp.190-197
    • /
    • 2011
  • Currently, the dose distribution calculation used by commercial treatment planning systems (TPSs) for high-dose rate (HDR) brachytherapy is derived from point and line source approximation method recommended by AAPM Task Group 43 (TG-43). However, the study of Monte Carlo (MC) simulation is required in order to assess the accuracy of dose calculation around three-dimensional Ir-192 source. In this study, geometry factor was calculated using segmented sources integration method by dividing microSelectron HDR Ir-192 source into smaller parts. The Monte Carlo code (MCNPX 2.5.0) was used to calculate the dose rate $\dot{D}(r,\theta)$ at a point ($r,\theta$) away from a HDR Ir-192 source in spherical water phantom with 30 cm diameter. Finally, anisotropy function and radial dose function were calculated from obtained results. The obtained geometry factor was compared with that calculated from line source approximation. Similarly, obtained anisotropy function and radial dose function were compared with those derived from MCPT results by Williamson. The geometry factor calculated from segmented sources integration method and line source approximation was within 0.2% for $r{\geq}0.5$ cm and 1.33% for r=0.1 cm, respectively. The relative-root mean square error (R-RMSE) of anisotropy function obtained by this study and Williamson was 2.33% for r=0.25 cm and within 1% for r>0.5 cm, respectively. The R-RMSE of radial dose function was 0.46% at radial distance from 0.1 to 14.0 cm. The geometry factor acquired from segmented sources integration method and line source approximation was in good agreement for $r{\geq}0.1$ cm. However, application of segmented sources integration method seems to be valid, since this method using three-dimensional Ir-192 source provides more realistic geometry factor. The anisotropy function and radial dose function estimated from MCNPX in this study and MCPT by Williamson are in good agreement within uncertainty of Monte Carlo codes except at radial distance of r=0.25 cm. It is expected that Monte Carlo code used in this study could be applied to other sources utilized for brachytherapy.

Ensemble deep learning-based models to predict the resilient modulus of modified base materials subjected to wet-dry cycles

  • Mahzad Esmaeili-Falak;Reza Sarkhani Benemaran
    • Geomechanics and Engineering
    • /
    • v.32 no.6
    • /
    • pp.583-600
    • /
    • 2023
  • The resilient modulus (MR) of various pavement materials plays a significant role in the pavement design by a mechanistic-empirical method. The MR determination is done by experimental tests that need time and money, along with special experimental tools. The present paper suggested a novel hybridized extreme gradient boosting (XGB) structure for forecasting the MR of modified base materials subject to wet-dry cycles. The models were created by various combinations of input variables called deep learning. Input variables consist of the number of W-D cycles (WDC), the ratio of free lime to SAF (CSAFR), the ratio of maximum dry density to the optimum moisture content (DMR), confining pressure (σ3), and deviatoric stress (σd). Two XGB structures were produced for the estimation aims, where determinative variables were optimized by particle swarm optimization (PSO) and black widow optimization algorithm (BWOA). According to the results' description and outputs of Taylor diagram, M1 model with the combination of WDC, CSAFR, DMR, σ3, and σd is recognized as the most suitable model, with R2 and RMSE values of BWOA-XGB for model M1 equal to 0.9991 and 55.19 MPa, respectively. Interestingly, the lowest value of RMSE for literature was at 116.94 MPa, while this study could gain the extremely lower RMSE owned by BWOA-XGB model at 55.198 MPa. At last, the explanations indicate the BWO algorithm's capability in determining the optimal value of XGB determinative parameters in MR prediction procedure.

Prediction of Salinity of Nakdong River Estuary Using Deep Learning Algorithm (LSTM) for Time Series Analysis (시계열 분석 딥러닝 알고리즘을 적용한 낙동강 하굿둑 염분 예측)

  • Woo, Joung Woon;Kim, Yeon Joong;Yoon, Jong Sung
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.34 no.4
    • /
    • pp.128-134
    • /
    • 2022
  • Nakdong river estuary is being operated with the goal of expanding the period of seawater inflow from this year to 2022 every month and creating a brackish water area within 15 km of the upstream of the river bank. In this study, the deep learning algorithm Long Short-Term Memory (LSTM) was applied to predict the salinity of the Nakdong Bridge (about 5 km upstream of the river bank) for the purpose of rapid decision making for the target brackish water zone and prevention of salt water damage. Input data were constructed to reflect the temporal and spatial characteristics of the Nakdong River estuary, such as the amount of discharge from Changnyeong and Hamanbo, and an optimal model was constructed in consideration of the hydraulic characteristics of the Nakdong River Estuary by changing the degree according to the sequence length. For prediction accuracy, statistical analysis was performed using the coefficient of determination (R-squred) and RMSE (root mean square error). When the sequence length was 12, the R-squred 0.997 and RMSE 0.122 were the highest, and the prior prediction time showed a high degree of R-squred 0.93 or more until the 12-hour interval.

Tree Height Estimation of Pinus densiflora and Pinus koraiensis in Korea with the Use of UAV-Acquired Imagery

  • Talkasen, Lynn J.;Kim, Myeong Jun;Kim, Dong Hyeon;Kim, Dong Geun;Lee, Kawn Hee
    • Journal of Forest and Environmental Science
    • /
    • v.33 no.3
    • /
    • pp.187-196
    • /
    • 2017
  • The use of unmanned aerial vehicles (UAV) for the estimation of tree height is gaining recognition. This study aims to assess the effectiveness of tree height estimation of Pinus densiflora Sieb. et Zucc. and Pinus koraiensis Sieb. et Zucc. using digital surface model (DSM) generated from UAV-acquired imageries. Images were taken with the $Trimble^{(R)}$ UX5 equipped with Sony ${\alpha}5100$. The generated DSM, together with the digital elevation model (DEM) generated from a digital map of the study areas, were used in the estimation of tree height. Field measurements were conducted in order to generate a regression model and carry out accuracy assessment. The obtained coefficients of determination (R2) and root mean square error (RMSE) for P. densiflora (R2=0.71; RMSE=1.00 m) and P. koraiensis (R2=0.64; RMSE=0.85 m) are comparable to the results of similar studies. The results of the paired two-tailed t-test show that the two tree height estimation methods are not significantly different (p-value=0.04 and 0.10, alpha level=0.01), which means that tree height estimation using UAV imagery could be used as an alternative to field measurement.

Prediction of Compaction, Strength Characteristics for Reservoir Soil Using Portable Static Cone Penetration Test (휴대용 정적 콘 관입시험을 통한 저수지 제방 토양의 다짐, 강도 특성 및 사면 안정성 예측)

  • Jeon, Jihun;Son, Younghwan;Kim, Taejin;Jo, Sangbeom;Jung, Seungjoo;Heo, Jun;Bong, Taeho;Kim, Donggeun
    • Journal of The Korean Society of Agricultural Engineers
    • /
    • v.65 no.5
    • /
    • pp.1-11
    • /
    • 2023
  • Due to climate change and aging of reservoirs, damage to embankment slopes is increasing. However, the safety diagnosis of the reservoir slope is mainly conducted by visual observation, and the time and economic cost are formidable to apply soil mechanical tests and slope stability analysis. Accordingly, this study presented a predicting method for the compaction and strength characteristics of the reservoir embankment soil using a portable static cone penetration test. The predicted items consisted of dry density, cohesion, and internal friction angle, which are the main factors of slope stability analysis. Portable static cone penetration tests were performed at 19 reservoir sites, and prediction equations were constructed from the correlation between penetration resistance data and test results of soil samples. The predicted dry density and strength parameters showed a correlation with test results between R2 0.40 and 0.93, and it was found to replace the test results well when used as input data for slope stability analysis (R2 0.8134 or more, RMSE 0.0320 or less). In addition, the prediction equations for the minimum safety factor of the slope were presented using the penetration resistance and gradient. As a result of comparing the predicted safety factor with the analysis results, R2 0.5125, RMSE 0.0382 in coarse-grained soil, R2 0.4182 and RMSE 0.0628 in fine-grained soil. The results of this study can be used as a way to improve the existing slope safety diagnosis method, and are expected to be used to predict the characteristics of various soils and inspect slopes.

The GOCI-II Early Mission Marine Fog Detection Products: Optical Characteristics and Verification (천리안 해양위성 2호(GOCI-II) 임무 초기 해무 탐지 산출: 해무의 광학적 특성 및 초기 검증)

  • Kim, Minsang;Park, Myung-Sook
    • Korean Journal of Remote Sensing
    • /
    • v.37 no.5_2
    • /
    • pp.1317-1328
    • /
    • 2021
  • This study analyzes the early satellite mission marine fog detection results from Geostationary Ocean Color Imager-II (GOCI-II). We investigate optical characteristics of the GOCI-II spectral bands for marine fog between October 2020 and March 2021 during the overlapping mission period of Geostationary Ocean Color Imager (GOCI) and GOCI-II. For Rayleigh-corrected reflection (Rrc) at 412 nm band available for the input of the GOCI-II marine fog algorithm, the inter-comparison between GOCI and GOCI-II data showed a small Root Mean Square Error (RMSE) value (0.01) with a high correlation coefficient (0.988). Another input variable, Normalized Localization Standard (NLSD), also shows a reasonable correlation (0.798) between the GOCI and GOCI-II data with a small RMSE value (0.007). We also found distinctive optical characteristics between marine fog and clouds by the GOCI-II observations, showing the narrower distribution of all bands' Rrc values centered at high values for cloud compared to marine fog. The GOCI-II marine fog detection distribution for actual cases is similar to the GOCI but more detailed due to the improved spatial resolution from 500 m to 250 m. The validation with the automated synoptic observing system (ASOS) visibility data confirms the initial reliability of the GOCI-II marine fog detection. Also, it is expected to improve the performance of the GOCI-II marine fog detection algorithm by adding sufficient samples to verify stable performance, improving the post-processing process by replacing real-time available cloud input data and reducing false alarm by adding aerosol information.