• Title/Summary/Keyword: MAE(mean absolute error)

Search Result 187, Processing Time 0.024 seconds

Detection of Wildfire Smoke Plumes Using GEMS Images and Machine Learning (GEMS 영상과 기계학습을 이용한 산불 연기 탐지)

  • Jeong, Yemin;Kim, Seoyeon;Kim, Seung-Yeon;Yu, Jeong-Ah;Lee, Dong-Won;Lee, Yangwon
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.5_3
    • /
    • pp.967-977
    • /
    • 2022
  • The occurrence and intensity of wildfires are increasing with climate change. Emissions from forest fire smoke are recognized as one of the major causes affecting air quality and the greenhouse effect. The use of satellite product and machine learning is essential for detection of forest fire smoke. Until now, research on forest fire smoke detection has had difficulties due to difficulties in cloud identification and vague standards of boundaries. The purpose of this study is to detect forest fire smoke using Level 1 and Level 2 data of Geostationary Environment Monitoring Spectrometer (GEMS), a Korean environmental satellite sensor, and machine learning. In March 2022, the forest fire in Gangwon-do was selected as a case. Smoke pixel classification modeling was performed by producing wildfire smoke label images and inputting GEMS Level 1 and Level 2 data to the random forest model. In the trained model, the importance of input variables is Aerosol Optical Depth (AOD), 380 nm and 340 nm radiance difference, Ultra-Violet Aerosol Index (UVAI), Visible Aerosol Index (VisAI), Single Scattering Albedo (SSA), formaldehyde (HCHO), nitrogen dioxide (NO2), 380 nm radiance, and 340 nm radiance were shown in that order. In addition, in the estimation of the forest fire smoke probability (0 ≤ p ≤ 1) for 2,704 pixels, Mean Bias Error (MBE) is -0.002, Mean Absolute Error (MAE) is 0.026, Root Mean Square Error (RMSE) is 0.087, and Correlation Coefficient (CC) showed an accuracy of 0.981.

Model- Data Based Small Area Estimation

  • Shin, Key-Il;Lee, Sang Eun
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.3
    • /
    • pp.637-645
    • /
    • 2003
  • Small area estimation had been studied using data-based methods such as Direct, Indirect, Synthetic methods. However recently, model-based such as based on regression or time series estimation methods are applied to the study. In this paper we investigate a model-data based small area estimation which takes into account the spatial relation among the areas. The Economic Active Population Survey in 2001 are used for analysis and the results from the model based and model-data based estimation are compared with using MSE(Mean squared error), MAE(Mean absolute error) and MB(Mean bias).

Strength prediction of rotary brace damper using MLR and MARS

  • Mansouri, I.;Safa, M.;Ibrahim, Z.;Kisi, O.;Tahir, M.M.;Baharom, S.;Azimi, M.
    • Structural Engineering and Mechanics
    • /
    • v.60 no.3
    • /
    • pp.471-488
    • /
    • 2016
  • This study predicts the strength of rotary brace damper by analyzing a new set of probabilistic models using the usual method of multiple linear regressions (MLR) and advanced machine-learning methods of multivariate adaptive regression splines (MARS), Rotary brace damper can be easily assembled with high energy-dissipation capability. To investigate the behavior of this damper in structures, a steel frame is modeled with this device subjected to monotonic and cyclic loading. Several response parameters are considered, and the performance of damper in reducing each response is evaluated. MLR and MARS methods were used to predict the strength of this damper. Displacement was determined to be the most effective parameter of damper strength, whereas the thickness did not exhibit any effect. Adding thickness parameter as inputs to MARS and MLR models did not increase the accuracies of the models in predicting the strength of this damper. The MARS model with a root mean square error (RMSE) of 0.127 and mean absolute error (MAE) of 0.090 performed better than the MLR model with an RMSE of 0.221 and MAE of 0.181.

A Comparison of CME Arrival Time Estimations by the WSA/ENLIL Cone Model and an Empirical Model

  • Jang, Soo-Jeong;Moon, Yong-Jae;Lee, Kyoung-Sun;Na, Hyeon-Ock
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.37 no.1
    • /
    • pp.92.1-92.1
    • /
    • 2012
  • In this work we have examined the performance of the WSA/ENLIL cone model provided by Community Coordinated Modeling Center (CCMC). The WSA/ENLIL model simulates the propagation of coronal mass ejections (CMEs) from the Sun into the heliosphere. We estimate the shock arrival times at the Earth using 29 halo CMEs from 2001 to 2002. These halo CMEs have cone model parameters from Michalek et al. (2007) as well as their associated interplanetary (IP) shocks. We make a comparison between CME arrival times by the WSA/ENLIL cone model and IP shock observations. For the WSA/ENLIL cone model, the root mean square(RMS) error is about 13 hours and the mean absolute error(MAE) is approximately 10.4 hours. We compared these estimates with those of the empirical model by Kim et al.(2007). For the empirical model, the RMS and MAE errors are about 10.2 hours and 8.7 hours, respectively. We are investigating several possibilities on relatively large errors of the WSA/ENLIL cone model, which may be caused by cone model velocities, CME density enhancement factor, or CME-CME interaction.

  • PDF

Two Machine Learning Models for Mobile Phone Battery Discharge Rate Prediction Based on Usage Patterns

  • Chantrapornchai, Chantana;Nusawat, Paingruthai
    • Journal of Information Processing Systems
    • /
    • v.12 no.3
    • /
    • pp.436-454
    • /
    • 2016
  • This research presents the battery discharge rate models for the energy consumption of mobile phone batteries based on machine learning by taking into account three usage patterns of the phone: the standby state, video playing, and web browsing. We present the experimental design methodology for collecting data, preprocessing, model construction, and parameter selections. The data is collected based on the HTC One X hardware platform. We considered various setting factors, such as Bluetooth, brightness, 3G, GPS, Wi-Fi, and Sync. The battery levels for each possible state vector were measured, and then we constructed the battery prediction model using different regression functions based on the collected data. The accuracy of the constructed models using the multi-layer perceptron (MLP) and the support vector machine (SVM) were compared using varying kernel functions. Various parameters for MLP and SVM were considered. The measurement of prediction efficiency was done by the mean absolute error (MAE) and the root mean squared error (RMSE). The experiments showed that the MLP with linear regression performs well overall, while the SVM with the polynomial kernel function based on the linear regression gives a low MAE and RMSE. As a result, we were able to demonstrate how to apply the derived model to predict the remaining battery charge.

Soft computing techniques in prediction Cr(VI) removal efficiency of polymer inclusion membranes

  • Yaqub, Muhammad;EREN, Beytullah;Eyupoglu, Volkan
    • Environmental Engineering Research
    • /
    • v.25 no.3
    • /
    • pp.418-425
    • /
    • 2020
  • In this study soft computing techniques including, Artificial Neural Network (ANN) and Adaptive Neuro-Fuzzy Inference System (ANFIS) were investigated for the prediction of Cr(VI) transport efficiency by novel Polymer Inclusion Membranes (PIMs). Transport experiments carried out by varying parameters such as time, film thickness, carrier type, carier rate, plasticizer type, and plasticizer rate. The predictive performance of ANN and ANFIS model was evaluated by using statistical performance criteria such as Root Mean Standard Error (RMSE), Mean Absolute Error (MAE), and Coefficient of Determination (R2). Moreover, Sensitivity Analysis (SA) was carried out to investigate the effect of each input on PIMs Cr(VI) removal efficiency. The proposed ANN model presented reliable and valid results, followed by ANFIS model results. RMSE and MAE values were 0.00556, 0.00163 for ANN and 0.00924, 0.00493 for ANFIS model in the prediction of Cr(VI) removal efficiency on testing data sets. The R2 values were 0.973 and 0.867 on testing data sets by ANN and ANFIS, respectively. Results show that the ANN-based prediction model performed better than ANFIS. SA demonstrated that time; film thickness; carrier type and plasticizer type are major operating parameters having 33.61%, 26.85%, 21.07% and 8.917% contribution, respectively.

Prediction of Larix kaempferi Stand Growth in Gangwon, Korea, Using Machine Learning Algorithms

  • Hyo-Bin Ji;Jin-Woo Park;Jung-Kee Choi
    • Journal of Forest and Environmental Science
    • /
    • v.39 no.4
    • /
    • pp.195-202
    • /
    • 2023
  • In this study, we sought to compare and evaluate the accuracy and predictive performance of machine learning algorithms for estimating the growth of individual Larix kaempferi trees in Gangwon Province, Korea. We employed linear regression, random forest, XGBoost, and LightGBM algorithms to predict tree growth using monitoring data organized based on different thinning intensities. Furthermore, we compared and evaluated the goodness-of-fit of these models using metrics such as the coefficient of determination (R2), mean absolute error (MAE), and root mean square error (RMSE). The results revealed that XGBoost provided the highest goodness-of-fit, with an R2 value of 0.62 across all thinning intensities, while also yielding the lowest values for MAE and RMSE, thereby indicating the best model fit. When predicting the growth volume of individual trees after 3 years using the XGBoost model, the agreement was exceptionally high, reaching approximately 97% for all stand sites in accordance with the different thinning intensities. Notably, in non-thinned plots, the predicted volumes were approximately 2.1 m3 lower than the actual volumes; however, the agreement remained highly accurate at approximately 99.5%. These findings will contribute to the development of growth prediction models for individual trees using machine learning algorithms.

The Development of a Rainfall Correction Technique based on Machine Learning for Hydrological Applications (수문학적 활용을 위한 머신러닝 기반의 강우보정기술 개발)

  • Lee, Young-Mi;Ko, Chul-Min;Shin, Seong-Cheol;Kim, Byung-Sik
    • Journal of Environmental Science International
    • /
    • v.28 no.1
    • /
    • pp.125-135
    • /
    • 2019
  • For the purposes of enhancing usability of Numerical Weather Prediction (NWP), the quantitative precipitation prediction scheme by machine learning has been proposed. In this study, heavy rainfall was corrected for by utilizing rainfall predictors from LENS and Radar from 2017 to 2018, as well as machine learning tools LightGBM and XGBoost. The results were analyzed using Mean Absolute Error (MAE), Normalized Peak Error (NPE), and Peak Timing Error (PTE) for rainfall corrected through machine learning. Machine learning results (i.e. using LightGBM and XGBoost) showed improvements in the overall correction of rainfall and maximum rainfall compared to LENS. For example, the MAE of case 5 was found to be 24.252 using LENS, 11.564 using LightGBM, and 11.693 using XGBoost, showing excellent error improvement in machine learning results. This rainfall correction technique can provide hydrologically meaningful rainfall information such as predictions of flooding. Future research on the interpretation of various hydrologic processes using machine learning is necessary.

Predicting a Queue Length Using a Deep Learning Model at Signalized Intersections (딥러닝 모형을 이용한 신호교차로 대기행렬길이 예측)

  • Na, Da-Hyuk;Lee, Sang-Soo;Cho, Keun-Min;Kim, Ho-Yeon
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.20 no.6
    • /
    • pp.26-36
    • /
    • 2021
  • In this study, a deep learning model for predicting the queue length was developed using the information collected from the image detector. Then, a multiple regression analysis model, a statistical technique, was derived and compared using two indices of mean absolute error(MAE) and root mean square error(RMSE). From the results of multiple regression analysis, time, day of the week, occupancy, and bus traffic were found to be statistically significant variables. Occupancy showed the most strong impact on the queue length among the variables. For the optimal deep learning model, 4 hidden layers and 6 lookback were determined, and MAE and RMSE were 6.34 and 8.99. As a result of evaluating the two models, the MAE of the multiple regression model and the deep learning model were 13.65 and 6.44, respectively, and the RMSE were 19.10 and 9.11, respectively. The deep learning model reduced the MAE by 52.8% and the RMSE by 52.3% compared to the multiple regression model.

Covid19 trends predictions using time series data (시계열 데이터를 활용한 코로나19 동향 예측)

  • Kim, Jae-Ho;Kim, Jang-Young
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.7
    • /
    • pp.884-889
    • /
    • 2021
  • The number of people infected with Covid-19 in Korea seemed to be gradually decreasing thanks to various efforts such as social distancing and vaccines. However, just as the number of infected people increased after a particular incident on February 20, 2020, the number of infected people has been increasing rapidly since December 2020 by approximately 500 per day. Therefore, the future Covid-19 is predicted through the Prophet algorithm using Kaggle's dataset, and the explanatory power for this prediction is added through the coefficient of determination, mean absolute error, mean percent error, mean square difference, and mean square deviation through Scikit-learn. Moreover, in the absence of a specific incident rapidly increasing the cases of Covid-19, the proposed method predicts the number of infected people in Korea and emphasizes the importance of implementing epidemic prevention and quarantine rules for future diseases.