• Title/Summary/Keyword: rRMSE

Search Result 515, Processing Time 0.025 seconds

Estimating pile setup parameter using XGBoost-based optimized models

  • Xigang Du;Ximeng Ma;Chenxi Dong;Mehrdad Sattari Nikkhoo
    • Geomechanics and Engineering
    • /
    • v.36 no.3
    • /
    • pp.259-276
    • /
    • 2024
  • The undrained shear strength is widely acknowledged as a fundamental mechanical property of soil and is considered a critical engineering parameter. In recent years, researchers have employed various methodologies to evaluate the shear strength of soil under undrained conditions. These methods encompass both numerical analyses and empirical techniques, such as the cone penetration test (CPT), to gain insights into the properties and behavior of soil. However, several of these methods rely on correlation assumptions, which can lead to inconsistent accuracy and precision. The study involved the development of innovative methods using extreme gradient boosting (XGB) to predict the pile set-up component "A" based on two distinct data sets. The first data set includes average modified cone point bearing capacity (qt), average wall friction (fs), and effective vertical stress (σvo), while the second data set comprises plasticity index (PI), soil undrained shear cohesion (Su), and the over consolidation ratio (OCR). These data sets were utilized to develop XGBoost-based methods for predicting the pile set-up component "A". To optimize the internal hyperparameters of the XGBoost model, four optimization algorithms were employed: Particle Swarm Optimization (PSO), Social Spider Optimization (SSO), Arithmetic Optimization Algorithm (AOA), and Sine Cosine Optimization Algorithm (SCOA). The results from the first data set indicate that the XGBoost model optimized using the Arithmetic Optimization Algorithm (XGB - AOA) achieved the highest accuracy, with R2 values of 0.9962 for the training part and 0.9807 for the testing part. The performance of the developed models was further evaluated using the RMSE, MAE, and VAF indices. The results revealed that the XGBoost model optimized using XGBoost - AOA outperformed other models in terms of accuracy, with RMSE, MAE, and VAF values of 0.0078, 0.0015, and 99.6189 for the training part and 0.0141, 0.0112, and 98.0394 for the testing part, respectively. These findings suggest that XGBoost - AOA is the most accurate model for predicting the pile set-up component.

Optimization of forensic identification through 3-dimensional imaging analysis of labial tooth surface using open-source software

  • Arofi Kurniawan;Aspalilah Alias;Mohd Yusmiaidil Putera Mohd Yusof;Anand Marya
    • Imaging Science in Dentistry
    • /
    • v.54 no.1
    • /
    • pp.63-69
    • /
    • 2024
  • Purpose: The objective of this study was to determine the minimum number of teeth in the anterior dental arch that would yield accurate results for individual identification in forensic contexts. Materials and Methods: The study involved the analysis of 28 sets of 3-dimensional (3D) point cloud data, focused on the labial surface of the anterior teeth. These datasets were superimposed within each group in both genuine and imposter pairs. Group A incorporated data from the right to the left central incisor, group B from the right to the left lateral incisor, and group C from the right to the left canine. A comprehensive analysis was conducted, including the evaluation of root mean square error (RMSE) values and the distances resulting from the superimposition of dental arch segments. All analyses were conducted using CloudCompare version 2.12.4 (Telecom ParisTech and R&D, Kyiv, Ukraine). Results: The distances between genuine pairs in groups A, B, and C displayed an average range of 0.153 to 0.184mm. In contrast, distances for imposter pairs ranged from 0.338 to 0.522 mm. RMSE values for genuine pairs showed an average range of 0.166 to 0.177, whereas those for imposter pairs ranged from 0.424 to 0.638. A statistically significant difference was observed between the distances of genuine and imposter pairs(P<0.05). Conclusion: The exceptional performance observed for the labial surfaces of anterior teeth underscores their potential as a dependable criterion for accurate 3D dental identification. This was achieved by assessing a minimum of 4 teeth.

Development of a soil total carbon prediction model using a multiple regression analysis method

  • Jun-Hyuk, Yoo;Jwa-Kyoung, Sung;Deogratius, Luyima;Taek-Keun, Oh;Jaesung, Cho
    • Korean Journal of Agricultural Science
    • /
    • v.48 no.4
    • /
    • pp.891-897
    • /
    • 2021
  • There is a need for a technology that can quickly and accurately analyze soil carbon contents. Existing soil carbon analysis methods are cumbersome in terms of professional manpower requirements, time, and cost. It is against this background that the present study leverages the soil physical properties of color and water content levels to develop a model capable of predicting the carbon content of soil sample. To predict the total carbon content of soil, the RGB values, water content of the soil, and lux levels were analyzed and used as statistical data. However, when R, G, and B with high correlations were all included in a multiple regression analysis as independent variables, a high level of multicollinearity was noted and G was thus excluded from the model. The estimates showed that the estimation coefficients for all independent variables were statistically significant at a significance level of 1%. The elastic values of R and B for the soil carbon content, which are of major interest in this study, were -2.90 and 1.47, respectively, showing that a 1% increase in the R value was correlated with a 2.90% decrease in the carbon content, whereas a 1% increase in the B value tallied with a 1.47% increase in the carbon content. Coefficient of determination (R2), root mean square error (RMSE), and mean absolute percentage error (MAPE) methods were used for regression verification, and calibration samples showed higher accuracy than the validation samples in terms of R2 and MAPE.

Soft computing based mathematical models for improved prediction of rock brittleness index

  • Abiodun I. Lawal;Minju Kim;Sangki Kwon
    • Geomechanics and Engineering
    • /
    • v.33 no.3
    • /
    • pp.279-289
    • /
    • 2023
  • Brittleness index (BI) is an important property of rocks because it is a good index to predict rockburst. Due to its importance, several empirical and soft computing (SC) models have been proposed in the literature based on the punch penetration test (PPT) results. These models are very important as there is no clear-cut experimental means for measuring BI asides the PPT which is very costly and time consuming to perform. This study used a novel Multivariate Adaptive regression spline (MARS), M5P, and white-box ANN to predict the BI of rocks using the available data in the literature for an improved BI prediction. The rock density, uniaxial compressive strength (σc) and tensile strength (σt) were used as the input parameters into the models while the BI was the targeted output. The models were implemented in the MATLAB software. The results of the proposed models were compared with those from existing multilinear regression, linear and nonlinear particle swarm optimization (PSO) and genetic algorithm (GA) based models using similar datasets. The coefficient of determination (R2), adjusted R2 (Adj R2), root-mean squared error (RMSE) and mean absolute percentage error (MAPE) were the indices used for the comparison. The outcomes of the comparison revealed that the proposed ANN and MARS models performed better than the other models with R2 and Adj R2 values above 0.9 and least error values while the M5P gave similar performance to those of the existing models. Weight partitioning method was also used to examine the percentage contribution of model predictors to the predicted BI and tensile strength was found to have the highest influence on the predicted BI.

Mathematical Models to Predict Staphylococcus aureus Growth on Processed Cheeses

  • Kim, Kyungmi;Lee, Heeyoung;Moon, Jinsan;Kim, Youngjo;Heo, Eunjeong;Park, Hyunjung;Yoon, Yohan
    • Journal of Food Hygiene and Safety
    • /
    • v.28 no.3
    • /
    • pp.217-221
    • /
    • 2013
  • This study developed predictive models for the kinetic behavior of Staphylococcus aureus on processed cheeses. Mozzarella slice cheese and cheddar slice cheese were inoculated with 0.1 ml of a S. aureus strain mixture (ATCC13565, ATCC14458, ATCC23235, ATCC27664, and NCCP10826). The inoculated samples were then stored at $4^{\circ}C$ (1440 h), $15^{\circ}C$ (288 h), $25^{\circ}C$ (72 h), and $30^{\circ}C$ (48 h), and the growth of all bacteria and of S. aureus were enumerated on tryptic soy agar and mannitol salt agar, respectively. The Baranyi model was fitted to the growth data of S. aureus to calculate growth rate (${\mu}_{max}$; ${\log}CFU{\cdot}g^{-1}{\cdot}h^{-1}$), lag phase duration (LPD; h), lower asymptote (log CFU/g), and upper asymptote (log CFU/g). The growth parameters were further analyzed using the square root model as a function of temperature. The model performance was validated with observed data, and the root mean square error (RMSE) was calculated. At $4^{\circ}C$, S. aureus cell growth was not observed on either processed cheese, but S. aureus growth on the mozzarella and cheddar cheeses was observed at $15^{\circ}C$, $25^{\circ}C$, and $30^{\circ}C$. The ${\mu}_{max}$ values increased, but LPD values decreased as storage temperature increased. In addition, the developed models showed acceptable performance (RMSE = 0.3500-0.5344). This result indicates that the developed kinetic model should be useful in describing the growth pattern of S. aureus in processed cheeses.

Evaluation of the Satellite-based Air Temperature for All Sky Conditions Using the Automated Mountain Meteorology Station (AMOS) Records: Gangwon Province Case Study (산악기상관측정보를 이용한 위성정보 기반의 전천후 기온 자료의 평가 - 강원권역을 중심으로)

  • Jang, Keunchang;Won, Myoungsoo;Yoon, Sukhee
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.19 no.1
    • /
    • pp.19-26
    • /
    • 2017
  • Surface air temperature ($T_{air}$) is a key variable for the meteorology and climatology, and is a fundamental factor of the terrestrial ecosystem functions. Satellite remote sensing from the Moderate Resolution Imaging Spectroradiometer (MODIS) provides an opportunity to monitor the $T_{air}$. However, the several problems such as frequent cloud cover and mountainous region can result in substantial retrieval error and signal loss in MODIS $T_{air}$. In this study, satellite-based $T_{air}$ was estimated under both clear and cloudy sky conditions in Gangwon Province using Aqua MODIS07 temperature profile product (MYD07_L2) and GCOM-W1 Advanced Microwave Scanning Radiometer 2 (AMSR2) brightness temperature ($T_b$) at 37 GHz frequency, and was compared with the measurements from the Automated Mountain Meteorology Stations (AMOS). The application of ambient temperature lapse rate was performed to improve the retrieval accuracy in mountainous region, which showed the improvement of estimation accuracy approximately 4% of RMSE. A simple pixel-wise regression method combining synergetic information from MYD07_L2 $T_{air}$ and AMSR2 $T_b$ was applied to estimate surface $T_{air}$ for all sky conditions. The $T_{air}$ retrievals showed favorable agreement in comparison with AMOS data (r=0.80, RMSE=7.9K), though the underestimation was appeared in winter season. Substantial $T_{air}$ retrievals were estimated 61.4% (n=2,657) for cloudy sky conditions. The results presented in this study indicate that the satellite remote sensing can produce the surface $T_{air}$ at the complex mountainous region for all sky conditions.

Development of a deep neural network model to estimate solar radiation using temperature and precipitation (온도와 강수를 이용하여 일별 일사량을 추정하기 위한 심층 신경망 모델 개발)

  • Kang, DaeGyoon;Hyun, Shinwoo;Kim, Kwang Soo
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.21 no.2
    • /
    • pp.85-96
    • /
    • 2019
  • Solar radiation is an important variable for estimation of energy balance and water cycle in natural and agricultural ecosystems. A deep neural network (DNN) model has been developed in order to estimate the daily global solar radiation. Temperature and precipitation, which would have wider availability from weather stations than other variables such as sunshine duration, were used as inputs to the DNN model. Five-fold cross-validation was applied to train and test the DNN models. Meteorological data at 15 weather stations were collected for a long term period, e.g., > 30 years in Korea. The DNN model obtained from the cross-validation had relatively small value of RMSE ($3.75MJ\;m^{-2}\;d^{-1}$) for estimates of the daily solar radiation at the weather station in Suwon. The DNN model explained about 68% of variation in observed solar radiation at the Suwon weather station. It was found that the measurements of solar radiation in 1985 and 1998 were considerably low for a small period of time compared with sunshine duration. This suggested that assessment of the quality for the observation data for solar radiation would be needed in further studies. When data for those years were excluded from the data analysis, the DNN model had slightly greater degree of agreement statistics. For example, the values of $R^2$ and RMSE were 0.72 and $3.55MJ\;m^{-2}\;d^{-1}$, respectively. Our results indicate that a DNN would be useful for the development a solar radiation estimation model using temperature and precipitation, which are usually available for downscaled scenario data for future climate conditions. Thus, such a DNN model would be useful for the impact assessment of climate change on crop production where solar radiation is used as a required input variable to a crop model.

Evaluation of Sensitivity and Retrieval Possibility of Land Surface Temperature in the Mid-infrared Wavelength through Radiative Transfer Simulation (복사전달모의를 통한 중적외 파장역의 민감도 분석 및 지표면온도 산출 가능성 평가)

  • Choi, Youn-Young;Suh, Myoung-Seok;Cha, DongHwan;Seo, DooChun
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_1
    • /
    • pp.1423-1444
    • /
    • 2022
  • In this study, the sensitivity of the mid-infrared radiance to atmospheric and surface factors was analyzed using the radiative transfer model, MODerate resolution atmospheric TRANsmission (MODTRAN6)'s simulation data. The possibility of retrieving the land surface temperature (LST) using only the mid-infrared bands at night was evaluated. Based on the sensitivity results, the LST retrieval algorithm that reflects various factors for night was developed, and the level of the LST retrieval algorithm was evaluated using reference LST and observed LST. Sensitivity experiments were conducted on the atmospheric profiles, carbon dioxide, ozone, diurnal variation of LST, land surface emissivity (LSE), and satellite viewing zenith angle (VZA), which mainly affect satellite remote sensing. To evaluate the possibility of using split-window method, the mid-infrared wavelength was divided into two bands based on the transmissivity. Regardless of the band, the top of atmosphere (TOA) temperature is most affected by atmospheric profile, and is affected in order of LSE, diurnal variation of LST, and satellite VZA. In all experiments, band 1, which corresponds to the atmospheric window, has lower sensitivity, whereas band 2, which includes ozone and water vapor absorption, has higher sensitivity. The evaluation results for the LST retrieval algorithm using prescribed LST showed that the correlation coefficient (CC), the bias and the root mean squared error (RMSE) is 0.999, 0.023K and 0.437K, respectively. Also, the validation with 26 in-situ observation data in 2021 showed that the CC, bias and RMSE is 0.993, 1.875K and 2.079K, respectively. The results of this study suggest that the LST can be retrieved using different characteristics of the two bands of mid-infrared to the atmospheric and surface conditions at night. Therefore, it is necessary to retrieve the LST using satellite data equipped with sensors in the mid-infrared bands.

A new model for curbing filtrate loss in dynamic application of nano-treated aqueous mud systems

  • Okoro, Emmanuel E.;Oladejo, Bukola R.;Sanni, Samuel E.;Obomanu, Tamunotonjo;Ibe, Amarachukwu A.;Orodu, Oyinkepreye D.;Olawole, Olukunle C.
    • Advances in nano research
    • /
    • v.9 no.1
    • /
    • pp.59-67
    • /
    • 2020
  • Filter cake formation during rotary drilling operation is an unavoidable scenario, hence there is need for constant improvement in the approaches used in monitoring the cake thickness growth in order to prevent drill-string sticking. This study proposes an improved model that predicts the growth of mud cake thickness overtime with the consideration of the addition of nanoparticles in the formulated drilling fluid system. Ferric oxide, titanium dioxide and copper oxide nanoparticles were used in varying amounts (2 g, 4 g and 6 g), and filtration data were obtained from the HPHT filtration test. The filter cakes formed were further analyzed with scanning electron microscope to obtain the morphological characteristics. The data obtained was used to validate the new filtrate loss model. This model specifically presents the concept of time variation in filter cake formation as against the previous works of constant and definite time. Regression coefficient which is a statistical measure was used to validate the new model and the predicted results were compared with the API model. The new model showed R2 values of 99.9%, and the predictions from the proposed filtration model can be said to be more closely related to the experimental data than that predicted from the API model from the SSE and RMSE results.

A Comparative Study Between Linear Regression and Support Vector Regression Model Based on Environmental Factors of a Smart Bee Farm

  • Rahman, A. B. M. Salman;Lee, MyeongBae;Venkatesan, Saravanakumar;Lim, JongHyun;Shin, ChangSun
    • Smart Media Journal
    • /
    • v.11 no.5
    • /
    • pp.38-47
    • /
    • 2022
  • Honey is one of the most significant ingredients in conventional food production in different regions of the world. Honey is commonly used as an ingredient in ethnic food. Beekeeping is performed in various locations as part of the local food culture and an occupation related to pollinator production. It is important to conduct beekeeping so that it generates food culture and helps regulate the regional environment in an integrated manner in preserving and improving local food culture. This study analyzes different types of environmental factors of a smart bee farm. The major goal of this study is to determine the best prediction model between the linear regression model (LM) and the support vector regression model (SVR) based on the environmental factors of a smart bee farm. The performance of prediction models is measured by R2 value, root mean squared error (RMSE), and mean absolute error (MAE). From all analysis reports, the best prediction model is the support vector regression model (SVR) with a low coefficient of variation, and the R2 values for Farm inside temperature, bee box inside temperature, and Farm inside humidity are 0.97, 0.96, and 0.44.