• Title/Summary/Keyword: 10-fold Validation


Several models for tunnel boring machine performance prediction based on machine learning

  • Mahmoodzadeh, Arsalan;Nejati, Hamid Reza;Ibrahim, Hawkar Hashim;Ali, Hunar Farid Hama;Mohammed, Adil Hussein;Rashidi, Shima;Majeed, Mohammed Kamal
    • Geomechanics and Engineering / v.30 no.1 / pp.75-91 / 2022
  • This paper aims to show how several machine learning (ML) methods can be used to systematically estimate the TBM penetration rate (TBM-PR). To this end, 1125 datasets including uniaxial compressive strength (UCS), Brazilian tensile strength (BTS), punch slope index (PSI), distance between the planes of weakness (DPW), orientation of discontinuities (alpha angle, α), rock fracture class (RFC), and actual/measured TBM-PRs were established. The ML methods were evaluated using 5-fold cross-validation. Comparing the ML outcomes with the TBM monitoring data indicated that the ML methods have very good potential for predicting TBM-PR. The long short-term memory model, with a correlation coefficient of 0.9932 and a root mean square error of 2.68E-6, outperformed the remaining six ML algorithms. The backward selection method showed that PSI and RFC were, respectively, the most and least significant parameters for the TBM-PR.
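The 5-fold cross-validation mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' code: the function name `k_fold_indices`, the fixed shuffling seed, and the near-equal fold sizes are assumptions made for illustration. Only the sample count (1125) and the fold count (5) come from the abstract.

```python
import random

def k_fold_indices(n_samples, k, seed=0):
    """Return k (train, test) index pairs; each sample is tested exactly once."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # hypothetical fixed seed for repeatability
    # Fold sizes differ by at most one when n_samples is not divisible by k.
    base, extra = divmod(n_samples, k)
    folds, start = [], 0
    for i in range(k):
        size = base + (1 if i < extra else 0)
        folds.append(indices[start:start + size])
        start += size
    splits = []
    for i in range(k):
        test = folds[i]
        # Training indices are all samples outside the held-out fold.
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        splits.append((train, test))
    return splits

# 1125 samples and 5 folds, as stated in the abstract.
splits = k_fold_indices(1125, 5)
```

Each of the five splits holds out 225 samples for testing and trains on the other 900, so every sample contributes to exactly one test fold.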

Machine Learning Methods for Trust-based Selection of Web Services

  • Hasnain, Muhammad;Ghani, Imran;Pasha, Muhammad F.;Jeong, Seung R.
    • KSII Transactions on Internet and Information Systems (TIIS) / v.16 no.1 / pp.38-59 / 2022
  • Web service instances can be classified by users into two categories: trusted and untrusted. A web service whose instances show high throughput (TP) and low response time (RT) is a trusted web service. Web services become untrustworthy when the guaranteed instance values do not match the values actually achieved by users. To select web services from the TP and RT values attained by users, we need to verify that trusted and untrusted instances of invoked web services are predicted correctly. This accurate prediction of web service instances is then used to select web services. We propose constructing fuzzy rules to label web service instances correctly. This paper presents web service selection using a well-known machine learning algorithm, REPTree, for the correct prediction of trusted and untrusted instances. REPTree is compared with five machine learning models on web services datasets. The experiments on these datasets used 10-fold cross-validation. To evaluate the performance of the REPTree classifier, we used accuracy metrics (sensitivity and specificity). Experimental results showed that web service WS1 gained the top selection score with 47.0588% trusted instances, and web service WS2 was selected the least, with 25.00% trusted instances. The evaluation of the proposed web service selection approach yielded an asymptotic significance of 0.019, demonstrating the relationship between the final selection and the recommended trust score of web services.
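Sensitivity and specificity, the two metrics named in this abstract, can be computed from predicted and true labels as in the following minimal sketch. The function name, the choice of "trusted" as the positive class, and the example labels are all assumptions for illustration; none of the data comes from the paper.

```python
def sensitivity_specificity(y_true, y_pred, positive="trusted"):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity

# Made-up labels, not taken from the web services datasets.
y_true = ["trusted", "trusted", "untrusted", "untrusted", "trusted", "untrusted"]
y_pred = ["trusted", "untrusted", "untrusted", "trusted", "trusted", "untrusted"]
sens, spec = sensitivity_specificity(y_true, y_pred)
```

In this toy case two of three trusted instances and two of three untrusted instances are predicted correctly, so both metrics equal 2/3.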

Classification method for failure modes of RC columns based on key characteristic parameters

  • Yu, Bo;Yu, Zecheng;Li, Qiming;Li, Bing
    • Structural Engineering and Mechanics / v.84 no.1 / pp.1-16 / 2022
  • An efficient and accurate classification method for failure modes of reinforced concrete (RC) columns was proposed based on key characteristic parameters. The weight coefficients of seven characteristic parameters for failure modes of RC columns were determined first based on the support vector machine-recursive feature elimination. Then key characteristic parameters for classifying flexure, flexure-shear and shear failure modes of RC columns were selected respectively. Subsequently, a support vector machine with key characteristic parameters (SVM-K) was proposed to classify three types of failure modes of RC columns. The optimal parameters of SVM-K were determined by using the ten-fold cross-validation and the grid-search algorithm based on 270 sets of available experimental data. Results indicate that the proposed SVM-K has high overall accuracy, recall and precision (e.g., accuracy>95%, recall>90%, precision>90%), which means that the proposed SVM-K has superior performance for classification of failure modes of RC columns. Based on the selected key characteristic parameters for different types of failure modes of RC columns, the accuracy of SVM-K is improved and the decision function of SVM-K is simplified by reducing the dimensions and number of support vectors.
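How grid search combines with ten-fold cross-validation can be sketched as follows. This is an illustration only: a simple one-feature k-nearest-neighbour classifier stands in for the SVM used in the paper, and the data, the candidate grid, and all function names are hypothetical.

```python
from collections import Counter

def knn_predict(train, x, k):
    """Majority label among the k training points nearest to x (1-D feature)."""
    nearest = sorted(train, key=lambda pair: abs(pair[0] - x))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

def cv_accuracy(data, k_neighbors, n_folds=10):
    """Mean accuracy over n_folds folds; sample i is assigned to fold i % n_folds."""
    scores = []
    for fold in range(n_folds):
        test = [d for i, d in enumerate(data) if i % n_folds == fold]
        train = [d for i, d in enumerate(data) if i % n_folds != fold]
        correct = sum(knn_predict(train, x, k_neighbors) == y for x, y in test)
        scores.append(correct / len(test))
    return sum(scores) / n_folds

def grid_search(data, grid, n_folds=10):
    """Score every candidate hyperparameter by cross-validation and keep the best."""
    results = {k: cv_accuracy(data, k, n_folds) for k in grid}
    best = max(results, key=results.get)
    return best, results

# Synthetic two-class data (not the 270 experimental records from the paper).
data = [(float(i), "shear" if i < 30 else "flexure") for i in range(60)]
best_k, results = grid_search(data, grid=[1, 3, 5])
```

The design point is that every candidate in the grid is scored on the same folds, so the comparison between hyperparameter values is not affected by a different random split per candidate.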

Form-finding of lifting self-forming GFRP elastic gridshells based on machine learning interpretability methods

  • Soheila, Kookalani;Sandy, Nyunn;Sheng, Xiang
    • Structural Engineering and Mechanics / v.84 no.5 / pp.605-618 / 2022
  • Glass fiber reinforced polymer (GFRP) elastic gridshells consist of long continuous GFRP tubes that form elastic deformations. In this paper, a method for the form-finding of gridshell structures is presented based on interpretable machine learning (ML) approaches. A comparative study is conducted on several ML algorithms, including support vector regression (SVR), K-nearest neighbors (KNN), decision tree (DT), random forest (RF), AdaBoost, XGBoost, category boosting (CatBoost), and light gradient boosting machine (LightGBM). A numerical example is presented using a standard double-hump gridshell, considering two characteristics of deformation as objective functions. The grid search approach is combined with k-fold cross-validation (CV) to fine-tune the parameters of the ML models. The results of the comparative study indicate that the LightGBM model achieves the highest prediction accuracy. Finally, interpretable ML approaches, including Shapley additive explanations (SHAP), partial dependence plot (PDP), and accumulated local effects (ALE), are applied to explain the predictions of the ML model, since it is essential to understand the effect of various values of the input parameters on the objective functions. As a result of the interpretability approaches, an optimum gridshell structure is obtained and new opportunities are verified for form-finding investigation of GFRP elastic gridshells during lifting construction.

Gaussian process regression model to predict factor of safety of slope stability

  • Arsalan, Mahmoodzadeh;Hamid Reza, Nejati;Nafiseh, Rezaie;Adil Hussein, Mohammed;Hawkar Hashim, Ibrahim;Mokhtar, Mohammadi;Shima, Rashidi
    • Geomechanics and Engineering / v.31 no.5 / pp.453-460 / 2022
  • It is essential for geotechnical engineers to study and predict the stability of slopes, since the collapse of a slope may result in catastrophic events. In this study, the Gaussian process regression (GPR) approach was used to predict the factor of safety (FOS) of slopes. The model uses a total of 327 slope cases from Iran, each with a unique combination of geometric and shear strength parameters, which were analyzed with PLAXIS software to determine their FOS. K-fold cross-validation (CV) with K = 5 was used to assess the accuracy of the models' predictions. The GPR model showed excellent ability in predicting the FOS of slope stability, with an R² value of 0.8355, an RMSE of 0.1372, and a MAPE of 6.6389%. According to the results of the sensitivity analysis, the characteristics (friction angle) and (unit weight) are, in descending order, the most effective, the next most effective, and the least effective parameters for determining slope stability.

Enhancing prediction accuracy of concrete compressive strength using stacking ensemble machine learning

  • Yunpeng Zhao;Dimitrios Goulias;Setare Saremi
    • Computers and Concrete / v.32 no.3 / pp.233-246 / 2023
  • Accurate prediction of concrete compressive strength can minimize the need for extensive, time-consuming, and costly mixture optimization testing and analysis. This study attempts to enhance the prediction accuracy of compressive strength using stacking ensemble machine learning (ML) with feature engineering techniques. Seven alternative ML models of increasing complexity were implemented and compared: linear regression, SVM, decision tree, multilayer perceptron, random forest, XGBoost, and AdaBoost. To further improve the prediction accuracy, an ML pipeline was proposed in which the feature engineering technique was implemented, and a two-layer stacked model was developed. The k-fold cross-validation approach was employed to optimize model parameters and train the stacked model. The stacked model showed superior performance in predicting concrete compressive strength, with a coefficient of determination (R²) of 0.985. Feature (i.e., variable) importance was determined to demonstrate how useful the synthetic features are in prediction and to provide better interpretability of the data and the model. The methodology in this study promotes a more thorough assessment of alternative ML algorithms rather than a focus on any single ML model type for concrete compressive strength prediction.

Development of Prediction Model of Chloride Diffusion Coefficient using Machine Learning (기계학습을 이용한 염화물 확산계수 예측모델 개발)

  • Kim, Hyun-Su
    • Journal of Korean Association for Spatial Structures / v.23 no.3 / pp.87-94 / 2023
  • Chloride is one of the most common threats to reinforced concrete (RC) durability. The alkaline environment of concrete forms a passive layer on the surface of reinforcement bars that protects them from corrosion. However, when the chloride concentration at the reinforcement bar reaches a certain level, the passive protection layer deteriorates, causing corrosion and ultimately reducing the structure's safety and durability. Therefore, understanding and predicting chloride diffusion is important for evaluating the safety and durability of RC structures. In this study, the chloride diffusion coefficient is predicted by machine learning techniques. Multiple linear regression, decision tree, random forest, support vector machine, artificial neural networks, extreme gradient boosting, and k-nearest neighbor were used, and the accuracy of these models was compared. To evaluate the accuracy, root mean square error (RMSE), mean square error (MSE), mean absolute error (MAE), and coefficient of determination (R²) were used as prediction performance indices. The k-fold cross-validation procedure was used to estimate the performance of the machine learning models when making predictions on data not used during training. Grid search was applied for hyperparameter optimization. Numerical simulation showed that ensemble learning methods such as random forest and extreme gradient boosting successfully predicted the chloride diffusion coefficient, and artificial neural networks also provided accurate results.
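The four performance indices named in this abstract follow standard definitions, which the sketch below computes in plain Python. The function name and the sample values are assumptions for illustration; they are not data from the study.

```python
import math

def regression_metrics(y_true, y_pred):
    """Return MSE, RMSE, MAE and R^2 for paired true and predicted values."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mse = sum(e * e for e in errors) / n
    mae = sum(abs(e) for e in errors) / n
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)                  # residual sum of squares
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)   # total sum of squares
    r2 = 1.0 - ss_res / ss_tot
    return {"MSE": mse, "RMSE": math.sqrt(mse), "MAE": mae, "R2": r2}

# Made-up values for illustration only.
y_true = [1.0, 2.0, 3.0, 4.0]
y_pred = [1.5, 2.0, 2.5, 4.0]
metrics = regression_metrics(y_true, y_pred)
```

Under k-fold cross-validation these indices would typically be computed on each held-out fold and then averaged, although the abstract does not state how the authors aggregated them.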

Identification of Pb-Zn ore under the condition of low count rate detection of slim hole based on PGNAA technology

  • Haolong Huang;Pingkun Cai;Wenbao Jia;Yan Zhang
    • Nuclear Engineering and Technology / v.55 no.5 / pp.1708-1717 / 2023
  • The grade analysis of lead-zinc ore is the basis for the optimal development and utilization of deposits. In this study, a method combining Prompt Gamma Neutron Activation Analysis (PGNAA) technology and machine learning is proposed for lead-zinc mine borehole logging, which can identify lead-zinc ores of different grades and gangue in the formation, providing real-time grade information qualitatively and semi-quantitatively. Firstly, Monte Carlo simulation is used to obtain a gamma-ray spectrum data set for training and testing machine learning classification algorithms. These spectra are broadened, normalized and separated into inelastic scattering and capture spectra, and then used to fit different classifier models. When the comprehensive grade boundary of high- and low-grade ores is set to 5%, the evaluation metrics calculated by the 5-fold cross-validation show that the SVM (Support Vector Machine), KNN (K-Nearest Neighbor), GNB (Gaussian Naive Bayes) and RF (Random Forest) models can effectively distinguish lead-zinc ore from gangue. At the same time, the GNB model has achieved the optimal accuracy of 91.45% when identifying high- and low-grade ores, and the F1 score for both types of ores is greater than 0.9.

Novel potential drugs for the treatment of primary open-angle glaucoma using protein-protein interaction network analysis

  • Parisima Ghaffarian Zavarzadeh;Zahra Abedi
    • Genomics & Informatics / v.21 no.1 / pp.6.1-6.8 / 2023
  • Glaucoma is the second leading cause of irreversible blindness, and primary open-angle glaucoma (POAG) is the most common type. Due to inadequate diagnosis, treatment is often not administered until symptoms occur. Hence, approaches enabling earlier prediction or diagnosis of POAG are necessary. We aimed to identify novel drugs for glaucoma through bioinformatics and network analysis. Data from 36 samples, obtained from the trabecular meshwork of healthy individuals and patients with POAG, were acquired from a dataset. Next, differentially expressed genes (DEGs) were identified to construct a protein-protein interaction (PPI) network. In both stages, the genes were enriched by studying the critical biological processes and pathways related to POAG. Finally, a drug-gene network was constructed, and novel drugs for POAG treatment were proposed. Genes with p < 0.01 and |log fold change| > 0.3 (1,350 genes) were considered DEGs and utilized to construct a PPI network. Enrichment analysis yielded several key pathways that were upregulated or downregulated. For example, extracellular matrix organization, the immune system, neutrophil degranulation, and cytokine signaling were upregulated among immune pathways, while signal transduction, the immune system, extracellular matrix organization, and receptor tyrosine kinase signaling were downregulated. Finally, novel drugs including metformin hydrochloride, ixazomib citrate, and cisplatin warrant further analysis of their potential roles in POAG treatment. The candidate drugs identified in this computational analysis require in vitro and in vivo validation to confirm their effectiveness in POAG treatment. This may pave the way for understanding life-threatening disorders such as cancer.
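The DEG selection rule stated in this abstract (p < 0.01 and |log fold change| > 0.3) amounts to a simple filter, sketched below. The record layout, the function name `select_degs`, and the gene entries are hypothetical; only the two cutoffs come from the abstract.

```python
def select_degs(genes, p_cutoff=0.01, lfc_cutoff=0.3):
    """Keep genes with p < p_cutoff and |log fold change| > lfc_cutoff."""
    return [
        g["gene"]
        for g in genes
        if g["p_value"] < p_cutoff and abs(g["log_fc"]) > lfc_cutoff
    ]

# Placeholder records, not real expression data.
genes = [
    {"gene": "GENE_A", "p_value": 0.001, "log_fc": 0.8},    # passes both cutoffs
    {"gene": "GENE_B", "p_value": 0.2, "log_fc": 1.5},      # fails the p-value cutoff
    {"gene": "GENE_C", "p_value": 0.005, "log_fc": -0.45},  # passes (downregulated)
    {"gene": "GENE_D", "p_value": 0.004, "log_fc": 0.1},    # fails the fold-change cutoff
]
degs = select_degs(genes)
```

Taking the absolute value of the log fold change keeps both upregulated and downregulated genes, which matches the abstract's description of pathways in both directions.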

LSTM based Gait Phase Estimation Method Robust to Changes in Gait Speed (LSTM 기반 보행 속도 변화에 강인한 웨어러블 로봇의 보행 위상 추정 방법)

  • Kim, Ho-Bin;Lee, Jong-Bok;Kim, Sun-Woo;Kim, Sang-Do;Park, Shin-Suk;Kim, KangGeon;Lee, Jongwon
    • Annual Conference of KIPS / 2022.11a / pp.429-431 / 2022
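  • To maximize the muscle-strength assistance performance of a lower-limb wearable robot, gait phase estimation technology that recognizes the wearer's gait state is essential. In this paper, we developed an LSTM-based robust gait phase recognition technique that estimates gait phase reliably despite changes in the wearer's gait speed and differences in gait characteristics between wearers. Training was performed on gait sensor data from a total of five people wearing a wearable hip-assist robot, collected on a treadmill and during outdoor overground walking. We derived a combination of wearable sensors that enables precise gait phase estimation at various walking speeds, including slow and fast walking, and obtained a gait phase recognition precision of approximately 1.68% RMSE under 5-fold cross-validation.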