• Title/Summary/Keyword: GridSearchCV

Search Result 5, Processing Time 0.02 seconds

A Study on the Drug Classification Using Machine Learning Techniques (머신러닝 기법을 이용한 약물 분류 방법 연구)

  • Anmol Kumar Singh;Ayush Kumar;Adya Singh;Akashika Anshum;Pradeep Kumar Mallick
    • Advanced Industrial SCIence
    • /
    • v.3 no.2
    • /
    • pp.8-16
    • /
    • 2024
  • This paper shows the system of drug classification, the goal of this is to foretell the apt drug for the patients based on their demographic and physiological traits. The dataset consists of various attributes like Age, Sex, BP (Blood Pressure), Cholesterol Level, and Na_to_K (Sodium to Potassium ratio), with the objective to determine the kind of drug being given. The models used in this paper are K-Nearest Neighbors (KNN), Logistic Regression and Random Forest. Further to fine-tune hyper parameters using 5-fold cross-validation, GridSearchCV was used and each model was trained and tested on the dataset. To assess the performance of each model both with and without hyper parameter tuning evaluation metrics like accuracy, confusion matrices, and classification reports were used and the accuracy of the models without GridSearchCV was 0.7, 0.875, 0.975 and with GridSearchCV was 0.75, 1.0, 0.975. According to GridSearchCV Logistic Regression is the most suitable model for drug classification among the three-model used followed by the K-Nearest Neighbors. Also, Na_to_K is an essential feature in predicting the outcome.

Application of a support vector machine for prediction of piping and internal stability of soils

  • Xue, Xinhua
    • Geomechanics and Engineering
    • /
    • v.18 no.5
    • /
    • pp.493-502
    • /
    • 2019
  • Internal stability is an important safety issue for levees, embankments, and other earthen structures. Since a large part of the world's population lives near oceans, lakes and rivers, floods resulting from breaching of dams can lead to devastating disasters with tremendous loss of life and property, especially in densely populated areas. There are some main factors that affect the internal stability of dams, levees and other earthen structures, such as the erodibility of the soil, the water velocity inside the soil mass and the geometry of the earthen structure, etc. Thus, the mechanism of internal erosion and stability of soils is very complicated and it is vital to investigate the assessment methods of internal stability of soils in embankment dams and their foundations. This paper presents an improved support vector machine (SVM) model to predict the internal stability of soils. The grid search algorithm (GSA) is employed to find the optimal parameters of SVM firstly, and then the cross - validation (CV) method is employed to estimate the classification accuracy of the GSA-SVM model. Two examples of internal stability of soils are presented to validate the predictive capability of the proposed GSA-SVM model. In addition to verify the effectiveness of the proposed GSA-SVM model, the predictions from the proposed GSA-SVM model were compared with those from the traditional back propagation neural network (BPNN) model. The results showed that the proposed GSA-SVM model is a feasible and efficient tool for assessing the internal stability of soils with high accuracy.

In-situ stresses ring hole measurement of concrete optimized based on finite element and GBDT algorithm

  • Chen Guo;Zheng Yang;Yanchao Yue;Wenxiao Li;Hantao Wu
    • Computers and Concrete
    • /
    • v.34 no.4
    • /
    • pp.477-487
    • /
    • 2024
  • The in-situ stresses of concrete are an essential index for assessing the safety performance of concrete structures. Conventional methods for pore pressure release often face challenges in selecting drilling ring parameters, uncontrollable stress release, and unstable detection accuracy. In this paper, the parameters affecting the results of the concrete ring hole stress release method are cross-combined, and finite elements are used to simulate the combined parameters and extract the stress release values to establish a training set. The GridSearchCV function is utilized to determine the optimal hyperparameters. The mean absolute error (MAE), root mean square error (RMSE), and coefficient of determination (R2) are used as evaluation indexes to train the gradient boosting decision tree (GBDT) algorithm, and the other three common algorithms are compared. The RMSE of the GBDT algorithm for the test set is 4.499, and the R2 of the GBDT algorithm for the test set is 0.962, which is 9.66% higher than the R2 of the best-performing comparison algorithm. The model generated by the GBDT algorithm can accurately calculate the concrete in-situ stresses based on the drilling ring parameters and the corresponding stress release values and has a high accuracy and generalization ability.

Form-finding of lifting self-forming GFRP elastic gridshells based on machine learning interpretability methods

  • Soheila, Kookalani;Sandy, Nyunn;Sheng, Xiang
    • Structural Engineering and Mechanics
    • /
    • v.84 no.5
    • /
    • pp.605-618
    • /
    • 2022
  • Glass fiber reinforced polymer (GFRP) elastic gridshells consist of long continuous GFRP tubes that form elastic deformations. In this paper, a method for the form-finding of gridshell structures is presented based on the interpretable machine learning (ML) approaches. A comparative study is conducted on several ML algorithms, including support vector regression (SVR), K-nearest neighbors (KNN), decision tree (DT), random forest (RF), AdaBoost, XGBoost, category boosting (CatBoost), and light gradient boosting machine (LightGBM). A numerical example is presented using a standard double-hump gridshell considering two characteristics of deformation as objective functions. The combination of the grid search approach and k-fold cross-validation (CV) is implemented for fine-tuning the parameters of ML models. The results of the comparative study indicate that the LightGBM model presents the highest prediction accuracy. Finally, interpretable ML approaches, including Shapely additive explanations (SHAP), partial dependence plot (PDP), and accumulated local effects (ALE), are applied to explain the predictions of the ML model since it is essential to understand the effect of various values of input parameters on objective functions. As a result of interpretability approaches, an optimum gridshell structure is obtained and new opportunities are verified for form-finding investigation of GFRP elastic gridshells during lifting construction.

A SVR Based-Pseudo Modified Einstein Procedure Incorporating H-ADCP Model for Real-Time Total Sediment Discharge Monitoring (실시간 총유사량 모니터링을 위한 H-ADCP 연계 수정 아인슈타인 방법의 의사 SVR 모형)

  • Noh, Hyoseob;Son, Geunsoo;Kim, Dongsu;Park, Yong Sung
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.43 no.3
    • /
    • pp.321-335
    • /
    • 2023
  • Monitoring sediment loads in natural rivers is the key process in river engineering, but it is costly and dangerous. In practice, suspended loads are directly measured, and total loads, which is a summation of suspended loads and bed loads, are estimated. This study proposes a real-time sediment discharge monitoring system using the horizontal acoustic Doppler current profiler (H-ADCP) and support vector regression (SVR). The proposed system is comprised of the SVR model for suspended sediment concentration (SVR-SSC) and for total loads (SVR-QTL), respectively. SVR-SSC estimates SSC and SVR-QTL mimics the modified Einstein procedure. The grid search with K-fold cross validation (Grid-CV) and the recursive feature elimination (RFE) were employed to determine SVR's hyperparameters and input variables. The two SVR models showed reasonable cross-validation scores (R2) with 0.885 (SVR-SSC) and 0.860 (SVR-QTL). During the time-series sediment load monitoring period, we successfully detected various sediment transport phenomena in natural streams, such as hysteresis loops and sensitive sediment fluctuations. The newly proposed sediment monitoring system depends only on the gauged features by H-ADCP without additional assumptions in hydraulic variables (e.g., friction slope and suspended sediment size distribution). This method can be applied to any ADCP-installed discharge monitoring station economically and is expected to enhance temporal resolution in sediment monitoring.