• Title/Summary/Keyword: hyperparameter tuning

Search Result 35, Processing Time 0.026 seconds

Hyperparameter Tuning Based Machine Learning classifier for Breast Cancer Prediction

  • Md. Mijanur Rahman;Asikur Rahman Raju;Sumiea Akter Pinky;Swarnali Akter
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.2
    • /
    • pp.196-202
    • /
    • 2024
  • Currently, the second most devastating form of cancer in people, particularly in women, is Breast Cancer (BC). In the healthcare industry, Machine Learning (ML) is commonly employed in fatal disease prediction. Due to breast cancer's favorable prognosis at an early stage, a model is created to utilize the Dataset on Wisconsin Diagnostic Breast Cancer (WDBC). Conversely, this model's overarching axiom is to compare the effectiveness of five well-known ML classifiers, including Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), K-Nearest Neighbor (KNN), and Naive Bayes (NB) with the conventional method. To counterbalance the effect with conventional methods, the overarching tactic we utilized was hyperparameter tuning utilizing the grid search method, which improved accuracy, secondary precision, third recall, and finally the F1 score. In this study hyperparameter tuning model, the rate of accuracy increased from 94.15% to 98.83% whereas the accuracy of the conventional method increased from 93.56% to 97.08%. According to this investigation, KNN outperformed all other classifiers in terms of accuracy, achieving a score of 98.83%. In conclusion, our study shows that KNN works well with the hyper-tuning method. These analyses show that this study prediction approach is useful in prognosticating women with breast cancer with a viable performance and more accurate findings when compared to the conventional approach.

Analysis of Accuracy and Loss Performance According to Hyperparameter in RNN Model (RNN모델에서 하이퍼파라미터 변화에 따른 정확도와 손실 성능 분석)

  • Kim, Joon-Yong;Park, Koo-Rack
    • Journal of Convergence for Information Technology
    • /
    • v.11 no.7
    • /
    • pp.31-38
    • /
    • 2021
  • In this paper, in order to obtain the optimization of the RNN model used for sentiment analysis, the correlation of each model was studied by observing the trend of loss and accuracy according to hyperparameter tuning. As a research method, after configuring the hidden layer with LSTM and the embedding layer that are most optimized to process sequential data, the loss and accuracy of each model were measured by tuning the unit, batch-size, and embedding size of the LSTM. As a result of the measurement, the loss was 41.9% and the accuracy was 11.4%, and the trend of the optimization model showed a consistently stable graph, confirming that the tuning of the hyperparameter had a profound effect on the model. In addition, it was confirmed that the decision of the embedding size among the three hyperparameters had the greatest influence on the model. In the future, this research will be continued, and research on an algorithm that allows the model to directly find the optimal hyperparameter will continue.

Development of benthic macroinvertebrate species distribution models using the Bayesian optimization (베이지안 최적화를 통한 저서성 대형무척추동물 종분포모델 개발)

  • Go, ByeongGeon;Shin, Jihoon;Cha, Yoonkyung
    • Journal of Korean Society of Water and Wastewater
    • /
    • v.35 no.4
    • /
    • pp.259-275
    • /
    • 2021
  • This study explored the usefulness and implications of the Bayesian hyperparameter optimization in developing species distribution models (SDMs). A variety of machine learning (ML) algorithms, namely, support vector machine (SVM), random forest (RF), boosted regression tree (BRT), XGBoost (XGB), and Multilayer perceptron (MLP) were used for predicting the occurrence of four benthic macroinvertebrate species. The Bayesian optimization method successfully tuned model hyperparameters, with all ML models resulting an area under the curve (AUC) > 0.7. Also, hyperparameter search ranges that generally clustered around the optimal values suggest the efficiency of the Bayesian optimization in finding optimal sets of hyperparameters. Tree based ensemble algorithms (BRT, RF, and XGB) tended to show higher performances than SVM and MLP. Important hyperparameters and optimal values differed by species and ML model, indicating the necessity of hyperparameter tuning for improving individual model performances. The optimization results demonstrate that for all macroinvertebrate species SVM and RF required fewer numbers of trials until obtaining optimal hyperparameter sets, leading to reduced computational cost compared to other ML algorithms. The results of this study suggest that the Bayesian optimization is an efficient method for hyperparameter optimization of machine learning algorithms.

Framework for improving the prediction rate with respect to outdoor thermal comfort using machine learning

  • Jeong, Jaemin;Jeong, Jaewook;Lee, Minsu;Lee, Jaehyun
    • International conference on construction engineering and project management
    • /
    • 2022.06a
    • /
    • pp.119-127
    • /
    • 2022
  • Most of the construction works are conducted outdoors, so the construction workers are affected by weather conditions such as temperature, humidity, and wind velocity which can be evaluated the thermal comfort as environmental factors. In our previous researches, it was found that construction accidents are usually occurred in the discomfort ranges. The safety management, therefore, should be planned in consideration of the thermal comfort and measured by a specialized simulation tool. However, it is very complex, time-consuming, and difficult to model. To address this issue, this study is aimed to develop a framework of a prediction model for improving the prediction accuracy about outdoor thermal comfort considering environmental factors using machine learning algorithms with hyperparameter tuning. This study is done in four steps: i) Establishment of database, ii) Selection of variables to develop prediction model, iii) Development of prediction model; iv) Conducting of hyperparameter tuning. The tree type algorithm is used to develop the prediction model. The results of this study are as follows. First, considering three variables related to environmental factor, the prediction accuracy was 85.74%. Second, the prediction accuracy was 86.55% when considering four environmental factors. Third, after conducting hyperparameter tuning, the prediction accuracy was increased up to 87.28%. This study has several contributions. First, using this prediction model, the thermal comfort can be calculated easily and quickly. Second, using this prediction model, the safety management can be utilized to manage the construction accident considering weather conditions.

  • PDF

Product Nutrition Information System for Visually Impaired People (시각 장애인을 위한 상품 영양 정보 안내 시스템)

  • Jonguk Jung;Je-Kyung Lee;Hyori Kim;Yoosoo Oh
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.18 no.5
    • /
    • pp.233-240
    • /
    • 2023
  • Nutrition information about food is written on the label paper, which is very inconvenient for visually impaired people to recognize. In order to solve the inconvenience of visually impaired people with nutritional information recognition, this paper proposes a product nutrition information guide system for visually impaired people. In the proposed system, user's image data input through UI, and object recognition is carried out through YOLO v5. The proposed system is a system that provides voice guidance on the names and nutrition information of recognized products. This paper constructs a new dataset that augments the 319 classes of canned/late-night snack product image data using rotate matrix techniques, pepper noise, and salt noise techniques. The proposed system compared and analyzed the performance of YOLO v5n, YOLO v5m, and YOLO v5l models through hyperparameter tuning and learned the dataset built with YOLO v5n models. This paper compares and analyzes the performance of the proposed system with that of previous studies.

Application of Time-series Cross Validation in Hyperparameter Tuning of a Predictive Model for 2,3-BDO Distillation Process (시계열 교차검증을 적용한 2,3-BDO 분리공정 온도예측 모델의 초매개변수 최적화)

  • An, Nahyeon;Choi, Yeongryeol;Cho, Hyungtae;Kim, Junghwan
    • Korean Chemical Engineering Research
    • /
    • v.59 no.4
    • /
    • pp.532-541
    • /
    • 2021
  • Recently, research on the application of artificial intelligence in the chemical process has been increasing rapidly. However, overfitting is a significant problem that prevents the model from being generalized well to predict unseen data on test data, as well as observed training data. Cross validation is one of the ways to solve the overfitting problem. In this study, the time-series cross validation method was applied to optimize the number of batch and epoch in the hyperparameters of the prediction model for the 2,3-BDO distillation process, and it compared with K-fold cross validation generally used. As a result, the RMSE of the model with time-series cross validation was lower by 9.06%, and the MAPE was higher by 0.61% than the model with K-fold cross validation. Also, the calculation time was 198.29 sec less than the K-fold cross validation method.

Investigation of pile group response to adjacent twin tunnel excavation utilizing machine learning

  • Su-Bin Kim;Dong-Wook Oh;Hyeon-Jun Cho;Yong-Joo Lee
    • Geomechanics and Engineering
    • /
    • v.38 no.5
    • /
    • pp.517-528
    • /
    • 2024
  • For numerous tunnelling projects implemented in urban areas due to limited space, it is crucial to take into account the interaction between the foundation, ground, and tunnel. In predicting the deformation of piled foundations and the ground during twin tunnel excavation, it is essential to consider various factors. Therefore, this study derived a prediction model for pile group settlement using machine learning to analyze the importance of various factors that determine the settlement of piled foundations during twin tunnelling. Laboratory model tests and numerical analysis were utilized as input data for machine learning. The influence of each independent variable on the prediction model was analyzed. Machine learning techniques such as data preprocessing, feature engineering, and hyperparameter tuning were used to improve the performance of the prediction model. Machine learning models, employing Random Forest (RF), eXtreme Gradient Boosting (XGB), and Light Gradient Boosting Machine (LightGBM, LGB) algorithms, demonstrate enhanced performance after hyperparameter tuning, particularly with LGB achieving an R2 of 0.9782 and RMSE value of 0.0314. The feature importance in the prediction models was analyzed and PN was the highest at 65.04% for RF, 64.81% for XGB, and PCTC (distance between the center of piles) was the highest at 31.32% for LGB. SHAP was utilized for analyzing the impact of each variable. PN (the number of piles) consistently exerted the most influence on the prediction of pile group settlement across all models. The results from both laboratory model tests and numerical analysis revealed a reduction in ground displacement with varying pillar spacing in twin tunnels. However, upon further investigation through machine learning with additional variables, it was found that the number of piles has the most significant impact on ground displacement. Nevertheless, as this study is based on laboratory model testing, further research considering real field conditions is necessary. This study contributes to a better understanding of the complex interactions inherent in twin tunnelling projects and provides a reliable tool for predicting pile group settlement in such scenarios.

Optimizing Artificial Neural Network-Based Models to Predict Rice Blast Epidemics in Korea

  • Lee, Kyung-Tae;Han, Juhyeong;Kim, Kwang-Hyung
    • The Plant Pathology Journal
    • /
    • v.38 no.4
    • /
    • pp.395-402
    • /
    • 2022
  • To predict rice blast, many machine learning methods have been proposed. As the quality and quantity of input data are essential for machine learning techniques, this study develops three artificial neural network (ANN)-based rice blast prediction models by combining two ANN models, the feed-forward neural network (FFNN) and long short-term memory, with diverse input datasets, and compares their performance. The Blast_Weathe long short-term memory r_FFNN model had the highest recall score (66.3%) for rice blast prediction. This model requires two types of input data: blast occurrence data for the last 3 years and weather data (daily maximum temperature, relative humidity, and precipitation) between January and July of the prediction year. This study showed that the performance of an ANN-based disease prediction model was improved by applying suitable machine learning techniques together with the optimization of hyperparameter tuning involving input data. Moreover, we highlight the importance of the systematic collection of long-term disease data.

BERT-Based Logits Ensemble Model for Gender Bias and Hate Speech Detection

  • Sanggeon Yun;Seungshik Kang;Hyeokman Kim
    • Journal of Information Processing Systems
    • /
    • v.19 no.5
    • /
    • pp.641-651
    • /
    • 2023
  • Malicious hate speech and gender bias comments are common in online communities, causing social problems in our society. Gender bias and hate speech detection has been investigated. However, it is difficult because there are diverse ways to express them in words. To solve this problem, we attempted to detect malicious comments in a Korean hate speech dataset constructed in 2020. We explored bidirectional encoder representations from transformers (BERT)-based deep learning models utilizing hyperparameter tuning, data sampling, and logits ensembles with a label distribution. We evaluated our model in Kaggle competitions for gender bias, general bias, and hate speech detection. For gender bias detection, an F1-score of 0.7711 was achieved using an ensemble of the Soongsil-BERT and KcELECTRA models. The general bias task included the gender bias task, and the ensemble model achieved the best F1-score of 0.7166.

Anomaly Detection System for Solar Power Distribution Panels utilizing Thermal Images

  • Kwang-Seong Shin;Jong-Chan Kim;Seong-Yoon Shin
    • Journal of information and communication convergence engineering
    • /
    • v.22 no.2
    • /
    • pp.159-164
    • /
    • 2024
  • This study aimed to develop an advanced anomaly-detection system tailored for solar power distribution panels using thermal imaging cameras to ensure operational stability. It addresses the imperative shift toward digitalized safety management in electrical facilities, transcending the limitations of conventional empirical methodologies. Our proposed system leverages a faster R-CNN-based artificial intelligence model optimized through meticulous hyperparameter tuning to efficiently detect anomalies in distribution panels. Through comprehensive experimentation, we validated the efficacy of the system in accurately identifying anomalies, thereby propelling safety protocols forward during the fourth industrial revolution. This study signifies a significant stride toward fortifying the integrity and resilience of solar power distribution systems, which is pivotal for adapting to emerging technological paradigms and evolving safety standards in the energy sector. These findings offer valuable insights for enhancing the reliability and efficiency of safety management practices and fostering a safer and more sustainable energy landscape.