• 제목/요약/키워드: hybrid machine learning

검색결과 170건 처리시간 0.021초

경사면의 안정성 모니터링 데이터의 품질관리를 위한 2 단계 접근방안 (Two-Phase Approach for Data Quality Management for Slope Stability Monitoring)

  • 최준혁;김용진;조준휘;정우철;석송희;최송;김용성;지봉준
    • 한국지반신소재학회논문집
    • /
    • 제22권1호
    • /
    • pp.67-74
    • /
    • 2023
  • 경사면의 안정성을 모니터링 하기 위해 데이터 기반으로 사면의 붕괴를 예측, 경보를 하려는 연구가 증가하고 있다. 하지만 대부분의 논문에서는 데이터의 품질에 대해 간과하고 있다. 이는 오경보와 같은 문제를 발생시킬 수 있다. 이에 본 논문에서는 사면에서 수집된 데이터의 품질관리를 위한 규칙과 기계학습 모델로 구성된 2 단계의 접근 방안을 제안하였다. 규칙 기반은 높은 정확도와 직관적인 해석이 가능하다는 장점이 있으며 기계학습 모델은 명시적으로 표현할 수 없는 패턴을 도출할 수 있다는 장점이 있으며 2단계의 접근 방안은 이 두 장점을 모두 취할 수 있었다. 사례연구를 통해 두 방법을 단독으로 사용하였을 경우와 2단계의 접근 방안을 사용하였을 때의 성능을 비교하였고 2단계 접근 방안이 높은 성능을 보이는 것으로 판단되었다. 따라서 데이터의 품질관리를 위해 단독으로 두 방법을 사용하는 것보다 2단계 접근 방안 방법을 사용하는 것이 적절할것으로 판단된다.

A Novel Image Classification Method for Content-based Image Retrieval via a Hybrid Genetic Algorithm and Support Vector Machine Approach

  • Seo, Kwang-Kyu
    • 반도체디스플레이기술학회지
    • /
    • 제10권3호
    • /
    • pp.75-81
    • /
    • 2011
  • This paper presents a novel method for image classification based on a hybrid genetic algorithm (GA) and support vector machine (SVM) approach which can significantly improve the classification performance for content-based image retrieval (CBIR). Though SVM has been widely applied to CBIR, it has some problems such as the kernel parameters setting and feature subset selection of SVM which impact the classification accuracy in the learning process. This study aims at simultaneously optimizing the parameters of SVM and feature subset without degrading the classification accuracy of SVM using GA for CBIR. Using the hybrid GA and SVM model, we can classify more images in the database effectively. Experiments were carried out on a large-size database of images and experiment results show that the classification accuracy of conventional SVM may be improved significantly by using the proposed model. We also found that the proposed model outperformed all the other models such as neural network and typical SVM models.

Data-Driven-Based Beam Selection for Hybrid Beamforming in Ultra-Dense Networks

  • Ju, Sang-Lim;Kim, Kyung-Seok
    • International journal of advanced smart convergence
    • /
    • 제9권2호
    • /
    • pp.58-67
    • /
    • 2020
  • In this paper, we propose a data-driven-based beam selection scheme for massive multiple-input and multiple-output (MIMO) systems in ultra-dense networks (UDN), which is capable of addressing the problem of high computational cost of conventional coordinated beamforming approaches. We consider highly dense small-cell scenarios with more small cells than mobile stations, in the millimetre-wave band. The analog beam selection for hybrid beamforming is a key issue in realizing millimetre-wave UDN MIMO systems. To reduce the computation complexity for the analog beam selection, in this paper, two deep neural network models are used. The channel samples, channel gains, and radio frequency beamforming vectors between the access points and mobile stations are collected at the central/cloud unit that is connected to all the small-cell access points, and are used to train the networks. The proposed machine-learning-based scheme provides an approach for the effective implementation of massive MIMO system in UDN environment.

청소년 건강행태에 따른 정신건강 위험 예측: 하이브리드 머신러닝 방법의 적용 (Predicting Mental Health Risk based on Adolescent Health Behavior: Application of a Hybrid Machine Learning Method)

  • 고은경;전효정;박현태;옥수열
    • 한국학교보건학회지
    • /
    • 제36권3호
    • /
    • pp.113-125
    • /
    • 2023
  • Purpose: The purpose of this study is to develop a model for predicting mental health risk among adolescents based on health behavior information by employing a hybrid machine learning method. Methods: The study analyzed data of 51,850 domestic middle and high school students from 2022 Youth Health Behavior Survey conducted by the Korea Disease Control and Prevention Agency. Firstly, mental health risk levels (stress perception, suicidal thoughts, suicide attempts, suicide plans, experiences of sadness and despair, loneliness, and generalized anxiety disorder) were classified using the k-mean unsupervised learning technique. Secondly, demographic factors (family economic status, gender, age), academic performance, physical health (body mass index, moderate-intensity exercise, subjective health perception, oral health perception), daily life habits (sleep time, wake-up time, smartphone use time, difficulty recovering from fatigue), eating habits (consumption of high-caffeine drinks, sweet drinks, late-night snacks), violence victimization, and deviance (drinking, smoking experience) data were input to develop a random forest model predicting mental health risk, using logistic and XGBoosting. The model and its prediction performance were compared. Results: First, the subjects were classified into two mental health groups using k-mean unsupervised learning, with the high mental health risk group constituting 26.45% of the total sample (13,712 adolescents). This mental health risk group included most of the adolescents who had made suicide plans (95.1%) or attempted suicide (96.7%). Second, the predictive performance of the random forest model for classifying mental health risk groups significantly outperformed that of the reference model (AUC=.94). Predictors of high importance were 'difficulty recovering from daytime fatigue' and 'subjective health perception'. Conclusion: Based on an understanding of adolescent health behavior information, it is possible to predict the mental health risk levels of adolescents and make interventions in advance.

하이브리드 피처 생성 및 딥 러닝 기반 박테리아 세포의 세분화 (Segmentation of Bacterial Cells Based on a Hybrid Feature Generation and Deep Learning)

  • 임선자;칼렙부누누;권기룡;윤성대
    • 한국멀티미디어학회논문지
    • /
    • 제23권8호
    • /
    • pp.965-976
    • /
    • 2020
  • We present in this work a segmentation method of E. coli bacterial images generated via phase contrast microscopy using a deep learning based hybrid feature generation. Unlike conventional machine learning methods that use the hand-crafted features, we adopt the denoising autoencoder in order to generate a precise and accurate representation of the pixels. We first construct a hybrid vector that combines original image, difference of Gaussians and image gradients. The created hybrid features are then given to a deep autoencoder that learns the pixels' internal dependencies and the cells' shape and boundary information. The latent representations learned by the autoencoder are used as the inputs of a softmax classification layer and the direct outputs from the classifier represent the coarse segmentation mask. Finally, the classifier's outputs are used as prior information for a graph partitioning based fine segmentation. We demonstrate that the proposed hybrid vector representation manages to preserve the global shape and boundary information of the cells, allowing to retrieve the majority of the cellular patterns without the need of any post-processing.

Hybrid CNN-SVM Based Seed Purity Identification and Classification System

  • Suganthi, M;Sathiaseelan, J.G.R.
    • International Journal of Computer Science & Network Security
    • /
    • 제22권10호
    • /
    • pp.271-281
    • /
    • 2022
  • Manual seed classification challenges can be overcome using a reliable and autonomous seed purity identification and classification technique. It is a highly practical and commercially important requirement of the agricultural industry. Researchers can create a new data mining method with improved accuracy using current machine learning and artificial intelligence approaches. Seed classification can help with quality making, seed quality controller, and impurity identification. Seeds have traditionally been classified based on characteristics such as colour, shape, and texture. Generally, this is done by experts by visually examining each model, which is a very time-consuming and tedious task. This approach is simple to automate, making seed sorting far more efficient than manually inspecting them. Computer vision technologies based on machine learning (ML), symmetry, and, more specifically, convolutional neural networks (CNNs) have been widely used in related fields, resulting in greater labour efficiency in many cases. To sort a sample of 3000 seeds, KNN, SVM, CNN and CNN-SVM hybrid classification algorithms were used. A model that uses advanced deep learning techniques to categorise some well-known seeds is included in the proposed hybrid system. In most cases, the CNN-SVM model outperformed the comparable SVM and CNN models, demonstrating the effectiveness of utilising CNN-SVM to evaluate data. The findings of this research revealed that CNN-SVM could be used to analyse data with promising results. Future study should look into more seed kinds to expand the use of CNN-SVMs in data processing.

A SE Approach for Machine Learning Prediction of the Response of an NPP Undergoing CEA Ejection Accident

  • Ditsietsi Malale;Aya Diab
    • 시스템엔지니어링학술지
    • /
    • 제19권2호
    • /
    • pp.18-31
    • /
    • 2023
  • Exploring artificial intelligence and machine learning for nuclear safety has witnessed increased interest in recent years. To contribute to this area of research, a machine learning model capable of accurately predicting nuclear power plant response with minimal computational cost is proposed. To develop a robust machine learning model, the Best Estimate Plus Uncertainty (BEPU) approach was used to generate a database to train three models and select the best of the three. The BEPU analysis was performed by coupling Dakota platform with the best estimate thermal hydraulics code RELAP/SCDAPSIM/MOD 3.4. The Code Scaling Applicability and Uncertainty approach was adopted, along with Wilks' theorem to obtain a statistically representative sample that satisfies the USNRC 95/95 rule with 95% probability and 95% confidence level. The generated database was used to train three models based on Recurrent Neural Networks; specifically, Long Short-Term Memory, Gated Recurrent Unit, and a hybrid model with Long Short-Term Memory coupled to Convolutional Neural Network. In this paper, the System Engineering approach was utilized to identify requirements, stakeholders, and functional and physical architecture to develop this project and ensure success in verification and validation activities necessary to ensure the efficient development of ML meta-models capable of predicting of the nuclear power plant response.

약물유전체학에서 약물반응 예측모형과 변수선택 방법 (Feature selection and prediction modeling of drug responsiveness in Pharmacogenomics)

  • 김규환;김원국
    • 응용통계연구
    • /
    • 제34권2호
    • /
    • pp.153-166
    • /
    • 2021
  • 약물유전체학 연구의 주요 목표는 고차원의 유전 변수를 기반으로 개인의 약물 반응성을 예측하는 것이다. 변수의 개수가 많기 때문에 변수의 개수를 줄이기 위해서는 변수 선택이 필요하며, 선택된 변수들은 머신러닝 알고리즘을 사용하여 예측 모델을 구축하는데 사용된다. 본 연구에서는 400명의 뇌전증 환자의 차세대 염기서열 분석 데이터에 로지스틱 회귀, ReliefF, TurF, 랜덤 포레스트, LASSO의 조합과 같은 여러 가지 혼합 변수 선택 방법을 적용하였다. 선택된 변수들에 랜덤포레스트, 그래디언트 부스팅, 서포트벡터머신을 포함한 머신러닝 방법들을 적용했고 스태킹을 통해 앙상블 모형을 구축하였다. 본 연구의 결과는 랜덤포레스트와 ReliefF의 혼합 변수 선택 방법을 이용한 스태킹 모형이 다른 모형보다 더 좋은 성능을 보인다는 것을 보여주었다. 5-폴드 교차 검증을 기반으로 하여 적합한 최적 모형의 평균 검증 정확도는 0.727이고 평균 검증 AUC 값은 0.761로 나타났다. 또한, 동일한 변수를 사용할 때 스태킹 모델이 단일 머신러닝 예측 모델보다 성능이 우수한 것으로 나타났다.

Hybrid machine learning with mode shape assessment for damage identification of plates

  • Pei Yi Siow;Zhi Chao Ong;Shin Yee Khoo;Kok-Sing Lim;Bee Teng Chew
    • Smart Structures and Systems
    • /
    • 제31권5호
    • /
    • pp.485-500
    • /
    • 2023
  • Machine learning-based structural health monitoring (ML-based SHM) methods are researched extensively in the recent decade due to the availability of advanced information and sensing technology. ML methods are well-known for their pattern recognition capability for complex problems. However, the main obstacle of ML-based SHM is that it often requires pre-collected historical data for model training. In most actual scenarios, damage presence can be detected using the unsupervised learning method through anomaly detection, but to further identify the damage types would require prior knowledge or historical events as references. This creates the cold-start problem, especially for new and unobserved structures. Modal-based methods identify damages based on the changes in the structural global properties but often require dense measurements for accurate results. Therefore, a two-stage hybrid modal-machine learning damage detection scheme is proposed. The first stage detects damage presence using Principal Component Analysis-Frequency Response Function (PCA-FRF) in an unsupervised manner, whereas the second stage further identifies the damage. To solve the cold-start problem, mode shape assessment using the first mode is initiated when no trained model is available yet in the second stage. The damage identified by the modal-based method would be stored for future training. This work highlights the performance of the scheme in alleviating the cold-start issue as it transitions through different phases, starting from zero damage sample available. Results showed that single and multiple damages can be identified at an acceptable accuracy level even when training samples are limited.

A comparative study of machine learning methods for automated identification of radioisotopes using NaI gamma-ray spectra

  • Galib, S.M.;Bhowmik, P.K.;Avachat, A.V.;Lee, H.K.
    • Nuclear Engineering and Technology
    • /
    • 제53권12호
    • /
    • pp.4072-4079
    • /
    • 2021
  • This article presents a study on the state-of-the-art methods for automated radioactive material detection and identification, using gamma-ray spectra and modern machine learning methods. The recent developments inspired this in deep learning algorithms, and the proposed method provided better performance than the current state-of-the-art models. Machine learning models such as: fully connected, recurrent, convolutional, and gradient boosted decision trees, are applied under a wide variety of testing conditions, and their advantage and disadvantage are discussed. Furthermore, a hybrid model is developed by combining the fully-connected and convolutional neural network, which shows the best performance among the different machine learning models. These improvements are represented by the model's test performance metric (i.e., F1 score) of 93.33% with an improvement of 2%-12% than the state-of-the-art model at various conditions. The experimental results show that fusion of classical neural networks and modern deep learning architecture is a suitable choice for interpreting gamma spectra data where real-time and remote detection is necessary.