Search | Korea Science

A New Ensemble Machine Learning Technique with Multiple Stacking (다중 스태킹을 가진 새로운 앙상블 학습 기법)

Lee, Su-eun;Kim, Han-joon
- The Journal of Society for e-Business Studies
- /
- v.25 no.3
- /
- pp.1-13
- /
- 2020
Machine learning refers to a model generation technique that can solve specific problems from the generalization process for given data. In order to generate a high performance model, high quality training data and learning algorithms for generalization process should be prepared. As one way of improving the performance of model to be learned, the Ensemble technique generates multiple models rather than a single model, which includes bagging, boosting, and stacking learning techniques. This paper proposes a new Ensemble technique with multiple stacking that outperforms the conventional stacking technique. The learning structure of multiple stacking ensemble technique is similar to the structure of deep learning, in which each layer is composed of a combination of stacking models, and the number of layers get increased so as to minimize the misclassification rate of each layer. Through experiments using four types of datasets, we have showed that the proposed method outperforms the exiting ones.
https://doi.org/10.7838/jsebs.2020.25.3.001 인용 PDF KSCI

Development of Product Recommender System using Collaborative Filtering and Stacking Model (협업필터링과 스태킹 모형을 이용한 상품추천시스템 개발)

Park, Sung-Jong;Kim, Young-Min;Ahn, Jae-Joon
- Journal of Convergence for Information Technology
- /
- v.9 no.6
- /
- pp.83-90
- /
- 2019
People constantly strive for better choices. For this reason, recommender system has been developed since the early 1990s. In particular, collaborative filtering technique has shown excellent performance in the field of recommender systems, and research of recommender system using machine learning has been actively conducted. This study constructs recommender system using collaborative filtering and machine learning based on stacking model which is one of ensemble methods. The results of this study confirm that the recommender system with the stacking model is useful in aspects of recommender performance. In the future, the model proposed in this study is expected to help individuals or firms to make better choices.
https://doi.org/10.22156/CS4SMB.2019.9.6.083 인용 PDF KSCI HTML

Automatic Multi-layer Stacking Ensemble Generation Technique for Predicting Diabetes Mellitus Incidence (당뇨병 발생 예측을 위한 다층 스태킹 앙상블 모델 구축 기법)

Ayeong Seong;Sohyun Yun;Suyeon Kang;Gun-Woo Kim
- Proceedings of the Korea Information Processing Society Conference
- /
- 2023.11a
- /
- pp.426-427
- /
- 2023
최근 현대인의 식습관 및 고령화로 인해 당뇨병 환자의 수가 연간 증가하고 있다. 따라서 현재는 아직 당뇨병이 발생하지 않았더라도 미래에 발생할 가능성 예측의 중요성이 커지고 있다. 기존의 당뇨병 발생 여부 진단 연구는 회귀 분석과 같은 단일 모델을 사용하여 수행된다. 그러나 당뇨병에 영향을 미치는 변수들은 복잡하게 얽혀있어 단일 모델만으로는 패턴을 충분히 학습하기 어렵다. 본 논문에서는 데이터에 적합하게 자동으로 다층 스태킹 앙상블 모델을 구성하는 알고리즘을 이용한 다층 스태킹 앙상블 모델을 제안한다. 제안하는 방법은 성능이 높은 모델들을 기준으로 층을 쌓으며 모델을 구성하며 실험 결과 다른 자동 기계학습 라이브러리와 비교해 F1 score 기준으로 최대 12.89%p의 성능 향상을 보였다.
https://doi.org/10.3745/PKIPS.y2023m11a.426 인용 PDF

Improved Estimation of Hourly Surface Ozone Concentrations using Stacking Ensemble-based Spatial Interpolation (스태킹 앙상블 모델을 이용한 시간별 지상 오존 공간내삽 정확도 향상)

KIM, Ye-Jin;KANG, Eun-Jin;CHO, Dong-Jin;LEE, Si-Woo;IM, Jung-Ho
- Journal of the Korean Association of Geographic Information Studies
- /
- v.25 no.3
- /
- pp.74-99
- /
- 2022
Surface ozone is produced by photochemical reactions of nitrogen oxides(NOx) and volatile organic compounds(VOCs) emitted from vehicles and industrial sites, adversely affecting vegetation and the human body. In South Korea, ozone is monitored in real-time at stations(i.e., point measurements), but it is difficult to monitor and analyze its continuous spatial distribution. In this study, surface ozone concentrations were interpolated to have a spatial resolution of 1.5km every hour using the stacking ensemble technique, followed by a 5-fold cross-validation. Base models for the stacking ensemble were cokriging, multi-linear regression(MLR), random forest(RF), and support vector regression(SVR), while MLR was used as the meta model, having all base model results as additional input variables. The results showed that the stacking ensemble model yielded the better performance than the individual base models, resulting in an averaged R of 0.76 and RMSE of 0.0065ppm during the study period of 2020. The surface ozone concentration distribution generated by the stacking ensemble model had a wider range with a spatial pattern similar with terrain and urbanization variables, compared to those by the base models. Not only should the proposed model be capable of producing the hourly spatial distribution of ozone, but it should also be highly applicable for calculating the daily maximum 8-hour ozone concentrations.
https://doi.org/10.11108/kagis.2022.25.3.074 인용 PDF KSCI

Development of a High-Performance Concrete Compressive-Strength Prediction Model Using an Ensemble Machine-Learning Method Based on Bagging and Stacking (배깅 및 스태킹 기반 앙상블 기계학습법을 이용한 고성능 콘크리트 압축강도 예측모델 개발)

Yun-Ji Kwak;Chaeyeon Go;Shinyoung Kwag;Seunghyun Eem
- Journal of the Computational Structural Engineering Institute of Korea
- /
- v.36 no.1
- /
- pp.9-18
- /
- 2023
Predicting the compressive strength of high-performance concrete (HPC) is challenging because of the use of additional cementitious materials; thus, the development of improved predictive models is essential. The purpose of this study was to develop an HPC compressive-strength prediction model using an ensemble machine-learning method of combined bagging and stacking techniques. The result is a new ensemble technique that integrates the existing ensemble methods of bagging and stacking to solve the problems of a single machine-learning model and improve the prediction performance of the model. The nonlinear regression, support vector machine, artificial neural network, and Gaussian process regression approaches were used as single machine-learning methods and bagging and stacking techniques as ensemble machine-learning methods. As a result, the model of the proposed method showed improved accuracy results compared with single machine-learning models, an individual bagging technique model, and a stacking technique model. This was confirmed through a comparison of four representative performance indicators, verifying the effectiveness of the method.
https://doi.org/10.7734/COSEIK.2023.36.1.9 인용 PDF

CV-based malicious URL detection ensemble stacking model (CV 기반 악성 URL 탐지 앙상블 스태킹 모델)

Jong-Ho Lee;Yong-Tae Shin
- Proceedings of the Korea Information Processing Society Conference
- /
- 2024.05a
- /
- pp.846-849
- /
- 2024
다양한 분야에서 QR 코드가 급속도로 확산되면서, QR 코드를 악용하여 사용자를 악성 웹사이트로 리디렉션하는 '큐싱(Qshing)'이라는 새로운 형태의 사이버 범죄가 등장했다. 이에 본 연구에서는 일반화 성능을 향상시키기 위해 교차 검증(CV)을 활용하여 QR 코드 스캔과 관련된 악성 URL을 탐지하도록 설계된 스태킹 앙상블 모델을 제안한다. 이러한 통합은 실제 애플리케이션에서 높은 성능을 기대할 수 있도록 설계되었다. 본 연구는 이 모델이 기존의 연구보다 QR 코드 관련 사이버 위협에 대처하는 보다 효과적인 수단을 제공할 것으로 기대한다.
https://doi.org/10.3745/PKIPS.y2024m05a.846 인용 PDF

Feature selection and prediction modeling of drug responsiveness in Pharmacogenomics (약물유전체학에서 약물반응 예측모형과 변수선택 방법)

Kim, Kyuhwan;Kim, Wonkuk
- The Korean Journal of Applied Statistics
- /
- v.34 no.2
- /
- pp.153-166
- /
- 2021
A main goal of pharmacogenomics studies is to predict individual's drug responsiveness based on high dimensional genetic variables. Due to a large number of variables, feature selection is required in order to reduce the number of variables. The selected features are used to construct a predictive model using machine learning algorithms. In the present study, we applied several hybrid feature selection methods such as combinations of logistic regression, ReliefF, TurF, random forest, and LASSO to a next generation sequencing data set of 400 epilepsy patients. We then applied the selected features to machine learning methods including random forest, gradient boosting, and support vector machine as well as a stacking ensemble method. Our results showed that the stacking model with a hybrid feature selection of random forest and ReliefF performs better than with other combinations of approaches. Based on a 5-fold cross validation partition, the mean test accuracy value of the best model was 0.727 and the mean test AUC value of the best model was 0.761. It also appeared that the stacking models outperform than single machine learning predictive models when using the same selected features.
https://doi.org/10.5351/KJAS.2021.34.2.153 인용 PDF KSCI

Enhancing Autonomous Vehicle RADAR Performance Prediction Model Using Stacking Ensemble (머신러닝 스태킹 앙상블을 이용한 자율주행 자동차 RADAR 성능 향상)

Si-yeon Jang;Hye-lim Choi;Yun-ju Oh
- Journal of Internet Computing and Services
- /
- v.25 no.2
- /
- pp.21-28
- /
- 2024
Radar is an essential sensor component in autonomous vehicles, and the market for radar applications in this context is steadily expanding with a growing variety of products. In this study, we aimed to enhance the stability and performance of radar systems by developing and evaluating a radar performance prediction model that can predict radar defects. We selected seven machine learning and deep learning algorithms and trained the model with a total of 49 input data types. Ultimately, when we employed an ensemble of 17 models, it exhibited the highest performance. We anticipate that these research findings will assist in predicting product defects at the production stage, thereby maximizing production yield and minimizing the costs associated with defective products.
https://doi.org/10.7472/jksii.2024.25.2.21 인용 PDF HTML

Time-Direction Stacking Method for a Single-Station Azimuth Estimation (단일지진관측 방위각 결정을 위한 시간-방향 스태킹 방법)

김소구;우종량;가오푸천
- The Journal of Engineering Geology
- /
- v.5 no.3
- /
- pp.331-337
- /
- 1995
In estimating the azimuth of regional earthquakes with single -station three - component data, in some cases the result is dependent on the selection of waveforms, making the measurement subjective and inconvenient in automatic detection. In this paper an alternative approach is proposed in which the azimuth is measured from quite a long wave train by time - direction stacking technique. Test with digital waveform data from Korean seimic stations shows that the simple algorithm seems to be able to give a better estima- tion of azimuth of earthquakes at regional distances.
PDF

Stacking Kernel Ridge Regression Network for Smart Phone's Touch-Stroke Continuous Authentication (스마트 폰의 터치 스트로크 지속적 인증을 위한 스태킹 커널 릿지 리그레션 네트워크)

Chang, Inho;Teoh, Andrew Beng-Jin
- Proceedings of the Korea Information Processing Society Conference
- /
- 2018.05a
- /
- pp.381-383
- /
- 2018
이 논문은 스마트 폰에서 터치 스트로크를 이용하여 지속적 인증을 할 수 있는 딥 러닝 네트워크인 스태킹 커널 릿지 리그레션 네트워크 (Stacking Kernel Ridge Regression Network: SKRRN)에 대한 연구이다. SKRRN 은 여러 개의 커널 릿지 리그레션 (Kernel Ridge Regression: KRR) 으로 구성되어있고, 계층적이며 모든 KRR 은 해석적이고 독립적으로 훈련된다. SKRRN 은 다른 딥 러닝 네트워크와는 다르게 비가공 터치 스트로크 데이터로부터 특징을 배우지 않고 Hand-Crafted 피처와 같이 추출된 데이터로부터 재학습을 한다. 이러한 재학습은 기존 데이터 셋을 더 구별 하기 쉽고 풍부하게 만들어준다. SKRRN 은 HMOG 데이터 셋을 사용하여 4.295%의 동일 오류율을 달성하였다.
https://doi.org/10.3745/PKIPS.y2018m05a.381 인용 PDF

Search Result 43, Processing Time 0.031 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)