• 제목/요약/키워드: Ensemble approach

검색결과 169건 처리시간 0.026초

머신러닝을 활용한 모돈의 생산성 예측모델 (Forecasting Sow's Productivity using the Machine Learning Models)

  • 이민수;최영찬
    • 농촌지도와개발
    • /
    • 제16권4호
    • /
    • pp.939-965
    • /
    • 2009
  • The Machine Learning has been identified as a promising approach to knowledge-based system development. This study aims to examine the ability of machine learning techniques for farmer's decision making and to develop the reference model for using pig farm data. We compared five machine learning techniques: logistic regression, decision tree, artificial neural network, k-nearest neighbor, and ensemble. All models are well performed to predict the sow's productivity in all parity, showing over 87.6% predictability. The model predictability of total litter size are highest at 91.3% in third parity and decreasing as parity increases. The ensemble is well performed to predict the sow's productivity. The neural network and logistic regression is excellent classifier for all parity. The decision tree and the k-nearest neighbor was not good classifier for all parity. Performance of models varies over models used, showing up to 104% difference in lift values. Artificial Neural network and ensemble models have resulted in highest lift values implying best performance among models.

  • PDF

An Ensemble Classifier using Two Dimensional LDA

  • Park, Cheong-Hee
    • 한국멀티미디어학회논문지
    • /
    • 제13권6호
    • /
    • pp.817-824
    • /
    • 2010
  • Linear Discriminant Analysis (LDA) has been successfully applied for dimension reduction in face recognition. However, LDA requires the transformation of a face image to a one-dimensional vector and this process can cause the correlation information among neighboring pixels to be disregarded. On the other hand, 2D-LDA uses 2D images directly without a transformation process and it has been shown to be superior to the traditional LDA. Nevertheless, there are some problems in 2D-LDA. First, it is difficult to determine the optimal number of feature vectors in a reduced dimensional space. Second, the size of rectangular windows used in 2D-LDA makes strong impacts on classification accuracies but there is no reliable way to determine an optimal window size. In this paper, we propose a new algorithm to overcome those problems in 2D-LDA. We adopt an ensemble approach which combines several classifiers obtained by utilizing various window sizes. And a practical method to determine the number of feature vectors is also presented. Experimental results demonstrate that the proposed method can overcome the difficulties with choosing an optimal window size and the number of feature vectors.

Accounting for Uncertainty Propagation: Streamflow Forecasting using Multiple Climate and Hydrological Models

  • 권현한;문영일;박세훈;오태석
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2008년도 학술발표회 논문집
    • /
    • pp.1388-1392
    • /
    • 2008
  • Water resources management depends on dealing inherent uncertainties stemming from climatic and hydrological inputs and models. Dealing with these uncertainties remains a challenge. Streamflow forecasts basically contain uncertainties arising from model structure and initial conditions. Recent enhancements in climate forecasting skill and hydrological modeling provide an breakthrough for delivering improved streamflow forecasts. However, little consideration has been given to methodologies that include coupling both multiple climate and multiple hydrological models, increasing the pool of streamflow forecast ensemble members and accounting for cumulative sources of uncertainty. The approach here proposes integration and coupling of global climate models (GCM), multiple regional climate models, and numerous hydrological models to improve streamflow forecasting and characterize system uncertainty through generation of ensemble forecasts.

  • PDF

Anisotropic absorption of CdSe/ZnS quantum rods embedded in polymer film

  • Mukhina, Maria V.;Maslov, Vladimir G.;Baranov, Alexander V.;Artemyev, Mikhail V.;Fedorov, Anatoly V.
    • Advances in nano research
    • /
    • 제1권3호
    • /
    • pp.153-158
    • /
    • 2013
  • An approach to achieving of spatially homogeneous, ordered ensemble of semiconductor quantum rods in polymer film of polyvinyl butyral is reported. The CdSe/ZnS quantum rods are embedded to the polymer film. Obtained film is stretched up to four times to its initial length. A concentration of quantum rods in the samples is around $2{\times}10^{-5}$ M. The absorption spectra, obtained in the light with orthogonal polarization, confirm the occurrence of spatial ordering in a quantum rod ensemble. Anisotropy of the optical properties in the ordered quantum rod ensemble is examined. The presented method can be used as a low-cost solution for preparing the nanostructured materials with anisotropic properties and high concentration of nanocrystals.

인공신경망 앙상블을 이용한 옵션 투자예측 시스템 (A Forecasting System for KOSPI 200 Option Trading using Artificial Neural Network Ensemble)

  • 이재식;송영균;허성회
    • 한국지능정보시스템학회:학술대회논문집
    • /
    • 한국지능정보시스템학회 2000년도 추계정기학술대회:지능형기술과 CRM
    • /
    • pp.489-497
    • /
    • 2000
  • After IMF situation, the money market environment is changing rapidly. Therefore, many companies including financial institutions and many individual investors are concerned about forecasting the money market, and they make an effort to insure the various profit and hedge methods using derivatives like option, futures and swap. In this research, we developed a prototype of forecasting system for KOSPI 200 option, especially call option, trading using artificial neural networks(ANN), To avoid the overfitting problem and the problem involved int the choice of ANN structure and parameters, we employed the ANN ensemble approach. We conducted two types of simulation. One is conducted with the hold signals taken into account, and the other is conducted without hold signals. Even though our models show low accuracy for the sample set extracted from the data collected in the early stage of IMF situation, they perform better in terms of profit and stability than the model that uses only the theoretical price.

  • PDF

An Ensemble Model for Credit Default Discrimination: Incorporating BERT-based NLP and Transformer

  • Sophot Ky;Ju-Hong Lee
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2023년도 춘계학술발표대회
    • /
    • pp.624-626
    • /
    • 2023
  • Credit scoring is a technique used by financial institutions to assess the creditworthiness of potential borrowers. This involves evaluating a borrower's credit history to predict the likelihood of defaulting on a loan. This paper presents an ensemble of two Transformer based models within a framework for discriminating the default risk of loan applications in the field of credit scoring. The first model is FinBERT, a pretrained NLP model to analyze sentiment of financial text. The second model is FT-Transformer, a simple adaptation of the Transformer architecture for the tabular domain. Both models are trained on the same underlying data set, with the only difference being the representation of the data. This multi-modal approach allows us to leverage the unique capabilities of each model and potentially uncover insights that may not be apparent when using a single model alone. We compare our model with two famous ensemble-based models, Random Forest and Extreme Gradient Boosting.

Incorporating BERT-based NLP and Transformer for An Ensemble Model and its Application to Personal Credit Prediction

  • Sophot Ky;Ju-Hong Lee;Kwangtek Na
    • 스마트미디어저널
    • /
    • 제13권4호
    • /
    • pp.9-15
    • /
    • 2024
  • Tree-based algorithms have been the dominant methods used build a prediction model for tabular data. This also includes personal credit data. However, they are limited to compatibility with categorical and numerical data only, and also do not capture information of the relationship between other features. In this work, we proposed an ensemble model using the Transformer architecture that includes text features and harness the self-attention mechanism to tackle the feature relationships limitation. We describe a text formatter module, that converts the original tabular data into sentence data that is fed into FinBERT along with other text features. Furthermore, we employed FT-Transformer that train with the original tabular data. We evaluate this multi-modal approach with two popular tree-based algorithms known as, Random Forest and Extreme Gradient Boosting, XGBoost and TabTransformer. Our proposed method shows superior Default Recall, F1 score and AUC results across two public data sets. Our results are significant for financial institutions to reduce the risk of financial loss regarding defaulters.

마코프 체인 몬테카를로 및 앙상블 칼만필터와 연계된 추계학적 단순 수문분할모형 (Stochastic Simple Hydrologic Partitioning Model Associated with Markov Chain Monte Carlo and Ensemble Kalman Filter)

  • 최정현;이옥정;원정은;김상단
    • 한국물환경학회지
    • /
    • 제36권5호
    • /
    • pp.353-363
    • /
    • 2020
  • Hydrologic models can be classified into two types: those for understanding physical processes and those for predicting hydrologic quantities. This study deals with how to use the model to predict today's stream flow based on the system's knowledge of yesterday's state and the model parameters. In this regard, for the model to generate accurate predictions, the uncertainty of the parameters and appropriate estimates of the state variables are required. In this study, a relatively simple hydrologic partitioning model is proposed that can explicitly implement the hydrologic partitioning process, and the posterior distribution of the parameters of the proposed model is estimated using the Markov chain Monte Carlo approach. Further, the application method of the ensemble Kalman filter is proposed for updating the normalized soil moisture, which is the state variable of the model, by linking the information on the posterior distribution of the parameters and by assimilating the observed steam flow data. The stochastically and recursively estimated stream flows using the data assimilation technique revealed better representation of the observed data than the stream flows predicted using the deterministic model. Therefore, the ensemble Kalman filter in conjunction with the Markov chain Monte Carlo approach could be a reliable and effective method for forecasting daily stream flow, and it could also be a suitable method for routinely updating and monitoring the watershed-averaged soil moisture.

One Step Measurements of hippocampal Pure Volumes from MRI Data Using an Ensemble Model of 3-D Convolutional Neural Network

  • Basher, Abol;Ahmed, Samsuddin;Jung, Ho Yub
    • 스마트미디어저널
    • /
    • 제9권2호
    • /
    • pp.22-32
    • /
    • 2020
  • The hippocampal volume atrophy is known to be linked with neuro-degenerative disorders and it is also one of the most important early biomarkers for Alzheimer's disease detection. The measurements of hippocampal pure volumes from Magnetic Resonance Imaging (MRI) is a crucial task and state-of-the-art methods require a large amount of time. In addition, the structural brain development is investigated using MRI data, where brain morphometry (e.g. cortical thickness, volume, surface area etc.) study is one of the significant parts of the analysis. In this study, we have proposed a patch-based ensemble model of 3-D convolutional neural network (CNN) to measure the hippocampal pure volume from MRI data. The 3-D patches were extracted from the volumetric MRI scans to train the proposed 3-D CNN models. The trained models are used to construct the ensemble 3-D CNN model and the aggregated model predicts the pure volume in one-step in the test phase. Our approach takes only 5 seconds to estimate the volumes from an MRI scan. The average errors for the proposed ensemble 3-D CNN model are 11.7±8.8 (error%±STD) and 12.5±12.8 (error%±STD) for the left and right hippocampi of 65 test MRI scans, respectively. The quantitative study on the predicted volumes over the ground truth volumes shows that the proposed approach can be used as a proxy.

A Jittering-based Neural Network Ensemble Approach for Regionalized Low-flow Frequency Analysis

  • Ahn, Kuk-Hyun
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2020년도 학술발표회
    • /
    • pp.382-382
    • /
    • 2020
  • 과거 많은 연구에서 다수의 모형의 결과를 이용한 앙상블 방법론은 인공지능 모형 (artificial neural network)의 예측 능력에 향상을 갖고 온다 논하였다. 본 연구에서는 미계측유역의 저수량(low flow)의 예측을 위하여 Jittering을 기반으로 한 인공지능 모형을 제시하고자 한다. 기본적인 방법론은 설명변수들에게 백색 잡음(white noise)를 삽입하여 훈련되는 자료를 증가시키는 것이다. Jittering을 기반으로 한 인공지능 모형에 대한 효과를 검증하기 위하여 본 연구에서는 Multi-output neural network model을 기반으로 모형을 구축하였다. 다음으로 Jittering을 기반으로 한 앙상블 모형을 variable importance measuring algorithm과 결합시켜서 유역특성치와 예측되는 저수량의 특성치들의 관계를 추론하였다. 본 연구에서 사용되는 방법론들의 효용성을 평가하기 위해서 미동북부에 위치하고 있는 총 207개의 유역을 사용하였다. 결과적으로 본 연구에서 제시한 Jittering을 기반으로 한 인공지능 앙상블 모형은 단일예측모형 (single modeling approach)을 정확도 측면에서 우수한 것으로 확인되었다. 또한, 적은 숫자의 앙상블 모형에서도 그 정확성이 단일예측모형보다 우수한 것을 확인하였다. 마지막으로 본 연구에서는 유역특성치들의 효과가 살펴보고자 하는 저수량의 특성치들에 따라서 일관적으로 영향을 미치거나 그 중요도가 변화하는 것을 확인하였다.

  • PDF