• Title/Summary/Keyword: Ensemble Modeling

Search Result 82, Processing Time 0.027 seconds

Estimation of lightweight aggregate concrete characteristics using a novel stacking ensemble approach

  • Kaloop, Mosbeh R.;Bardhan, Abidhan;Hu, Jong Wan;Abd-Elrahman, Mohamed
    • Advances in nano research
    • /
    • v.13 no.5
    • /
    • pp.499-512
    • /
    • 2022
  • This study investigates the efficiency of ensemble machine learning for predicting the lightweight-aggregate concrete (LWC) characteristics. A stacking ensemble (STEN) approach was proposed to estimate the dry density (DD) and 28 days compressive strength (Fc-28) of LWC using two meta-models called random forest regressor (RFR) and extra tree regressor (ETR), and two novel ensemble models called STEN-RFR and STEN-ETR, were constructed. Four standalone machine learning models including artificial neural network, gradient boosting regression, K neighbor regression, and support vector regression were used to compare the performance of the proposed models. For this purpose, a sum of 140 LWC mixtures with 21 influencing parameters for producing LWC with a density less than 1000 kg/m3, were used. Based on the experimental results with multiple performance criteria, it can be concluded that the proposed STEN-ETR model can be used to estimate the DD and Fc-28 of LWC. Moreover, the STEN-ETR approach was found to be a significant technique in prediction DD and Fc-28 of LWC with minimal prediction error. In the validation phase, the accuracy of the proposed STEN-ETR model in predicting DD and Fc-28 was found to be 96.79% and 81.50%, respectively. In addition, the significance of cement, water-cement ratio, silica fume, and aggregate with expanded glass variables is efficient in modeling DD and Fc-28 of LWC.

Evaluation of Ensemble Approach for O3 and PM2.5 Simulation

  • Morino, Yu;Chatani, Satoru;Hayami, Hiroshi;Sasaki, Kansuke;Mori, Yasuaki;Morikawa, Tazuko;Ohara, Toshimasa;Hasegawa, Shuichi;Kobayashi, Shinji
    • Asian Journal of Atmospheric Environment
    • /
    • v.4 no.3
    • /
    • pp.150-156
    • /
    • 2010
  • Inter-comparison of chemical transport models (CTMs) was conducted among four modeling research groups. Model performance of the ensemble approach to $O_3$ and $PM_{2.5}$ simulation was evaluated by using observational data with a time resolution of 1 or 6 hours at four sites in the Kanto area, Japan, in summer 2007. All groups applied the Community Multiscale Air Quality model. The ensemble average of the four CTMs reproduced well the temporal variation of $O_3$ (r=0.65-0.85) and the daily maximum $O_3$ concentration within a factor of 1.3. By contrast, it underestimated $PM_{2.5}$ concentrations by a factor of 1.4-2, and did not reproduce the $PM_{2.5}$ temporal variation at two suburban sites (r=~0.2). The ensemble average improved the simulation of ${SO_4}^{2-}$, ${NO_3}^-$, and ${NH_4}^+$, whose production pathways are well known. In particular, the ensemble approach effectively simulated ${NO_3}^-$, despite the large variability among CTMs (up to a factor of 10). However, the ensemble average did not improve the simulation of organic aerosols (OAs), underestimating their concentrations by a factor of 5. The contribution of OAs to $PM_{2.5}$ (36-39%) was large, so improvement of the OA simulation model is essential to improve the $PM_{2.5}$ simulation.

Seasonal Prediction of Tropical Cyclone Frequency in the Western North Pacific using GDAPS Ensemble Prediction System (GDAPS 앙상블 예보 시스템을 이용한 북서태평양에서의 태풍 발생 계절 예측)

  • Kim, Ji-Sun;Kwon, H. Joe
    • Atmosphere
    • /
    • v.17 no.3
    • /
    • pp.269-279
    • /
    • 2007
  • This study investigates the possibility of seasonal prediction for tropical cyclone activity in the western North Pacific by using a dynamical modeling approach. We use data from the SMIP/HFP (Seasonal Prediction Model Inter-comparison Project/Historical Forecast Project) experiment with the Korea Meteorological Administration's GDAPS (Global Data Assimilation and Prediction System) T106 model, focusing our analysis on model-generated tropical cyclones. It is found that the prediction depends primarily on the tropical cyclone (TC) detecting criteria. Additionally, a scaling factor and a different weighting to each ensemble member are found to be essential for the best predictions of summertime TC activity. This approach indeed shows a certain skill not only in the category forecast but in the standard verifications such as Brier score and relative operating characteristics (ROC).

Accounting for Uncertainty Propagation: Streamflow Forecasting using Multiple Climate and Hydrological Models

  • Kwon, Hyun-Han;Moon, Young-Il;Park, Se-Hoon;Oh, Tae-Suck
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2008.05a
    • /
    • pp.1388-1392
    • /
    • 2008
  • Water resources management depends on dealing inherent uncertainties stemming from climatic and hydrological inputs and models. Dealing with these uncertainties remains a challenge. Streamflow forecasts basically contain uncertainties arising from model structure and initial conditions. Recent enhancements in climate forecasting skill and hydrological modeling provide an breakthrough for delivering improved streamflow forecasts. However, little consideration has been given to methodologies that include coupling both multiple climate and multiple hydrological models, increasing the pool of streamflow forecast ensemble members and accounting for cumulative sources of uncertainty. The approach here proposes integration and coupling of global climate models (GCM), multiple regional climate models, and numerous hydrological models to improve streamflow forecasting and characterize system uncertainty through generation of ensemble forecasts.

  • PDF

Incorporating BERT-based NLP and Transformer for An Ensemble Model and its Application to Personal Credit Prediction

  • Sophot Ky;Ju-Hong Lee;Kwangtek Na
    • Smart Media Journal
    • /
    • v.13 no.4
    • /
    • pp.9-15
    • /
    • 2024
  • Tree-based algorithms have been the dominant methods used build a prediction model for tabular data. This also includes personal credit data. However, they are limited to compatibility with categorical and numerical data only, and also do not capture information of the relationship between other features. In this work, we proposed an ensemble model using the Transformer architecture that includes text features and harness the self-attention mechanism to tackle the feature relationships limitation. We describe a text formatter module, that converts the original tabular data into sentence data that is fed into FinBERT along with other text features. Furthermore, we employed FT-Transformer that train with the original tabular data. We evaluate this multi-modal approach with two popular tree-based algorithms known as, Random Forest and Extreme Gradient Boosting, XGBoost and TabTransformer. Our proposed method shows superior Default Recall, F1 score and AUC results across two public data sets. Our results are significant for financial institutions to reduce the risk of financial loss regarding defaulters.

Improvement of the Ensemble Streamflow Prediction System Using Optimal Linear Correction (최적선형보정을 이용한 앙상블 유량예측 시스템의 개선)

  • Jeong, Dae-Il;Lee, Jae-Kyoung;Kim, Young-Oh
    • Journal of Korea Water Resources Association
    • /
    • v.38 no.6 s.155
    • /
    • pp.471-483
    • /
    • 2005
  • A monthly Ensemble Streamflow Prediction (ESP) system was developed by applying a daily rainfall-runoff model known as the Streamflow Synthesis and Reservoir Regulation (SSARR) model to the Han, Nakdong, and Seomjin River basins in Korea. This study first assesses the accuracy of the averaged monthly runoffs simulated by SSARR for the 3 basins and proposes some improvements. The study found that the SSARR modeling of the Han and Nakdong River basins tended to significantly underestimate the actual runoff levels and the modeling of the Seomjin River basinshowed a large error variance. However, by implementing optimal linear correction (OLC), the accuracy of the SSARR model was considerably improved in predicting averaged monthly runoffs of the Han and Nakdong River basins. This improvement was not seen in the modeling of the Seomjin River basin. In addition, the ESP system was applied to forecast probabilistic runoff forecasts one month in advance for the 3 river basins from 1998 to 2003. Considerably improvement was also achieved with OLC in probabilistic forecasting accuracy for the Han and Nakdong River basins, but not in that of the Seomjin River basin.

Impact of Ensemble Member Size on Confidence-based Selection in Bankruptcy Prediction (부도예측을 위한 확신 기반의 선택 접근법에서 앙상블 멤버 사이즈의 영향에 관한 연구)

  • Kim, Na-Ra;Shin, Kyung-Shik;Ahn, Hyunchul
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.2
    • /
    • pp.55-71
    • /
    • 2013
  • The prediction model is the main factor affecting the performance of a knowledge-based system for bankruptcy prediction. Earlier studies on prediction modeling have focused on the building of a single best model using statistical and artificial intelligence techniques. However, since the mid-1980s, integration of multiple techniques (hybrid techniques) and, by extension, combinations of the outputs of several models (ensemble techniques) have, according to the experimental results, generally outperformed individual models. An ensemble is a technique that constructs a set of multiple models, combines their outputs, and produces one final prediction. The way in which the outputs of ensemble members are combined is one of the important issues affecting prediction accuracy. A variety of combination schemes have been proposed in order to improve prediction performance in ensembles. Each combination scheme has advantages and limitations, and can be influenced by domain and circumstance. Accordingly, decisions on the most appropriate combination scheme in a given domain and contingency are very difficult. This paper proposes a confidence-based selection approach as part of an ensemble bankruptcy-prediction scheme that can measure unified confidence, even if ensemble members produce different types of continuous-valued outputs. The present experimental results show that when varying the number of models to combine, according to the creation type of ensemble members, the proposed combination method offers the best performance in the ensemble having the largest number of models, even when compared with the methods most often employed in bankruptcy prediction.

Evaluation of conceptual rainfall-runoff models for different flow regimes and development of ensemble model (개념적 강우유출 모형의 유량구간별 적합성 평가 및 앙상블 모델 구축)

  • Yu, Jae-Ung;Park, Moon-Hyung;Kim, Jin-Guk;Kwon, Hyun-Han
    • Journal of Korea Water Resources Association
    • /
    • v.54 no.2
    • /
    • pp.105-119
    • /
    • 2021
  • An increase in the frequency and intensity of both floods and droughts has been recently observed due to an increase in climate variability. Especially, land-use change associated with industrial structure and urbanization has led to an imbalance between water supply and demand, acting as a constraint in water resource management. Accurate rainfall-runoff analysis plays a critical role in evaluating water availability in the water budget analysis. This study aimed to explore various continuous rainfall-runoff models over the Soyanggang dam watershed. Moreover, the ensemble modeling framework combining multiple models was introduced to present scenarios on streamflow considering uncertainties. In the ensemble modeling framework, rainfall-runoff models with fewer parameters are generally preferred for effective regionalization. In this study, more than 40 continuous rainfall-runoff models were applied to the Soyanggang dam watershed, and nine rainfall-runoff models were primarily selected using different goodness-of-fit measures. This study confirmed that the ensemble model showed better performance than the individual model over different flow regimes.

Multiscale approach to predict the effective elastic behavior of nanoparticle-reinforced polymer composites

  • Kim, B.R.;Pyo, S.H.;Lemaire, G.;Lee, H.K.
    • Interaction and multiscale mechanics
    • /
    • v.4 no.3
    • /
    • pp.173-185
    • /
    • 2011
  • A multiscale modeling scheme that addresses the influence of the nanoparticle size in nanocomposites consisting of nano-sized spherical particles embedded in a polymer matrix is presented. A micromechanics-based constitutive model for nanoparticle-reinforced polymer composites is derived by incorporating the Eshelby tensor considering the interface effects (Duan et al. 2005a) into the ensemble-volume average method (Ju and Chen 1994). A numerical investigation is carried out to validate the proposed micromechanics-based constitutive model, and a parametric study on the interface moduli is conducted to investigate the effect of interface moduli on the overall behavior of the composites. In addition, molecular dynamics (MD) simulations are performed to determine the mechanical properties of the nanoparticles and polymer. Finally, the overall elastic moduli of the nanoparticle-reinforced polymer composites are estimated using the proposed multiscale approach combining the ensemble-volume average method and the MD simulation. The predictive capability of the proposed multiscale approach has been demonstrated through the multiscale numerical simulations.