• Title/Summary/Keyword: Model Ensemble

Search Result 638, Processing Time 0.031 seconds

A Development of a Tailored Follow up Management Model Using the Data Mining Technique on Hypertension (데이터마이닝 기법을 활용한 맞춤형 고혈압 사후관리 모형 개발)

  • Park, Il-Su;Yong, Wang-Sik;Kim, Yu-Mi;Kang, Sung-Hong;Han, Jun-Tae
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.4
    • /
    • pp.639-647
    • /
    • 2008
  • This study used the characteristics of the knowledge discovery and data mining algorithms to develop tailored hypertension follow up management model - hypertension care predictive model and hypertension care compliance segmentation model - for hypertension management using the Korea National Health Insurance Corporation database(the insureds’ screening and health care benefit data). This study validated the predictive power of data mining algorithms by comparing the performance of logistic regression, decision tree, and ensemble technique. On the basis of internal and external validation, it was found that the model performance of logistic regression method was the best among the above three techniques on hypertension care predictive model and hypertension care compliance segmentation model was developed by Decision tree analysis. This study produced several factors affecting the outbreak of hypertension using screening. It is considered to be a contributing factor towards the nation’s building of a Hypertension follow up Management System in the near future by bringing forth representative results on the rise and care of hypertension.

Monitoring Ground-level SO2 Concentrations Based on a Stacking Ensemble Approach Using Satellite Data and Numerical Models (위성 자료와 수치모델 자료를 활용한 스태킹 앙상블 기반 SO2 지상농도 추정)

  • Choi, Hyunyoung;Kang, Yoojin;Im, Jungho;Shin, Minso;Park, Seohui;Kim, Sang-Min
    • Korean Journal of Remote Sensing
    • /
    • v.36 no.5_3
    • /
    • pp.1053-1066
    • /
    • 2020
  • Sulfur dioxide (SO2) is primarily released through industrial, residential, and transportation activities, and creates secondary air pollutants through chemical reactions in the atmosphere. Long-term exposure to SO2 can result in a negative effect on the human body causing respiratory or cardiovascular disease, which makes the effective and continuous monitoring of SO2 crucial. In South Korea, SO2 monitoring at ground stations has been performed, but this does not provide spatially continuous information of SO2 concentrations. Thus, this research estimated spatially continuous ground-level SO2 concentrations at 1 km resolution over South Korea through the synergistic use of satellite data and numerical models. A stacking ensemble approach, fusing multiple machine learning algorithms at two levels (i.e., base and meta), was adopted for ground-level SO2 estimation using data from January 2015 to April 2019. Random forest and extreme gradient boosting were used as based models and multiple linear regression was adopted for the meta-model. The cross-validation results showed that the meta-model produced the improved performance by 25% compared to the base models, resulting in the correlation coefficient of 0.48 and root-mean-square-error of 0.0032 ppm. In addition, the temporal transferability of the approach was evaluated for one-year data which were not used in the model development. The spatial distribution of ground-level SO2 concentrations based on the proposed model agreed with the general seasonality of SO2 and the temporal patterns of emission sources.

Comparative Analysis of Subsurface Estimation Ability and Applicability Based on Various Geostatistical Model (다양한 지구통계기법의 지하매질 예측능 및 적용성 비교연구)

  • Ahn, Jeongwoo;Jeong, Jina;Park, Eungyu
    • Journal of Soil and Groundwater Environment
    • /
    • v.19 no.4
    • /
    • pp.31-44
    • /
    • 2014
  • In the present study, a few of recently developed geostatistical models are comparatively studied. The models are two-point statistics based sequential indicator simulation (SISIM) and generalized coupled Markov chain (GCMC), multi-point statistics single normal equation simulation (SNESIM), and object based model of FLUVSIM (fluvial simulation) that predicts structures of target object from the provided geometric information. Out of the models, SNESIM and FLUVSIM require additional information other than conditioning data such as training map and geometry, respectively, which generally claim demanding additional resources. For the comparative studies, three-dimensional fluvial reservoir model is developed considering the genetic information and the samples, as input data for the models, are acquired by mimicking realistic sampling (i.e. random sampling). For SNESIM and FLUVSIM, additional training map and the geometry data are synthesized based on the same information used for the objective model. For the comparisons of the predictabilities of the models, two different measures are employed. In the first measure, the ensemble probability maps of the models are developed from multiple realizations, which are compared in depth to the objective model. In the second measure, the developed realizations are converted to hydrogeologic properties and the groundwater flow simulation results are compared to that of the objective model. From the comparisons, it is found that the predictability of GCMC outperforms the other models in terms of the first measure. On the other hand, in terms of the second measure, the both predictabilities of GCMC and SNESIM are outstanding out of the considered models. The excellences of GCMC model in the comparisons may attribute to the incorporations of directional non-stationarity and the non-linear prediction structure. From the results, it is concluded that the various geostatistical models need to be comprehensively considered and comparatively analyzed for appropriate characterizations.

Using Bayesian tree-based model integrated with genetic algorithm for streamflow forecasting in an urban basin

  • Nguyen, Duc Hai;Bae, Deg-Hyo
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.140-140
    • /
    • 2021
  • Urban flood management is a crucial and challenging task, particularly in developed cities. Therefore, accurate prediction of urban flooding under heavy precipitation is critically important to address such a challenge. In recent years, machine learning techniques have received considerable attention for their strong learning ability and suitability for modeling complex and nonlinear hydrological processes. Moreover, a survey of the published literature finds that hybrid computational intelligent methods using nature-inspired algorithms have been increasingly employed to predict or simulate the streamflow with high reliability. The present study is aimed to propose a novel approach, an ensemble tree, Bayesian Additive Regression Trees (BART) model incorporating a nature-inspired algorithm to predict hourly multi-step ahead streamflow. For this reason, a hybrid intelligent model was developed, namely GA-BART, containing BART model integrating with Genetic algorithm (GA). The Jungrang urban basin located in Seoul, South Korea, was selected as a case study for the purpose. A database was established based on 39 heavy rainfall events during 2003 and 2020 that collected from the rain gauges and monitoring stations system in the basin. For the goal of this study, the different step ahead models will be developed based in the methods, including 1-hour, 2-hour, 3-hour, 4-hour, 5-hour, and 6-hour step ahead streamflow predictions. In addition, the comparison of the hybrid BART model with a baseline model such as super vector regression models is examined in this study. It is expected that the hybrid BART model has a robust performance and can be an optional choice in streamflow forecasting for urban basins.

  • PDF

An AutoML-driven Antenna Performance Prediction Model in the Autonomous Driving Radar Manufacturing Process

  • So-Hyang Bak;Kwanghoon Pio Kim
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.12
    • /
    • pp.3330-3344
    • /
    • 2023
  • This paper proposes an antenna performance prediction model in the autonomous driving radar manufacturing process. Our research work is based upon a challenge dataset, Driving Radar Manufacturing Process Dataset, and a typical AutoML machine learning workflow engine, Pycaret open-source Python library. Note that the dataset contains the total 70 data-items, out of which 54 used as input features and 16 used as output features, and the dataset is properly built into resolving the multi-output regression problem. During the data regression analysis and preprocessing phase, we identified several input features having similar correlations and so detached some of those input features, which may become a serious cause of the multicollinearity problem that affect the overall model performance. In the training phase, we train each of output-feature regression models by using the AutoML approach. Next, we selected the top 5 models showing the higher performances in the AutoML result reports and applied the ensemble method so as for the selected models' performances to be improved. In performing the experimental performance evaluation of the regression prediction model, we particularly used two metrics, MAE and RMSE, and the results of which were 0.6928 and 1.2065, respectively. Additionally, we carried out a series of experiments to verify the proposed model's performance by comparing with other existing models' performances. In conclusion, we enhance accuracy for safer autonomous vehicles, reduces manufacturing costs through AutoML-Pycaret and machine learning ensembled model, and prevents the production of faulty radar systems, conserving resources. Ultimately, the proposed model holds significant promise not only for antenna performance but also for improving manufacturing quality and advancing radar systems in autonomous vehicles.

Molecular Dynamics Simulation Studies of Viscosity and Diffusion of n-Alkane Oligomers at High Temperatures

  • Lee, Song-Hi
    • Bulletin of the Korean Chemical Society
    • /
    • v.32 no.11
    • /
    • pp.3909-3913
    • /
    • 2011
  • In this paper we have carried out molecular dynamics simulations (MD) for model systems of liquid n-alkane oligomers ($C_{12}{\sim}C_{80}$) at high temperatures (~2300 K) in a canonical ensemble to calculate viscosity ${\eta}$, self-diffusion constants D, and monomeric friction constant ${\zeta}$. We found that the long chains of these n-alkanes at high temperatures show an abnormality in density and in monomeric friction constant. The behavior of both activation energies, $E_{\eta}$ and $E_D$, and the mass and temperature dependence of ${\eta}$, D, and ${\zeta}$ are discussed.

Molecular dynamics simulation of bulk silicon under strain

  • Zhao, H.;Aluru, N.R.
    • Interaction and multiscale mechanics
    • /
    • v.1 no.2
    • /
    • pp.303-315
    • /
    • 2008
  • In this paper, thermodynamical properties of crystalline silicon under strain are calculated using classical molecular dynamics (MD) simulations based on the Tersoff interatomic potential. The Helmholtz free energy of the silicon crystal under strain is calculated by using the ensemble method developed by Frenkel and Ladd (1984). To account for quantum corrections under strain in the classical MD simulations, we propose an approach where the quantum corrections to the internal energy and the Helmholtz free energy are obtained by using the corresponding energy deviation between the classical and quantum harmonic oscillators. We calculate the variation of thermodynamic properties with temperature and strain and compare them with results obtained by using the quasi-harmonic model in the reciprocal space.

Effective viscosity of bidisperse suspensions

  • Koo Sangkyun;Song Kwang Ho
    • Korea-Australia Rheology Journal
    • /
    • v.17 no.1
    • /
    • pp.27-32
    • /
    • 2005
  • We determine the effective viscosity of suspensions with bidisperse particle size distribution by modifying an effective-medium theory that was proposed by Acrivos and Chang (1987) for monodisperse suspensions. The modified theory uses a simple model that captures some important effects of multi-particle hydrodynamic interactions. The modifications are described in detail in the present study. Estimations of effective viscosity by the modified theory are compared with the results of prior work for monodisperse and bidisperse suspensions. It is shown that the estimations agree very well with experimental or other calculated results up to approximately 0.45 of normalized particle volume fraction which is the ratio of volume faction to the maximum volume fraction of particles for bidisperse suspensions.

PIV Velocity Field Measurements of Flow around a Ship with Rotating Propeller (PIV를 이용한 선박 프로펠러 후류의 속도장 계측)

  • 이상준;백부근
    • Journal of the Society of Naval Architects of Korea
    • /
    • v.40 no.5
    • /
    • pp.17-25
    • /
    • 2003
  • Velocity field behind a container ship model with a rotating propeller has been investigated using PIV (particle image velocimetry) system. Four hundred instantaneous velocity fields were measured at 4 different blade phases and ensemble-averaged to investigate the spatial evolution of vortical structure of near wake within one propeller diameter downstream. The phase-averaged mean velocity fields show the potential wake and the viscous wake formed due to the boundary layers developed on the blade surfaces. The interaction between bilge vortex developed along the hull surface and the tangential velocity component of incoming flow causes to have asymmetric flow structure in the transverse plane.

Proper Orthogonal Mode Analysis of AFM Microcantilevers in Dynamic Mode (동적모드 AFM 마이크로캔틸레버의 적합직교모드 해석)

  • Cho, Hong-Mo;Lee, Soo-Il
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2007.11a
    • /
    • pp.606-611
    • /
    • 2007
  • Proper orthogonal decomposition (POD) is a method for extracting bases for modal decomposition from the ensemble of dynamic signals. Using the POD method, we analyzed the proper orthogonal modes (POMs) of AFM microcantilevers in dynamic mode operations such as Tapping Mode. The POMs and POVs (proper orthogonal values) were computed through MATLAB simulation for the 5-mode model of the microcantilever. We found that the POV portion of the higher POMs of the tapping microcanilever slightly increased in comparison with no tapping. This implies that the modal energy in the fundamental mode can be transferred to the higher modes during tapping.

  • PDF