• Title/Summary/Keyword: bayesian models

Search Result 527, Processing Time 0.027 seconds

Software Quality Classification using Bayesian Classifier (베이지안 분류기를 이용한 소프트웨어 품질 분류)

  • Hong, Euy-Seok
    • Journal of Information Technology Services
    • /
    • v.11 no.1
    • /
    • pp.211-221
    • /
    • 2012
  • Many metric-based classification models have been proposed to predict fault-proneness of software module. This paper presents two prediction models using Bayesian classifier which is one of the most popular modern classification algorithms. Bayesian model based on Bayesian probability theory can be a promising technique for software quality prediction. This is due to the ability to represent uncertainty using probabilities and the ability to partly incorporate expert's knowledge into training data. The two models, Na$\ddot{i}$veBayes(NB) and Bayesian Belief Network(BBN), are constructed and dimensionality reduction of training data and test data are performed before model evaluation. Prediction accuracy of the model is evaluated using two prediction error measures, Type I error and Type II error, and compared with well-known prediction models, backpropagation neural network model and support vector machine model. The results show that the prediction performance of BBN model is slightly better than that of NB. For the data set with ambiguity, although the BBN model's prediction accuracy is not as good as the compared models, it achieves better performance than the compared models for the data set without ambiguity.

A Comparison Study of Bayesian Methods for a Threshold Autoregressive Model with Regime-Switching (국면전환 임계 자기회귀 분석을 위한 베이지안 방법 비교연구)

  • Roh, Taeyoung;Jo, Seongil;Lee, Ryounghwa
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.6
    • /
    • pp.1049-1068
    • /
    • 2014
  • Autoregressive models are used to analyze an univariate time series data; however, these methods can be inappropriate when a structural break appears in a time series since they assume that a trend is consistent. Threshold autoregressive models (popular regime-switching models) have been proposed to address this problem. Recently, the models have been extended to two regime-switching models with delay parameter. We discuss two regime-switching threshold autoregressive models from a Bayesian point of view. For a Bayesian analysis, we consider a parametric threshold autoregressive model and a nonparametric threshold autoregressive model using Dirichlet process prior. The posterior distributions are derived and the posterior inferences is performed via Markov chain Monte Carlo method and based on two Bayesian threshold autoregressive models. We present a simulation study to compare the performance of the models. We also apply models to gross domestic product data of U.S.A and South Korea.

A Comparative Study on the Performance of Bayesian Partially Linear Models

  • Woo, Yoonsung;Choi, Taeryon;Kim, Wooseok
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.6
    • /
    • pp.885-898
    • /
    • 2012
  • In this paper, we consider Bayesian approaches to partially linear models, in which a regression function is represented by a semiparametric additive form of a parametric linear regression function and a nonparametric regression function. We make a comparative study on the performance of widely used Bayesian partially linear models in terms of empirical analysis. Specifically, we deal with three Bayesian methods to estimate the nonparametric regression function, one method using Fourier series representation, the other method based on Gaussian process regression approach, and the third method based on the smoothness of the function and differencing. We compare the numerical performance of three methods by the root mean squared error(RMSE). For empirical analysis, we consider synthetic data with simulation studies and real data application by fitting each of them with three Bayesian methods and comparing the RMSEs.

Leave-one-out Bayesian model averaging for probabilistic ensemble forecasting

  • Kim, Yongdai;Kim, Woosung;Ohn, Ilsang;Kim, Young-Oh
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.1
    • /
    • pp.67-80
    • /
    • 2017
  • Over the last few decades, ensemble forecasts based on global climate models have become an important part of climate forecast due to the ability to reduce uncertainty in prediction. Moreover in ensemble forecast, assessing the prediction uncertainty is as important as estimating the optimal weights, and this is achieved through a probabilistic forecast which is based on the predictive distribution of future climate. The Bayesian model averaging has received much attention as a tool of probabilistic forecasting due to its simplicity and superior prediction. In this paper, we propose a new Bayesian model averaging method for probabilistic ensemble forecasting. The proposed method combines a deterministic ensemble forecast based on a multivariate regression approach with Bayesian model averaging. We demonstrate that the proposed method is better in prediction than the standard Bayesian model averaging approach by analyzing monthly average precipitations and temperatures for ten cities in Korea.

Shear strength prediction for SFRC and UHPC beams using a Bayesian approach

  • Cho, Hae-Chang;Park, Min-Kook;Hwang, Jin-Ha;Kang, Won-Hee;Kim, Kang Su
    • Structural Engineering and Mechanics
    • /
    • v.74 no.4
    • /
    • pp.503-514
    • /
    • 2020
  • This study proposes prediction models for the shear strength of steel fiber reinforced concrete (SFRC) and ultra-high-performance fiber reinforced concrete (UHPC) beams using a Bayesian parameter estimation approach and a collected experimental database. Previous researchers had already proposed shear strength prediction models for SFRC and UHPC beams, but their performances were limited in terms of their prediction accuracies and the applicability to UHPC beams. Therefore, this study adopted a statistical approach based on a collected database to develop prediction models. In the database, 89 and 37 experimental data for SFRC and UHPC beams without stirrups were collected, respectively, and the proposed equations were developed using the Bayesian parameter estimation approach. The proposed models have a simplified form with important parameters, and in comparison to the existing prediction models, provide unbiased high prediction accuracy.

Development of benthic macroinvertebrate species distribution models using the Bayesian optimization (베이지안 최적화를 통한 저서성 대형무척추동물 종분포모델 개발)

  • Go, ByeongGeon;Shin, Jihoon;Cha, Yoonkyung
    • Journal of Korean Society of Water and Wastewater
    • /
    • v.35 no.4
    • /
    • pp.259-275
    • /
    • 2021
  • This study explored the usefulness and implications of the Bayesian hyperparameter optimization in developing species distribution models (SDMs). A variety of machine learning (ML) algorithms, namely, support vector machine (SVM), random forest (RF), boosted regression tree (BRT), XGBoost (XGB), and Multilayer perceptron (MLP) were used for predicting the occurrence of four benthic macroinvertebrate species. The Bayesian optimization method successfully tuned model hyperparameters, with all ML models resulting an area under the curve (AUC) > 0.7. Also, hyperparameter search ranges that generally clustered around the optimal values suggest the efficiency of the Bayesian optimization in finding optimal sets of hyperparameters. Tree based ensemble algorithms (BRT, RF, and XGB) tended to show higher performances than SVM and MLP. Important hyperparameters and optimal values differed by species and ML model, indicating the necessity of hyperparameter tuning for improving individual model performances. The optimization results demonstrate that for all macroinvertebrate species SVM and RF required fewer numbers of trials until obtaining optimal hyperparameter sets, leading to reduced computational cost compared to other ML algorithms. The results of this study suggest that the Bayesian optimization is an efficient method for hyperparameter optimization of machine learning algorithms.

A study on the Bayesian nonparametric model for predicting group health claims

  • Muna Mauliza;Jimin Hong
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.3
    • /
    • pp.323-336
    • /
    • 2024
  • The accurate forecasting of insurance claims is a critical component for insurers' risk management decisions. Hierarchical Bayesian parametric (BP) models can be used for health insurance claims forecasting, but they are unsatisfactory to describe the claims distribution. Therefore, Bayesian nonparametric (BNP) models can be a more suitable alternative to deal with the complex characteristics of the health insurance claims distribution, including heavy tails, skewness, and multimodality. In this study, we apply both a BP model and a BNP model to predict group health claims using simulated and real-world data for a private life insurer in Indonesia. The findings show that the BNP model outperforms the BP model in terms of claims prediction accuracy. Furthermore, our analysis highlights the flexibility and robustness of BNP models in handling diverse data structures in health insurance claims.

Bayesian Hypothesis Testing for Intraclass Correlation Coefficient

  • Lee, Seung-A;Kim, Dal-Ho
    • Communications for Statistical Applications and Methods
    • /
    • v.13 no.3
    • /
    • pp.551-566
    • /
    • 2006
  • In this paper, we consider a Bayesian model selection for the intraclass correlation coefficient in familiar data. In particular, we compare two nested models such as the independence and intraclass models using the reference prior. A criterion for testing is the Bayesian Reference Criterion by Bernardo (1999) and the Intrinsic Bayes Factor by Berger and Pericchi (1996). We provide numerical examples using simulation data sets for illustration.

Recent advances in Bayesian inference of isolation-with-migration models

  • Chung, Yujin
    • Genomics & Informatics
    • /
    • v.17 no.4
    • /
    • pp.37.1-37.8
    • /
    • 2019
  • Isolation-with-migration (IM) models have become popular for explaining population divergence in the presence of migrations. Bayesian methods are commonly used to estimate IM models, but they are limited to small data analysis or simple model inference. Recently three methods, IMa3, MIST, and AIM, resolved these limitations. Here, we describe the major problems addressed by these three software and compare differences among their inference methods, despite their use of the same standard likelihood function.

Bayesian pooling for contingency tables from small areas

  • Jo, Aejung;Kim, Dal Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.6
    • /
    • pp.1621-1629
    • /
    • 2016
  • This paper studies Bayesian pooling for analysis of categorical data from small areas. Many surveys consist of categorical data collected on a contingency table in each area. Statistical inference for small areas requires considerable care because the subpopulation sample sizes are usually very small. Typically we use the hierarchical Bayesian model for pooling subpopulation data. However, the customary hierarchical Bayesian models may specify more exchangeability than warranted. We, therefore, investigate the effects of pooling in hierarchical Bayesian modeling for the contingency table from small areas. In specific, this paper focuses on the methods of direct or indirect pooling of categorical data collected on a contingency table in each area through Dirichlet priors. We compare the pooling effects of hierarchical Bayesian models by fitting the simulated data. The analysis is carried out using Markov chain Monte Carlo methods.