• 제목/요약/키워드: Bayesian MCMC(Markov Chain Monte Carlo)

검색결과 88건 처리시간 0.027초

가우시안 과정 분류에 대한 변분 베이지안 다항 프로빗 모형: 쥐 단백질 발현 데이터에의 적용 (Variational Bayesian multinomial probit model with Gaussian process classification on mice protein expression level data)

  • 손동현;황범석
    • 응용통계연구
    • /
    • 제36권2호
    • /
    • pp.115-127
    • /
    • 2023
  • 다항 프로빗 모형은 다중 분류와 선택 모형에서 흔히 사용하는 모형이다. 다항 프로빗 모형을 추정하기 위해 일반적으로 널리 사용하는 베이지안 접근법인 마르코프 연쇄 몬테카를로(MCMC) 방법은 계산 복잡도가 매우 높다는 문제점을 가지고 있다. 반면, 변분 베이즈 방법은 MCMC 방법보다 계산 복잡도는 낮으면서도 분류 성능적인 면에서 큰 차이가 나지 않아 더 효율적인 방법으로 알려져 있다. 본 연구에서는 가우시안 과정에 기반한 다항 프로빗 모형을 설명하고 해당 모형에 적용할 수 있는 변분 베이지안 근사법을 알아보고자 한다. 그리고 UCI에서 제공되는 쥐 단백질 발현 데이터에 가우시안 과정 분류에 대한 변분 베이지안 다항 프로빗 모형을 적용하여 그 성능을 확인하고 나이브 베이즈, K-최근접 이웃법, 서포트 벡터 머신 분류기의 성능과 비교한다.

Bayesian Inferences for Software Reliability Models Based on Beta-Mixture Mean Value Functions

  • Nam, Seung-Min;Kim, Ki-Woong;Cho, Sin-Sup;Yeo, In-Kwon
    • 응용통계연구
    • /
    • 제21권5호
    • /
    • pp.835-843
    • /
    • 2008
  • In this paper, we investigate a Bayesian inference for software reliability models based on mean value functions which take the form of the mixture of beta distribution functions. The posterior simulation via the Markov chain Monte Carlo approach is used to produce estimates of posterior properties. Its applicability is illustrated with two real data sets. We compute the predictive distribution and the marginal likelihood of various models to compare the performance of them. The model comparison results show that the model based on the beta-mixture performs better than other models.

The Exponentiated Weibull-Geometric Distribution: Properties and Estimations

  • Chung, Younshik;Kang, Yongbeen
    • Communications for Statistical Applications and Methods
    • /
    • 제21권2호
    • /
    • pp.147-160
    • /
    • 2014
  • In this paper, we introduce the exponentiated Weibull-geometric (EWG) distribution which generalizes two-parameter exponentiated Weibull (EW) distribution introduced by Mudholkar et al. (1995). This proposed distribution is obtained by compounding the exponentiated Weibull with geometric distribution. We derive its cumulative distribution function (CDF), hazard function and the density of the order statistics and calculate expressions for its moments and the moments of the order statistics. The hazard function of the EWG distribution can be decreasing, increasing or bathtub-shaped among others. Also, we give expressions for the Renyi and Shannon entropies. The maximum likelihood estimation is obtained by using EM-algorithm (Dempster et al., 1977; McLachlan and Krishnan, 1997). We can obtain the Bayesian estimation by using Gibbs sampler with Metropolis-Hastings algorithm. Also, we give application with real data set to show the flexibility of the EWG distribution. Finally, summary and discussion are mentioned.

Bayesian Nonstationary Flood Frequency Analysis Using Climate Information

  • Moon, Young-Il;Kwon, Hyun-Han
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2007년도 학술발표회 논문집
    • /
    • pp.1441-1444
    • /
    • 2007
  • It is now widely acknowledged that climate variability modifies the frequency spectrum of hydrological extreme events. Traditional hydrological frequency analysis methodologies are not devised to account for nonstationarity that arises due to variation in exogenous factors of the causal structure. We use Hierarchical Bayesian Analysis to consider the exogenous factors that can influence on the frequency of extreme floods. The sea surface temperatures, predicted GCM precipitation, climate indices and snow pack are considered as potential predictors of flood risk. The parameters of the model are estimated using a Markov Chain Monte Carlo (MCMC) algorithm. The predictors are compared in terms of the resulting posterior distributions of the parameters associated with estimated flood frequency distributions.

  • PDF

Robust Bayesian analysis for autoregressive models

  • Ryu, Hyunnam;Kim, Dal Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • 제26권2호
    • /
    • pp.487-493
    • /
    • 2015
  • Time series data sometimes show violation of normal assumptions. For cases where the assumption of normality is untenable, more exible models can be adopted to accommodate heavy tails. The exponential power distribution (EPD) is considered as possible candidate for errors of time series model that may show violation of normal assumption. Besides, the use of exible models for errors like EPD might be able to conduct the robust analysis. In this paper, we especially consider EPD as the exible distribution for errors of autoregressive models. Also, we represent this distribution as scale mixture of uniform and this form enables efficient Bayesian estimation via Markov chain Monte Carlo (MCMC) methods.

Bayesian estimation of kinematic parameters of disk galaxies in large HI galaxy surveys

  • Oh, Se-Heon;Staveley-Smith, Lister
    • 천문학회보
    • /
    • 제41권2호
    • /
    • pp.62.2-62.2
    • /
    • 2016
  • We present a newly developed algorithm based on a Bayesian method for 2D tilted-ring analysis of disk galaxies which operates on velocity fields. Compared to the conventional ones based on a chi-squared minimisation procedure, this new Bayesian-based algorithm less suffers from local minima of the model parameters even with high multi-modality of their posterior distributions. Moreover, the Bayesian analysis implemented via Markov Chain Monte Carlo (MCMC) sampling only requires broad ranges of posterior distributions of the parameters, which makes the fitting procedure fully automated. This feature is essential for performing kinematic analysis of an unprecedented number of resolved galaxies from the upcoming Square Kilometre Array (SKA) pathfinders' galaxy surveys. A standalone code, the so-called '2D Bayesian Automated Tilted-ring fitter' (2DBAT) that implements the Bayesian fits of 2D tilted-ring models is developed for deriving rotation curves of galaxies that are at least marginally resolved (> 3 beams across the semi-major axis) and moderately inclined (20 < i < 70 degree). The main layout of 2DBAT and its performance test are discussed using sample galaxies from Australia Telescope Compact Array (ATCA) observations as well as artificial data cubes built based on representative rotation curves of intermediate-mass and massive spiral galaxies.

  • PDF

Seismic risk assessment of intake tower in Korea using updated fragility by Bayesian inference

  • Alam, Jahangir;Kim, Dookie;Choi, Byounghan
    • Structural Engineering and Mechanics
    • /
    • 제69권3호
    • /
    • pp.317-326
    • /
    • 2019
  • This research aims to assess the tight seismic risk curve of the intake tower at Geumgwang reservoir by considering the recorded historical earthquake data in the Korean Peninsula. The seismic fragility, a significant part of risk assessment, is updated by using Bayesian inference to consider the uncertainties and computational efficiency. The reservoir is one of the largest reservoirs in Korea for the supply of agricultural water. The intake tower controls the release of water from the reservoir. The seismic risk assessment of the intake tower plays an important role in the risk management of the reservoir. Site-specific seismic hazard is computed based on the four different seismic source maps of Korea. Probabilistic Seismic Hazard Analysis (PSHA) method is used to estimate the annual exceedance rate of hazard for corresponding Peak Ground Acceleration (PGA). Hazard deaggregation is shown at two customary hazard levels. Multiple dynamic analyses and a nonlinear static pushover analysis are performed for deriving fragility parameters. Thereafter, Bayesian inference with Markov Chain Monte Carlo (MCMC) is used to update the fragility parameters by integrating the results of the analyses. This study proves to reduce the uncertainties associated with fragility and risk curve, and to increase significant statistical and computational efficiency. The range of seismic risk curve of the intake tower is extracted for the reservoir site by considering four different source models and updated fragility function, which can be effectively used for the risk management and mitigation of reservoir.

Bayesian model update for damage detection of a steel plate girder bridge

  • Xin Zhou;Feng-Liang Zhang;Yoshinao Goi;Chul-Woo Kim
    • Smart Structures and Systems
    • /
    • 제31권1호
    • /
    • pp.29-43
    • /
    • 2023
  • This study investigates the possibility of damage detection of a real bridge by means of a modal parameter-based finite element (FE) model update. Field moving vehicle experiments were conducted on an actual steel plate girder bridge. In the damage experiment, cracks were applied to the bridge to simulate damage states. A fast Bayesian FFT method was employed to identify and quantify uncertainties of the modal parameters then these modal parameters were used in the Bayesian model update. Material properties and boundary conditions are taken as uncertainties and updated in the model update process. Observations showed that although some differences existed in the results obtained from different model classes, the discrepancy between modal parameters of the FE model and those experimentally obtained was reduced after the model update process, and the updated parameters in the numerical model were indeed affected by the damage. The importance of boundary conditions in the model updating process is also observed. The capability of the MCMC model update method for application to the actual bridge structure is assessed, and the limitation of FE model update in damage detection of bridges using only modal parameters is observed.

베이지안 기법에 기반한 수명자료 분석에 관한 문헌 연구: 2000~2016 (A Review on the Analysis of Life Data Based on Bayesian Method: 2000~2016)

  • 원동연;임준형;심현수;성시일;임헌상;김용수
    • 한국신뢰성학회지:신뢰성응용연구
    • /
    • 제17권3호
    • /
    • pp.213-223
    • /
    • 2017
  • Purpose: The purpose of this study is to arrange the life data analysis literatures based on the Bayesian method quantitatively and provide it as tables. Methods: The Bayesian method produces a more accurate estimates of other traditional methods in a small sample size, and it requires specific algorithm and prior information. Based on these three characteristics of the Bayesian method, the criteria for classifying the literature were taken into account. Results: In many studies, there are comparisons of estimation methods for the Bayesian method and maximum likelihood estimation (MLE), and sample size was greater than 10 and not more than 25. In probability distributions, a variety of distributions were found in addition to the distributions of Weibull commonly used in life data analysis, and MCMC and Lindley's Approximation were used evenly. Finally, Gamma, Uniform, Jeffrey and extension of Jeffrey distributions were evenly used as prior information. Conclusion: To verify the characteristics of the Bayesian method which are more superior to other methods in a smaller sample size, studies in less than 10 samples should be carried out. Also, comparative study is required by various distributions, thereby providing guidelines necessary.

Bayesian Variable Selection in the Proportional Hazard Model with Application to DNA Microarray Data

  • Lee, Kyeon-Eun;Mallick, Bani K.
    • 한국생물정보학회:학술대회논문집
    • /
    • 한국생물정보시스템생물학회 2005년도 BIOINFO 2005
    • /
    • pp.357-360
    • /
    • 2005
  • In this paper we consider the well-known semiparametric proportional hazards (PH) models for survival analysis. These models are usually used with few covariates and many observations (subjects). But, for a typical setting of gene expression data from DNA microarray, we need to consider the case where the number of covariates p exceeds the number of samples n. For a given vector of response values which are times to event (death or censored times) and p gene expressions (covariates), we address the issue of how to reduce the dimension by selecting the significant genes. This approach enable us to estimate the survival curve when n < < p. In our approach, rather than fixing the number of selected genes, we will assign a prior distribution to this number. The approach creates additional flexibility by allowing the imposition of constraints, such as bounding the dimension via a prior, which in effect works as a penalty. To implement our methodology, we use a Markov Chain Monte Carlo (MCMC) method. We demonstrate the use of the methodology to diffuse large B-cell lymphoma (DLBCL) complementary DNA(cDNA) data.

  • PDF