• 제목/요약/키워드: Bayesian Probability Statistics

검색결과 109건 처리시간 0.025초

베이지안 신경망을 이용한 분류분석 (A Classification Analysis using Bayesian Neural Network)

  • 황진수;최성용;전홍석
    • Journal of the Korean Data and Information Science Society
    • /
    • 제12권2호
    • /
    • pp.11-25
    • /
    • 2001
  • 자료들 사이에 존재하는 관계, 패턴, 규칙등을 찾아내서 모형화 하는 통계적인 분류기법은 여러가지가 있다. 그러나 우리가 얻게 되는 지식은 어떤 일련의 분류규칙에 의해서가 아닌 관찰과 학습을 통한 훈련으로부터 얻게 된다. 본 베이지안 학습은 모든 형태의 불확실성을 표현하는 확률로써 우리의 믿음의 정도를 표현하는 것으로 해석될 수 있으며, 확실한 결과가 알려짐에 따라 확률이론 법칙을 사용하여 이러한 확률들을 갱신한다. 또한 신경망 모형은 이미 알고 있는 속성들에 근거하여 아직 알지 못하는 집단이나 특질들을 예측하게 해준다. 본 논문에서는 이러한 두 가지 방법을 결합한 베이지안 신경망과 기존의 CHAID, CART, QUBST 분류 알고리즘에 있어서 각각 오분류율을 비교연구하였다.

  • PDF

A dynamic Bayesian approach for probability of default and stress test

  • Kim, Taeyoung;Park, Yousung
    • Communications for Statistical Applications and Methods
    • /
    • 제27권5호
    • /
    • pp.579-588
    • /
    • 2020
  • Obligor defaults are cross-sectionally correlated as obligors share common economic conditions; in addition obligors are longitudinally correlated so that an economic shock like the IMF crisis in 1998 lasts for a period of time. A longitudinal correlation should be used to construct statistical scenarios of stress test with which we replace a type of artificial scenario that the banks have used. We propose a Bayesian model to accommodate such correlation structures. Using 402 obligors to a domestic bank in Korea, our model with a dynamic correlation is compared to a Bayesian model with a stationary longitudinal correlation and the classical logistic regression model. Our model generates statistical financial statement under a stress situation on individual obligor basis so that the genearted financial statement produces a similar distribution of credit grades to when the IMF crisis occurred and complies with Basel IV (Basel Committee on Banking Supervision, 2017) requirement that the credit grades under a stress situation are not sensitive to the business cycle.

순서범주형자료 분석을 위한 베이지안 분계점 모형 (A Bayesian Threshold Model for Ordered Categorical Traits)

  • 최병수;이승천
    • 응용통계연구
    • /
    • 제18권1호
    • /
    • pp.173-182
    • /
    • 2005
  • 순서를 갖는 범주형자료의 분석을 위한 중요한 통계적 방법인 순위로짓모형의 대안으로 무정보 사전분포에 의한 베이지안 분계점 모형을 정의하고, 실증 자료분석을 통해 베이지안 모형의 유용성을 살펴보았다.

Bayesian Multiple Change-Point Estimation and Segmentation

  • Kim, Jaehee;Cheon, Sooyoung
    • Communications for Statistical Applications and Methods
    • /
    • 제20권6호
    • /
    • pp.439-454
    • /
    • 2013
  • This study presents a Bayesian multiple change-point detection approach to segment and classify the observations that no longer come from an initial population after a certain time. Inferences are based on the multiple change-points in a sequence of random variables where the probability distribution changes. Bayesian multiple change-point estimation is classifies each observation into a segment. We use a truncated Poisson distribution for the number of change-points and conjugate prior for the exponential family distributions. The Bayesian method can lead the unsupervised classification of discrete, continuous variables and multivariate vectors based on latent class models; therefore, the solution for change-points corresponds to the stochastic partitions of observed data. We demonstrate segmentation with real data.

A Bayesian Method for Narrowing the Scope of Variable Selection in Binary Response Logistic Regression

  • Kim, Hea-Jung;Lee, Ae-Kyung
    • 품질경영학회지
    • /
    • 제26권1호
    • /
    • pp.143-160
    • /
    • 1998
  • This article is concerned with the selection of subsets of predictor variables to be included in bulding the binary response logistic regression model. It is based on a Bayesian aproach, intended to propose and develop a procedure that uses probabilistic considerations for selecting promising subsets. This procedure reformulates the logistic regression setup in a hierarchical normal mixture model by introducing a set of hyperparameters that will be used to identify subset choices. It is done by use of the fact that cdf of logistic distribution is a, pp.oximately equivalent to that of $t_{(8)}$/.634 distribution. The a, pp.opriate posterior probability of each subset of predictor variables is obtained by the Gibbs sampler, which samples indirectly from the multinomial posterior distribution on the set of possible subset choices. Thus, in this procedure, the most promising subset of predictors can be identified as that with highest posterior probability. To highlight the merit of this procedure a couple of illustrative numerical examples are given.

  • PDF

베이지안 확률기법을 이용한 당량비 오차분석에 관한 연구 (Error Analysis of Equivalence Ratio using Bayesian Statistics)

  • 안중기;박익수;이호일
    • 한국추진공학회지
    • /
    • 제22권2호
    • /
    • pp.131-137
    • /
    • 2018
  • 엔진 당량비의 제어오차가 요구범위를 불만족할 확률을 분석하였다. 당량비의 제어오차는 무작위 불확실요소와 인식론적 불확실요소로부터 동시에 영향을 받는다. 무작위 불확실요소는 일반적으로 확률 분포가 주어지므로 민감도 기반의 신뢰성 해석기법을 이용해 쉽게 해석이 가능하다. 확률분포를 알기 어려운 인식론적 불확실요소를 다루기 위해서는 새로운 접근법이 필요하다. 무작위 불확실요소에 대한 신뢰성 해석결과를 베이지안 추론에 이용함으로서 엔진 당량비의 제어오차가 요구범위를 불만족할 확률에 대한 확률분포를 구할 수 있었다. 이러한 접근은 무작위 불확실요소와 인식론적 불확실 요소가 동시에 존재하는 공학시스템 해석에 유용하게 사용될 수 있다.

A Bayesian Comparison of Two Multivariate Normal Genralized Variances

  • 김혜중
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2002년도 춘계 학술발표회 논문집
    • /
    • pp.73-78
    • /
    • 2002
  • In this paper we develop a method for constructing a Bayesian HPD (highest probability density) interval of a ratio of two multivariate normal generalized variances. The method gives a way of comparing two multivariate populations in terms of their dispersion or spread, because the generalized variance is a scalar measure of the overall multivariate scatter. Fully parametric frequentist approaches for the interval is intractable and thus a Bayesian HPD(highest probability densith) interval is pursued using a variant of weighted Monte Carlo (WMC) sampling based approach introduced by Chen and Shao(1999). Necessary theory involved in the method and computation is provided.

  • PDF

베이즈 추론을 수학과 교육과정에 도입하는 것의 실제 의미에 대한 일고찰 (A consideration of the real meanings of introducing Bayesian inference into school mathematics curriculum)

  • 박선용
    • 한국수학사학회지
    • /
    • 제37권1호
    • /
    • pp.1-17
    • /
    • 2024
  • In this study, we identified the intellectual triggers for Bayesian inference and what key ideas contributed to its occurrence and discussed the practical implications of introducing Bayesian inference into the school mathematics curriculum by reflecting them. The results of the study show that the need for statistical inference about the parameter itself served as a trigger for the occurrence of Bayesian inference, and the most important idea for the occurrence of that inference was to regard the parameter itself as a probability variable rather than any fixed value. On the other hand, these research results suggest that the meaning of introducing Bayesian inference into the secondary mathematics curriculum is 'statistics education that expands the scope of uncertainty'.

A Bayesian Criterion for a Multiple test of Two Multivariate Normal Populations

  • 김혜중;손영숙
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2000년도 추계학술발표회 논문집
    • /
    • pp.147-152
    • /
    • 2000
  • A Bayesian criterion is proposed for a multiple test of two independent multivariate normal populations. For a Bayesian test the fractional Bayes facto.(FBF) of O'Hagan(1995) is used under the assumption of Jeffreys priors, noninformative improper proirs. In this test the FBF without the need of sampling minimal training samples is much simpler to use than the intrinsic Bayes facotr(IBF) of Berger and Pericchi(1996). Finally, a simulation study is performed to show the behaviors of the FBF.

  • PDF

Bayesian Outlier Detection in Regression Model

  • Younshik Chung;Kim, Hyungsoon
    • Journal of the Korean Statistical Society
    • /
    • 제28권3호
    • /
    • pp.311-324
    • /
    • 1999
  • The problem of 'outliers', observations which look suspicious in some way, has long been one of the most concern in the statistical structure to experimenters and data analysts. We propose a model for an outlier problem and also analyze it in linear regression model using a Bayesian approach. Then we use the mean-shift model and SSVS(George and McCulloch, 1993)'s idea which is based on the data augmentation method. The advantage of proposed method is to find a subset of data which is most suspicious in the given model by the posterior probability. The MCMC method(Gibbs sampler) can be used to overcome the complicated Bayesian computation. Finally, a proposed method is applied to a simulated data and a real data.

  • PDF