• 제목/요약/키워드: Bayesian inference model

검색결과 225건 처리시간 0.03초

Estimating dose-response curves using splines: a nonparametric Bayesian knot selection method

  • Lee, Jiwon;Kim, Yongku;Kim, Young Min
    • Communications for Statistical Applications and Methods
    • /
    • 제29권3호
    • /
    • pp.287-299
    • /
    • 2022
  • In radiation epidemiology, the excess relative risk (ERR) model is used to determine the dose-response relationship. In general, the dose-response relationship for the ERR model is assumed to be linear, linear-quadratic, linear-threshold, quadratic, and so on. However, since none of these functions dominate other functions for expressing the dose-response relationship, a Bayesian semiparametric method using splines has recently been proposed. Thus, we improve the Bayesian semiparametric method for the selection of the tuning parameters for splines as the number and location of knots using a Bayesian knot selection method. Equally spaced knots cannot capture the characteristic of radiation exposed dose distribution which is highly skewed in general. Therefore, we propose a nonparametric Bayesian knot selection method based on a Dirichlet process mixture model. Inference of the spline coefficients after obtaining the number and location of knots is performed in the Bayesian framework. We apply this approach to the life span study cohort data from the radiation effects research foundation in Japan, and the results illustrate that the proposed method provides competitive curve estimates for the dose-response curve and relatively stable credible intervals for the curve.

계절성과 경향성을 고려한 극치수문자료의 비정상성 빈도해석 (Nonstationary Frequency Analysis of Hydrologic Extreme Variables Considering of Seasonality and Trend)

  • 이정주;권현한;문영일
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2010년도 학술발표회
    • /
    • pp.581-585
    • /
    • 2010
  • This study introduced a Bayesian based frequency analysis in which the statistical trend seasonal analysis for hydrologic extreme series is incorporated. The proposed model employed Gumbel and GEV extreme distribution to characterize extreme events and a fully coupled bayesian frequency model was finally utilized to estimate design rainfalls in Seoul. Posterior distributions of the model parameters in both trend and seasonal analysis were updated through Markov Chain Monte Carlo Simulation mainly utilizing Gibbs sampler. This study proposed a way to make use of nonstationary frequency model for dynamic risk analysis, and showed an increase of hydrologic risk with time varying probability density functions. In addition, full annual cycle of the design rainfall through seasonal model could be applied to annual control such as dam operation, flood control, irrigation water management, and so on. The proposed study showed advantage in assessing statistical significance of parameters associated with trend analysis through statistical inference utilizing derived posterior distributions.

  • PDF

Bayesian pooling for contingency tables from small areas

  • Jo, Aejung;Kim, Dal Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • 제27권6호
    • /
    • pp.1621-1629
    • /
    • 2016
  • This paper studies Bayesian pooling for analysis of categorical data from small areas. Many surveys consist of categorical data collected on a contingency table in each area. Statistical inference for small areas requires considerable care because the subpopulation sample sizes are usually very small. Typically we use the hierarchical Bayesian model for pooling subpopulation data. However, the customary hierarchical Bayesian models may specify more exchangeability than warranted. We, therefore, investigate the effects of pooling in hierarchical Bayesian modeling for the contingency table from small areas. In specific, this paper focuses on the methods of direct or indirect pooling of categorical data collected on a contingency table in each area through Dirichlet priors. We compare the pooling effects of hierarchical Bayesian models by fitting the simulated data. The analysis is carried out using Markov chain Monte Carlo methods.

영과잉 포아송 회귀모형에 대한 베이지안 추론: 구강위생 자료에의 적용 (Bayesian Analysis of a Zero-inflated Poisson Regression Model: An Application to Korean Oral Hygienic Data)

  • 임아경;오만숙
    • 응용통계연구
    • /
    • 제19권3호
    • /
    • pp.505-519
    • /
    • 2006
  • 셀 수 있는 이산 자료(discrete count data)에 대한 분석은 여러 분야에서 활용되고 있지만 영(zero)을 과도하게 포함하고 있는 영과잉 자료는 자료의 성격상 포아송 분포를 따르지 못할 때가 있어 분석에 어려움이 따른다. Zero-Inflated Poisson(ZIP)모형은 이런 어려움을 극복하기 위하여 영에 대한 점확률을 가지는 분포와 포아송 분포를 합성하여 과도한 영과 영이 아닌 자료를 설명하는 모형이다. 설명 변수가 존재할 때는 포아송 분포 부분에서 반응변수의 평균과 공변량사이에 로그선형 연결함수를 사용한 Zero-Inflated Poisson Regression(ZIPR)모형이 사용될 수 있다. 본 논문에서는 Markov Chain Monte Carlo 기법을 이용한 ZIPR모형의 베이지안 추론방법을 제안하고, 이를 실제 구강위생 자료에 적용하며 다른 모형들과 비교한다. 그 결과 베이지안 추론 방법을 적용한 영과잉 모형의 추정오차가 다른 모형들의 추정오차보다 작았고, 예측치가 더 정확했다는 점에서 우수함을 알 수 있었다.

DEFAULT BAYESIAN INFERENCE OF REGRESSION MODELS WITH ARMA ERRORS UNDER EXACT FULL LIKELIHOODS

  • Son, Young-Sook
    • Journal of the Korean Statistical Society
    • /
    • 제33권2호
    • /
    • pp.169-189
    • /
    • 2004
  • Under the assumption of default priors, such as noninformative priors, Bayesian model determination and parameter estimation of regression models with stationary and invertible ARMA errors are developed under exact full likelihoods. The default Bayes factors, the fractional Bayes factor (FBF) of O'Hagan (1995) and the arithmetic intrinsic Bayes factors (AIBF) of Berger and Pericchi (1996a), are used as tools for the selection of the Bayesian model. Bayesian estimates are obtained by running the Metropolis-Hastings subchain in the Gibbs sampler. Finally, the results of numerical studies, designed to check the performance of the theoretical results discussed here, are presented.

Bayesian Analysis for Heat Effects on Mortality

  • Jo, Young-In;Lim, Youn-Hee;Kim, Ho;Lee, Jae-Yong
    • Communications for Statistical Applications and Methods
    • /
    • 제19권5호
    • /
    • pp.705-720
    • /
    • 2012
  • In this paper, we introduce a hierarchical Bayesian model to simultaneously estimate the thresholds of each 6 cities. It was noted in the literature there was a dramatic increases in the number of deaths if the mean temperature passes a certain value (that we call a threshold). We estimate the difference of mortality before and after the threshold. For the hierarchical Bayesian analysis, some proper prior distribution of parameters and hyper-parameters are assumed. By combining the Gibbs and Metropolis-Hastings algorithm, we constructed a Markov chain Monte Carlo algorithm and the posterior inference was based on the posterior sample. The analysis shows that the estimates of the threshold are located at $25^{\circ}C{\sim}29^{\circ}C$ and the mortality around the threshold changes from -1% to 2~13%.

Spatio-temporal models for generating a map of high resolution NO2 level

  • Yoon, Sanghoo;Kim, Mingyu
    • Journal of the Korean Data and Information Science Society
    • /
    • 제27권3호
    • /
    • pp.803-814
    • /
    • 2016
  • Recent times have seen an exponential increase in the amount of spatial data, which is in many cases associated with temporal data. Recent advances in computer technology and computation of hierarchical Bayesian models have enabled to analyze complex spatio-temporal data. Our work aims at modeling data of daily average nitrogen dioxide (NO2) levels obtained from 25 air monitoring sites in Seoul between 2003 and 2010. We considered an independent Gaussian process model and an auto-regressive model and carried out estimation within a hierarchical Bayesian framework with Markov chain Monte Carlo techniques. A Gaussian predictive process approximation has shown the better prediction performance rather than a Hierarchical auto-regressive model for the illustrative NO2 concentration levels at any unmonitored location.

Bayes factors for accelerated life testing models

  • Smit, Neill;Raubenheimer, Lizanne
    • Communications for Statistical Applications and Methods
    • /
    • 제29권5호
    • /
    • pp.513-532
    • /
    • 2022
  • In this paper, the use of Bayes factors and the deviance information criterion for model selection are compared in a Bayesian accelerated life testing setup. In Bayesian accelerated life testing, the most used tool for model comparison is the deviance information criterion. An alternative and more formal approach is to use Bayes factors to compare models. However, Bayesian accelerated life testing models with more than one stressor often have mathematically intractable posterior distributions and Markov chain Monte Carlo methods are employed to obtain posterior samples to base inference on. The computation of the marginal likelihood is challenging when working with such complex models. In this paper, methods for approximating the marginal likelihood and the application thereof in the accelerated life testing paradigm are explored for dual-stress models. A simulation study is also included, where Bayes factors using the different approximation methods and the deviance information are compared.

Probabilistic-based assessment of composite steel-concrete structures through an innovative framework

  • Matos, Jose C.;Valente, Isabel B.;Cruz, Paulo J.S.;Moreira, Vicente N.
    • Steel and Composite Structures
    • /
    • 제20권6호
    • /
    • pp.1345-1368
    • /
    • 2016
  • This paper presents the probabilistic-based assessment of composite steel-concrete structures through an innovative framework. This framework combines model identification and reliability assessment procedures. The paper starts by describing current structural assessment algorithms and the most relevant uncertainty sources. The developed model identification algorithm is then presented. During this procedure, the model parameters are automatically adjusted, so that the numerical results best fit the experimental data. Modelling and measurement errors are respectively incorporated in this algorithm. The reliability assessment procedure aims to assess the structure performance, considering randomness in model parameters. Since monitoring and characterization tests are common measures to control and acquire information about those parameters, a Bayesian inference procedure is incorporated to update the reliability assessment. The framework is then tested with a set of composite steel-concrete beams, which behavior is complex. The experimental tests, as well as the developed numerical model and the obtained results from the proposed framework, are respectively present.

Survival Analysis for White Non-Hispanic Female Breast Cancer Patients

  • Khan, Hafiz Mohammad Rafiqullah;Saxena, Anshul;Gabbidon, Kemesha;Stewart, Tiffanie Shauna-Jeanne;Bhatt, Chintan
    • Asian Pacific Journal of Cancer Prevention
    • /
    • 제15권9호
    • /
    • pp.4049-4054
    • /
    • 2014
  • Background: Race and ethnicity are significant factors in predicting survival time of breast cancer patients. In this study, we applied advanced statistical methods to predict the survival of White non-Hispanic female breast cancer patients, who were diagnosed between the years 1973 and 2009 in the United States (U.S.). Materials and Methods: Demographic data from the Surveillance Epidemiology and End Results (SEER) database were used for the purpose of this study. Nine states were randomly selected from 12 U.S. cancer registries. A stratified random sampling method was used to select 2,000 female breast cancer patients from these nine states. We compared four types of advanced statistical probability models to identify the best-fit model for the White non-Hispanic female breast cancer survival data. Three model building criterion were used to measure and compare goodness of fit of the models. These include Akaike Information Criteria (AIC), Bayesian Information Criteria (BIC), and Deviance Information Criteria (DIC). In addition, we used a novel Bayesian method and the Markov Chain Monte Carlo technique to determine the posterior density function of the parameters. After evaluating the model parameters, we selected the model having the lowest DIC value. Using this Bayesian method, we derived the predictive survival density for future survival time and its related inferences. Results: The analytical sample of White non-Hispanic women included 2,000 breast cancer cases from the SEER database (1973-2009). The majority of cases were married (55.2%), the mean age of diagnosis was 63.61 years (SD = 14.24) and the mean survival time was 84 months (SD = 35.01). After comparing the four statistical models, results suggested that the exponentiated Weibull model (DIC= 19818.220) was a better fit for White non-Hispanic females' breast cancer survival data. This model predicted the survival times (in months) for White non-Hispanic women after implementation of precise estimates of the model parameters. Conclusions: By using modern model building criteria, we determined that the data best fit the exponentiated Weibull model. We incorporated precise estimates of the parameter into the predictive model and evaluated the survival inference for the White non-Hispanic female population. This method of analysis will assist researchers in making scientific and clinical conclusions when assessing survival time of breast cancer patients.