• Title/Summary/Keyword: Markov chain Monte Carlo

Search Result 271, Processing Time 0.026 seconds

At-site Low Flow Frequency Analysis Using Bayesian MCMC: I. Theoretical Background and Construction of Prior Distribution (Bayesian MCMC를 이용한 저수량 점 빈도분석: I. 이론적 배경과 사전분포의 구축)

  • Kim, Sang-Ug;Lee, Kil-Seong
    • Journal of Korea Water Resources Association
    • /
    • v.41 no.1
    • /
    • pp.35-47
    • /
    • 2008
  • The low flow analysis is an important part in water resources engineering. Also, the results of low flow frequency analysis can be used for design of reservoir storage, water supply planning and design, waste-load allocation, and maintenance of quantity and quality of water for irrigation and wild life conservation. Especially, for identification of the uncertainty in frequency analysis, the Bayesian approach is applied and compared with conventional methodologies in at-site low flow frequency analysis. In the first manuscript, the theoretical background for the Bayesian MCMC (Bayesian Markov Chain Monte Carlo) method and Metropolis-Hasting algorithm are studied. Two types of the prior distribution, a non-data- based and a data-based prior distributions are developed and compared to perform the Bayesian MCMC method. It can be suggested that the results of a data-based prior distribution is more effective than those of a non-data-based prior distribution. The acceptance rate of the algorithm is computed to assess the effectiveness of the developed algorithm. In the second manuscript, the Bayesian MCMC method using a data-based prior distribution and MLE(Maximum Likelihood Estimation) using a quadratic approximation are performed for the at-site low flow frequency analysis.

A Bayesian Extreme Value Analysis of KOSPI Data (코스피 지수 자료의 베이지안 극단값 분석)

  • Yun, Seok-Hoon
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.5
    • /
    • pp.833-845
    • /
    • 2011
  • This paper conducts a statistical analysis of extreme values for both daily log-returns and daily negative log-returns, which are computed using a collection of KOSPI data from January 3, 1998 to August 31, 2011. The Poisson-GPD model is used as a statistical analysis model for extreme values and the maximum likelihood method is applied for the estimation of parameters and extreme quantiles. To the Poisson-GPD model is also added the Bayesian method that assumes the usual noninformative prior distribution for the parameters, where the Markov chain Monte Carlo method is applied for the estimation of parameters and extreme quantiles. According to this analysis, both the maximum likelihood method and the Bayesian method form the same conclusion that the distribution of the log-returns has a shorter right tail than the normal distribution, but that the distribution of the negative log-returns has a heavier right tail than the normal distribution. An advantage of using the Bayesian method in extreme value analysis is that there is nothing to worry about the classical asymptotic properties of the maximum likelihood estimators even when the regularity conditions are not satisfied, and that in prediction it is effective to reflect the uncertainties from both the parameters and a future observation.

Bayesian analysis of Korean income data using zero-inflated Tobit model (영과잉 토빗모형을 이용한 한국 소득분포 자료의 베이지안 분석)

  • Hwang, Jisu;Kim, Sei-Wan;Oh, Man-Suk
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.6
    • /
    • pp.917-929
    • /
    • 2017
  • Korean income data obtained from Korea Labor Panel Survey shows excessive zeros, which may not be properly explained by the Tobit model. In this paper, we analyze the data using a zero-inflated Tobit model to incorporate excessive zeros. A zero-inflated Tobit model consists of two stages. In the first stage, individuals with 0 income are divided into two groups: genuine zero group and random zero group. Individuals in the genuine zero group did not participate labor market since they have no intention to do so. Individuals in the random zero group participated labor market but their incomes are very low and truncated at 0. In the second stage, the Tobit model is assumed to a subset of data combining random zeros and positive observations. Regression models are employed in both stages to obtain the effect of explanatory variables on the participation of labor market and the income amount. Markov chain Monte Carlo methods are applied for the Bayesian analysis of the data. The proposed zero-inflated Tobit model outperforms the Tobit model in model fit and prediction of zero frequency. The analysis results show strong evidence that the probability of participating in the labor market increases with age, decreases with education, and women tend to have stronger intentions on participating in the labor market than men. There also exists moderate evidence that the probability of participating in the labor market decreases with socio-economic status and reserved wage. However, the amount of monthly wage increases with age and education, and it is larger for married than unmarried and for men than women.

Analysis of the Wave Spectral Shape Parameters for the Definition of Swell Waves (너울성파랑 정의를 위한 파랑스펙트럼의 형상모수 특성 분석)

  • Ahn, Kyungmo;Chun, Hwusub;Jeong, Weon Mu;Park, Deungdae;Kang, Tae-Soon;Hong, Sung-Jin
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.25 no.6
    • /
    • pp.394-404
    • /
    • 2013
  • In the present study, the characteristics of spectral peakedness parameter $Q_p$, bandwidth parameter ${\varepsilon}$, and spectral width parameter ${\nu}$ were analyzed as a first step to define the swell waves quantitatively. For the analysis, the joint probability density function of significant wave heights and peak periods were newly developed. The MCMC(Markov Chain Monte Carlo) simulations have been performed to generate the significant wave heights and peak periods from the developed probability density functions. Applying the simulated significant wave heights and peak periods to the theoretical wave spectrum models, the spectral shapes parameters were obtained and analyzed. Among the spectral shape parameters, only the spectral peakedness parameter $Q_p$, is shown to be independent with the significant wave height and peak wave period. It also best represents the peakedness of the spectral shape, and henceforth $Q_p$ should be used to define the swell waves with a wave period. For the field verification of the results, wave data obtained from Hupo port and Ulleungdo were analyzed and results showed the same trend with the MCMC simulation results.

A Study on Regionalization of Parameters for Sacramento Continuous Rainfall-Runoff Model Using Watershed Characteristics (유역특성인자를 활용한 Sacramento 장기유출모형의 매개변수 지역화 기법 연구)

  • Kim, Tae-Jeong;Jeong, Ga-In;Kim, Ki-Young;Kwon, Hyun-Han
    • Journal of Korea Water Resources Association
    • /
    • v.48 no.10
    • /
    • pp.793-806
    • /
    • 2015
  • The simulation of natural streamflow at ungauged basins is one of the fundamental challenges in hydrology community. The key to runoff simulation in ungauged basins is generally involved with a reliable parameter estimation in a rainfall-runoff model. However, the parameter estimation of the rainfall-runoff model is a complex issue due to an insufficient hydrologic data. This study aims to regionalize the parameters of a continuous rainfall-runoff model in conjunction with a Bayesian statistical technique to consider uncertainty more precisely associated with the parameters. First, this study employed Bayesian Markov Chain Monte Carlo scheme for the estimation of the Sacramento rainfall-runoff model. The Sacramento model is calibrated against observed daily runoff data, and finally, the posterior density function of the parameters is derived. Second, we applied a multiple linear regression model to the set of the parameters with watershed characteristics, to obtain a functional relationship between pairs of variables. The proposed model was also validated with gauged watersheds in accordance with the efficiency criteria such as the Nash-Sutcliffe efficiency, index of agreement and the coefficient of correlation.

Effects of Financial College Tuition Support by Korean Parents using a Hierarchical Bayes Model (계층적 베이즈 모형을 이용한 대학등록금에 대한 부모님의 경제적 지원 영향 분석)

  • Oh, Man-Suk;Oh, Hyun Sook;Oh, Min Jung
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.2
    • /
    • pp.267-280
    • /
    • 2013
  • College tuition is a significant economic, social, and political issue in Korea. We conduct a Bayesian analysis of a hierarchical model to address the factors related to college tuition based on a survey data collected by Statistics Korea. A binary response variable is selected depending on if more than 70% of tuition costs are supported by parents, and a hierarchical Probit model is constructed with areas as groups. A set of explanatory variables is selected from a factor analysis of available variables in the survey. A Markov chain Monte Carlo algorithm is used to estimate parameters. From the analysis results, income and stress are significantly related to college tuition support from parents. Parents with high income tend to support children's college tuition and students with parents' financial support tend to be mentally less stressed; subsequently, this shows that the economic status of parents significantly affects the mental health of college students. Gender, a healthy life style, and college satisfaction are not significant factors. Comparing areas in terms of the degrees of correlation between stress/income and tuition support from parents, students in Kangwon-do are the most mentally stressed when parents' support is limited; in addition, the positive correlation between parents support and income is stronger in big cities compared to provincial areas.

Concept of Seasonality Analysis of Hydrologic Extreme Variables and Design Rainfall Estimation Using Nonstationary Frequency Analysis (극치수문자료의 계절성 분석 개념 및 비정상성 빈도해석을 이용한 확률강수량 해석)

  • Lee, Jeong-Ju;Kwon, Hyun-Han;Hwang, Kyu-Nam
    • Journal of Korea Water Resources Association
    • /
    • v.43 no.8
    • /
    • pp.733-745
    • /
    • 2010
  • Seasonality of hydrologic extreme variable is a significant element from a water resources managemental point of view. It is closely related with various fields such as dam operation, flood control, irrigation water management, and so on. Hydrological frequency analysis conjunction with partial duration series rather than block maxima, offers benefits that include data expansion, analysis of seasonality and occurrence. In this study, nonstationary frequency analysis based on the Bayesian model has been suggested which effectively linked with advantage of POT (peaks over threshold) analysis that contains seasonality information. A selected threshold that the value of upper 98% among the 24 hours duration rainfall was applied to extract POT series at Seoul station, and goodness-fit-test of selected GEV distribution has been examined through graphical representation. Seasonal variation of location and scale parameter ($\mu$ and $\sigma$) of GEV distribution were represented by Fourier series, and the posterior distributions were estimated by Bayesian Markov Chain Monte Carlo simulation. The design rainfall estimated by GEV quantile function and derived posterior distribution for the Fourier coefficients, were illustrated with a wide range of return periods. The nonstationary frequency analysis considering seasonality can reasonably reproduce underlying extreme distribution and simultaneously provide a full annual cycle of the design rainfall as well.

A Study on the War Simulation and Prediction Using Bayesian Inference (베이지안 추론을 이용한 전쟁 시뮬레이션과 예측 연구)

  • Lee, Seung-Lyong;Yoo, Byung Joo;Youn, Sangyoun;Bang, Sang-Ho;Jung, Jae-Woong
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.11
    • /
    • pp.77-86
    • /
    • 2021
  • A method of constructing a war simulation based on Bayesian Inference was proposed as a method of constructing heterogeneous historical war data obtained with a time difference into a single model. A method of applying a linear regression model can be considered as a method of predicting future battles by analyzing historical war results. However it is not appropriate for two heterogeneous types of historical data that reflect changes in the battlefield environment due to different times to be suitable as a single linear regression model and violation of the model's assumptions. To resolve these problems a Bayesian inference method was proposed to obtain a post-distribution by assuming the data from the previous era as a non-informative prior distribution and to infer the final posterior distribution by using it as a prior distribution to analyze the data obtained from the next era. Another advantage of the Bayesian inference method is that the results sampled by the Markov Chain Monte Carlo method can be used to infer posterior distribution or posterior predictive distribution reflecting uncertainty. In this way, it has the advantage of not only being able to utilize a variety of information rather than analyzing it with a classical linear regression model, but also continuing to update the model by reflecting additional data obtained in the future.

Estimation of Interaction Effects among Nucleotide Sequence Variants in Animal Genomes

  • Lee, Chaeyoung;Kim, Younyoung
    • Asian-Australasian Journal of Animal Sciences
    • /
    • v.22 no.1
    • /
    • pp.124-130
    • /
    • 2009
  • Estimating genetic interaction effects in animal genomics would be one of the most challenging studies because the phenotypic variation for economically important traits might be largely explained by interaction effects among multiple nucleotide sequence variants under various environmental exposures. Genetic improvement of economic animals would be expected by understanding multi-locus genetic interaction effects associated with economic traits. Most analyses in animal breeding and genetics, however, have excluded the possibility of genetic interaction effects in their analytical models. This review discusses a historical estimation of the genetic interaction and difficulties in analyzing the interaction effects. Furthermore, two recently developed methods for assessing genetic interactions are introduced to animal genomics. One is the restricted partition method, as a nonparametric grouping-based approach, that iteratively utilizes grouping of genotypes with the smallest difference into a new group, and the other is the Bayesian method that draws inferences about the genetic interaction effects based on their marginal posterior distributions and attains the marginalization of the joint posterior distribution through Gibbs sampling as a Markov chain Monte Carlo. Further developing appropriate and efficient methods for assessing genetic interactions would be urgent to achieve accurate understanding of genetic architecture for complex traits of economic animals.

Comparison of Estimation Methods in NONMEM 7.2: Application to a Real Clinical Trial Dataset (실제 임상 데이터를 이용한 NONMEM 7.2에 도입된 추정법 비교 연구)

  • Yun, Hwi-Yeol;Chae, Jung-Woo;Kwon, Kwang-Il
    • Korean Journal of Clinical Pharmacy
    • /
    • v.23 no.2
    • /
    • pp.137-141
    • /
    • 2013
  • Purpose: This study compared the performance of new NONMEM estimation methods using a population analysis dataset collected from a clinical study that consisted of 40 individuals and 567 observations after a single oral dose of glimepiride. Method: The NONMEM 7.2 estimation methods tested were first-order conditional estimation with interaction (FOCEI), importance sampling (IMP), importance sampling assisted by mode a posteriori (IMPMAP), iterative two stage (ITS), stochastic approximation expectation-maximization (SAEM), and Markov chain Monte Carlo Bayesian (BAYES) using a two-compartment open model. Results: The parameters estimated by IMP, IMPMAP, ITS, SAEM, and BAYES were similar to those estimated using FOCEI, and the objective function value (OFV) for diagnosing the model criteria was significantly decreased in FOCEI, IMPMAP, SAEM, and BAYES in comparison with IMP. Parameter precision in terms of the estimated standard error was estimated precisely with FOCEI, IMP, IMPMAP, and BAYES. The run time for the model analysis was shortest with BAYES. Conclusion: In conclusion, the new estimation methods in NONMEM 7.2 performed similarly in terms of parameter estimation, but the results in terms of parameter precision and model run times using BAYES were most suitable for analyzing this dataset.