Browse > Article
http://dx.doi.org/10.7465/jkdi.2017.28.6.1547

Bayesian test of homogenity in small areas: A discretization approach  

Kim, Min Sup (Department of Statistics, Kyungpook National University)
Nandram, Balgobin (Department of Mathematical Sciences, Worcester Polytechnic Institute)
Kim, Dal Ho (Department of Statistics, Kyungpook National University)
Publication Information
Journal of the Korean Data and Information Science Society / v.28, no.6, 2017 , pp. 1547-1555 More about this Journal
Abstract
This paper studies Bayesian test of homogeneity in contingency tables made by discretizing a continuous variable. Sometimes when we are considering events of interest in small area setup, we can think of discretization approaches about the continuous variable. If we properly discretize the continuous variable, we can find invisible relationships between areas (groups) and a continuous variable of interest. The proper discretization of the continuous variable can support the alternative hypothesis of the homogeneity test in contingency tables even if the null hypothesis was not rejected through k-sample tests involving one-way ANOVA. In other words, the proportions of variables with a particular level can vary from group to group by the discretization. If we discretize the the continuous variable, it can be treated as an analysis of the contingency table. In this case, the chi-squared test is the most commonly employed method. However, further discretization gives rise to more cells in the table. As a result, the count in the cells becomes smaller and the accuracy of the test becomes lower. To prevent this, we can consider the Bayesian approach and apply it to the setup of the homogeneity test.
Keywords
Contingency table; Dirichlet prior; discretization; hierarchical Bayesian model; test of homogeneity;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 Agresti, A. and Hitchcock, D. B. (2005). Bayesian inference for categorical data analysis. Statistical Methods and Applications, 14, 297-330.   DOI
2 Jiang, B., Ye, C., and Liu, J. S. (2015). Nonparametric K-sample tests via dynamic slicing. Journal of the American Statistical Association, 110, 642-653.   DOI
3 Jo, A., and Kim, D. H. (2017). Bayes tests of independence for contingency tables from small areas. Journal of the Korean Data & Information Science Society, 28, 207-215.   DOI
4 Kass, R. E. and Raftery, A. E. (1995). Bayes factor. Journal of the American Statistical Association, 90, 773-795.   DOI
5 Miller, R. and Siegmund, D. (1982). Maximally selected chi square statistics. Biometrics, 38, 1011-1016.   DOI
6 Rousseeuw, P. J. (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Computational and Applied Mathematics, 20, 53-65.   DOI
7 Scholz, F. W. and Stephens, M. A. (1987). K-sample Anderson-Darling tests. Journal of the American Statistical Association, 82, 918-924.
8 Tibshirani, R., Walther, G. and Hastie, T. (2001). Estimating the number of clusters in a data set via the gap statistic. Journal of the Royal Statistical Society, Series B, 63, 411-423.   DOI
9 Woo, N. and Kim, D. H. (2015). A Bayesian uncertainty analysis for nonignorable nonresponse in two-way contingency table. Journal of the Korean Data & Information Science Society, 26, 1547-1555.   DOI
10 Woo, N. and Kim, D. H. (2016). A Bayesian model for two-way contingency tables with nonignorable nonresponse from small areas. Journal of the Korean Data & Information Science Society, 27, 245-254.   DOI
11 Cressie, N. and Read, T. R. C. (1984). Multinomial goodness-of-fit tests. Journal of the Royal Statistical Society, Series B, 46, 440-464.