Browse > Article

PERFORMANCE EVALUATION OF INFORMATION CRITERIA FOR THE NAIVE-BAYES MODEL IN THE CASE OF LATENT CLASS ANALYSIS: A MONTE CARLO STUDY  

Dias, Jose G. (Department of Quantitative Methods and GIESTA-UNIDE, Higher Institute of Social Sciences and Business Studies-ISCTE)
Publication Information
Journal of the Korean Statistical Society / v.36, no.3, 2007 , pp. 435-445 More about this Journal
Abstract
This paper addresses for the first time the use of complete data information criteria in unsupervised learning of the Naive-Bayes model. A Monte Carlo study sets a large experimental design to assess these criteria, unusual in the Bayesian network literature. The simulation results show that complete data information criteria underperforms the Bayesian information criterion (BIC) for these Bayesian networks.
Keywords
Bayesian information criterion (BIC); complete data information criteria; latent class model; Monte Carlo simulation; Naive-Bayes model.;
Citations & Related Records

Times Cited By Web Of Science : 0  (Related Records In Web of Science)
연도 인용수 순위
  • Reference
1 AKAIKE, H. (1974). 'A new look at the statistical model identification', IEEE Transactions on Automatic Control, 19, 716-723   DOI
2 BIERNACKI, C. AND GOVAERT, G. (1997). 'Using the classification likelihood to choose the number of clusters', Computing Science and Statistics, 29, 451-457
3 CELEUX, G. AND SOROMENHO, G. (1996). 'An entropy criterion for assessing the number of clusters in a mixture model', Journal of Classification, 13, 195-212   DOI
4 DUDA, R. O., HART, P. E. AND STORK, D. G. (2001). Pattern Classification, 2nd ed., Wiley-Interscience, New York
5 FRIEDMAN, N., GEIGER, D. AND GOLDSZMIDT, M. (1997). 'Bayesian network classifiers', Machine Learning, 29, 131-163   DOI
6 McLACHLAN, G. AND PEEL, D. (2000). Finite Mixture Models, Wiley-Interscience, New York
7 HATHAWAY, R. J. (1986). 'Another interpretation of the EM algorithm for mixture distributions', Statistics & Probability Letters, 4, 53-56   DOI   ScienceOn
8 BIERNACKI, C., CELEUX, G. AND GOVAERT, G. (1999). 'An improvement of the NEC criterion for assessing the number of clusters in a mixture model' , Pattern Recognition Letters, 20, 267-272   DOI   ScienceOn
9 BIERNACKI, C., CELEUX, G. AND GOVAERT, G. (2000). 'Assessing a mixture model for clustering with the integrated completed likelihood', IEEE Transactions on Pattern Analysis and Machine Intelligence, 22, 719-725   DOI   ScienceOn
10 GOODMAN, L. A. (1974). 'Exploratory latent structure analysis using both identifiable and unidentifiable models', Biometrika, 61, 215-231   DOI   ScienceOn
11 RISSANEN, J. (1987). 'Stochastic complexity', Journal of the Royal Statistical Society, Ser. B, 49, 223-239
12 TIERNEY, L. AND KADANE, J. B. (1986). 'Accurate approximations for posterior moments and marginal densities', Journal of the American Statistical Association, 81, 82-86   DOI
13 DIAS, J. G. (2004). 'Controlling the level of separation of components in Monte Carlo studies of latent class models', In Classification, Clustering, and Data Mining Applications (Banks, D., House, L., McMorris, F. R., Arabie, P. and Gaul, W., eds.), 77-84, Springer, Berlin
14 SCHWARZ, G. (1978). 'Estimating the dimension of a model', The Annals of Statistics, 6, 461-464   DOI
15 DIAS, J. G. AND WEDEL, M. (2004). 'An empirical comparison of EM, SEM and MCMC performance for problematic Gaussian mixture likelihoods', Statistics and Computing, 14, 323-332   DOI
16 PEARL, J. (1988). Probabilistic Reasoning in Intelligent Systems, Morgan Kaufmann, San Mateo