Efficient Markov Chain Monte Carlo for Bayesian Analysis of Neural Network Models

  • Paul E. Green (Department of Statistics, Seoul National University) ;
  • Changha Hwang (Department of Statistics, Catholic University of Taegu) ;
  • Lee, Sangbock (Department of Statistics, Catholic University)
  • Published : 2002.03.01

Abstract

Most attempts at Bayesian analysis of neural networks involve hierarchical modeling. We believe that similar results can be obtained with simpler models that require less computational effort, as long as appropriate restrictions are placed on parameters in order to ensure propriety of posterior distributions. In particular, we adopt a model first introduced by Lee (1999) that utilizes an improper prior for all parameters. Straightforward Gibbs sampling is possible, with the exception of the bias parameters, which are embedded in nonlinear sigmoidal functions. In addition to the problems posed by nonlinearity, direct sampling from the posterior distributions of the bias parameters is compounded due to the duplication of hidden nodes, which is a source of multimodality. In this regard, we focus on sampling from the marginal posterior distribution of the bias parameters with Markov chain Monte Carlo methods that combine traditional Metropolis sampling with a slice sampler described by Neal (1997, 2001). The methods are illustrated with data examples that are largely confined to the analysis of nonparametric regression models.

Keywords

References

  1. Nonlinear Regression Analysis and Its Applications Bates, D. M.; Watts, D. G.
  2. Neural Networks for Pattern Recognition Bishop, C. M.
  3. Technical Report NCRG95009. Department of Computer Science and Applied Mathematics Bayesian methods for neural networks Bishop, C. M.
  4. Astrophysical Journal Supplement Series v.64 The structure and dynamics of ringed galaxies, III Buta, R. https://doi.org/10.1086/191190
  5. Biometrika v.82 Reversible jump Markov chain Monte Carlo computation and Bayesian model determination Green, P. J. https://doi.org/10.1093/biomet/82.4.711
  6. Neural Computation v.10 Bayesian radial basis functions of variable dimension Holmes, C. C.; Mallick, B. K. https://doi.org/10.1162/089976698300017421
  7. Ph.D. thesis, Department of Statistics Model Selection and Model Averaging for Neural Networks Lee, H. K. H.
  8. review paper to appear in Network, IDPP Probable networks and plausible predictions- a review of practical Bayesian methods for supervised neural networks MacKay, D. J. C.
  9. Neural Computation v.10 Issues in Bayesian analysis of neural network models Muller, P.; Rios Insua, D. https://doi.org/10.1162/089976698300017737
  10. Bayesian Learning for Neural Networks Neal, R. M.
  11. Technical Report No. 9722, Department of Statistics Markov chain Monte Carlo methods based on 'slicing' the density function Neal, R. M.
  12. The Annals of Statistics Slice sampling Neal, R. M.
  13. Practical Nonparametric and Semiparametric Bayesian Statistics Feedforward neural networks for nonparametric regression Rios Insua, D.; Muller, P.; Dey, D.(ed.); Muller, P.(ed.); D. Sinha(ed.)
  14. Pattern Recognition and Neural Networks Ripley, B. D.