Browse > Article
http://dx.doi.org/10.7465/jkdi.2017.28.3.685

Semiparametric mixture of experts with unspecified gate network  

Jung, Dahai (Department of Statistics, Sungkyunkwan University)
Seo, Byungtae (Department of Statistics, Sungkyunkwan University)
Publication Information
Journal of the Korean Data and Information Science Society / v.28, no.3, 2017 , pp. 685-695 More about this Journal
Abstract
The traditional mixture of experts (ME) modeled the gate network using a certain parametric function. However, if the assumed parametric function does not properly reflect the true nature, the prediction strength of ME would become weak. For example, the parametric ME often uses logistic or multinomial logistic models for the network model. However, this could be very misleading if the true nature of the data is quite different from those models. Although, in this case, we may develop more flexible parametric models by extending the model at hand, we will never be free from such misspecification problems. In order to alleviate such weakness of the parametric ME, we propose to use the semi-parametric mixture of experts (SME) in which the gate network is estimated in a non-parametrical way. Based on this, we compared the performance of the SME with those of ME and neural networks via several simulation experiments and real data examples.
Keywords
EM algorithm; mixture of experts; neural network; semiparametric models;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 Benaglia, T., Chauveau, D., Hunter, D. R. and Young, D. (2009). Mixtools: An R package for analyzing finite mixture models. Journal of Statistical Software, 32, 1-29.
2 Dempster, A. P., Laird, N. M. and Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, 39, 1-38.
3 Goldfeld, S. M. and Quandt, R. E. (1973). A Markov model for switching regression. Journal of Econometrics, 1, 3-15.   DOI
4 Gunther, F. and Fritsch, S. (2010). Neuralnet: Training of neural networks. The R Journal, 2, 30-38.
5 Hwang, S., Sohn, S. H. and Oh. C. (2015). Maximum likelihood estimation for a mixure distributions. Journal of the Korean Data & Information Science Society, 26, 313-322.   DOI
6 Masoudnia, S. and Ebrahimpour, R. (2014). Mixture of experts: A literature survey. Artificial Intelligence Review, 42, 275-293.   DOI
7 Jacobs, R. A., Jordan, M. I., Nowlan, S. J. and Hinton, G. E. (1991). Adaptive mixtures of local experts. Neural Computation, 3, 79-87.   DOI
8 Jordan, M. I. and Jacobs, R. A. (1994). Hierarchical mixtures of experts and the EM algorithm. Neural Computation, 6, 181-214.   DOI
9 Lee, K. E. (2004). Curve clustering in microarray. Journal of the Korean Data & Information Science Society, 15, 575-584.
10 Oh, C. (2014). A maximum likelihood estimation method for a mixture of shifted binomial distributions. Journal of the Korean Data & Information Science Society, 25, 255-261.   DOI
11 Young, D. S. and Hunter, D. R. (2010). Mixtures of regressions with predictor-dependent mixing proportions. Computational Statistics and Data Analysis, 54, 2253-2266.   DOI
12 Huang M. and Yao, W. (2012). Mixture of regression models with varying mixing proportions: A semiparametric approach. Journal of the American Statistical Association, 107, 711-724.   DOI