[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.5351/KJAS.2016.29.7.1347

The EM algorithm for mixture regression with missing covariates

Kim, Hyungmin (Department of Statistics, Sungkyunkwan University)
Ham, Geonhee (Center for Public Opinion and Quantitative Research, The Asan Institute for Policy Studies)
Seo, Byungtae (Department of Statistics, Sungkyunkwan University)

Publication Information

The Korean Journal of Applied Statistics / v.29, no.7, 2016, pp. 1347-1359

Abstract

Finite mixtures of regression models provide an effective tool for exploring hidden functional relationships between a response variable and covariates. In practice, however, data are often not fully observed for various reasons. In this paper, we derive an expectation-maximization (EM) algorithm for computing the maximum likelihood estimator when some covariates are missing at random in a finite mixture of regression models. We conduct simulation studies and present real data examples to demonstrate the validity of the derived EM algorithm.
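As a point of orientation only, the following is a minimal sketch of the standard EM iteration for a mixture of simple linear regressions when every covariate is fully observed. It is not the algorithm derived in the paper: the additional expectation over covariates that are missing at random is not reproduced here. All function names (`normal_pdf`, `weighted_line_fit`, `em_mixture_regression`, `simulate`), the single-covariate Gaussian model, and the simulated data are assumptions made for illustration.

```python
import math
import random


def normal_pdf(y, mean, sd):
    """Density of N(mean, sd^2) evaluated at y."""
    z = (y - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def weighted_line_fit(xs, ys, ws):
    """Weighted least squares for y = a + b*x; returns (a, b, sd)."""
    sw = sum(ws)
    mx = sum(w * x for w, x in zip(ws, xs)) / sw
    my = sum(w * y for w, y in zip(ws, ys)) / sw
    sxx = sum(w * (x - mx) ** 2 for w, x in zip(ws, xs))
    sxy = sum(w * (x - mx) * (y - my) for w, x, y in zip(ws, xs, ys))
    b = sxy / sxx
    a = my - b * mx
    var = sum(w * (y - a - b * x) ** 2 for w, x, y in zip(ws, xs, ys)) / sw
    return a, b, math.sqrt(max(var, 1e-8))  # floor keeps sd strictly positive


def em_mixture_regression(xs, ys, init, n_iter=100):
    """EM for a K-component mixture of simple linear regressions.

    init holds one (weight, intercept, slope, sd) tuple per component.
    Assumes fully observed covariates (illustrative simplification).
    """
    params = list(init)
    k_comp = len(params)
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each point.
        resp = []
        for x, y in zip(xs, ys):
            dens = [p * normal_pdf(y, a + b * x, s) for p, a, b, s in params]
            total = sum(dens)
            if total == 0.0:  # all densities underflowed; fall back to uniform
                resp.append([1.0 / k_comp] * k_comp)
            else:
                resp.append([d / total for d in dens])
        # M-step: update mixing weights, refit each line by weighted least squares.
        new_params = []
        for k in range(k_comp):
            ws = [r[k] for r in resp]
            a, b, s = weighted_line_fit(xs, ys, ws)
            new_params.append((sum(ws) / len(xs), a, b, s))
        params = new_params
    return params


def simulate(n=200, seed=0):
    """Hypothetical data: n points from each of two well-separated lines."""
    rng = random.Random(seed)
    xs, ys = [], []
    for _ in range(n):
        x = rng.uniform(0.0, 5.0)
        xs.append(x)
        ys.append(1.0 + 2.0 * x + rng.gauss(0.0, 0.5))
        x = rng.uniform(0.0, 5.0)
        xs.append(x)
        ys.append(20.0 - 1.0 * x + rng.gauss(0.0, 0.5))
    return xs, ys


if __name__ == "__main__":
    xs, ys = simulate()
    start = [(0.5, 0.0, 1.5, 3.0), (0.5, 18.0, 0.0, 3.0)]
    for weight, intercept, slope, sd in em_mixture_regression(xs, ys, start):
        print(f"weight={weight:.3f} intercept={intercept:.3f} "
              f"slope={slope:.3f} sd={sd:.3f}")
```

In this complete-data setting the E-step only needs the component membership probabilities. With covariates missing at random, the E-step would also have to take expectations over the unobserved covariate values, which is the part the paper addresses.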

Keywords

mixture models; missing covariates; mixture regression; EM algorithm

