• Title/Summary/Keyword: mixture regression

Search Result 209, Processing Time 0.022 seconds

A Graphical Method for Evaluating the Mixture Component Effects of Ridge Regression Estimator in Mixture Experiments

  • Jang, Dae-Heung
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.1
    • /
    • pp.1-10
    • /
    • 1999
  • When the component proportions in mixture experiments are restricted by lower and upper bounds multicollinearity appears all too frequently. The ridge regression can be used to stabilize the coefficient estimates in the fitted model. I propose a graphical method for evaluating the mixture component effects of ridge regression estimator with respect to the prediction variance and the prediction bias.

  • PDF

Segmentation of the Compensation Packages for Doctors by Mixture Regression Model (혼합회귀모델을 이용한 의사의 선호보상체계 분석)

  • Paik, Soo-Kyung;Kwak, Young-Sik
    • Korea Journal of Hospital Management
    • /
    • v.10 no.4
    • /
    • pp.75-97
    • /
    • 2005
  • The research objective is to empirically investigate the compensation packages maximizing the utilities of internal customers by applying the market segmentation theory. Data was collected from four Korean hospitals in Seoul, Busan and Gyunggi-do. The research is designed to seek the compensation package maximizing the utility of doctors by mixture regression model, which has been applied as latent structure and other type of finite mixture models from various academic fields since early 1980s. The mixture regression model shows the optimal segments number and fuzzy classification for each observation by EM(expectation-maximization algorism). The finite mixture regression model is to unmix the sample, to identify the groups, and to estimate the parameters of the density function underlying the observed data within each group. The doctors were segmented into 5 groups by their preference for the compensation package. The results of this study imply that the utility of doctors increases with differentiated compensation package segmented by their preference.

  • PDF

Optimal Restrictions on Regression Parameters For Linear Mixture Model

  • Ahn, Jung-Yeon;Park, Sung-Hyun
    • Journal of the Korean Statistical Society
    • /
    • v.28 no.3
    • /
    • pp.325-336
    • /
    • 1999
  • Collinearity among independent variables can have severe effects on the precision of response estimation for some region of interest in the experiments with mixture. A method of finding optimal linear restriction on regression parameter in linear model for mixture experiments in the sense of minimizing integrated mean squared error is studied. We use the formulation of optimal restrictions on regression parameters for estimating responses proposed by Park(1981) by transforming mixture components to mathematically independent variables.

  • PDF

Market Segmentation With Price-Dependent Quality Evaluation in Denim Jeans Market ; Based on Conjoin analysis and mixture model (청바지제품 세분시장 내 가격-품질 평가집단 추출에 관한 연구: 결합분석과 mixture model를 이용하여)

  • 곽영식;이진화
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.26 no.11
    • /
    • pp.1605-1614
    • /
    • 2002
  • The purpose of this study was to identify the consumers who use the level of price as the indicator of the product quality. In order to implement the purpose of this study, Jeans market had been segmented by the mixture regression model, and price response function was calibrated for each segment. Based on the types of price response function, segments were allocated into one of two groups; the group using the level of price as the quality indicator or the group not using the level of price as that. Then, characteristics of both groups were compared in terms of product attributes and demographic variables. Data were co]looted from the sample of the 23o undergraduate and graduate students in Seoul. For the data analysis, mixture regression model, conjoint analysis, and t-test were used. As a result, jeans market was divided into 5 segments. Segment 1,2,3 were allocated into the group not using the level of price as the quality indicator while segment 4,5 were done into the other group. Significant differences existed between two groups in product attributes, not in demographic variables. Mixture model and conjoint analysis were proved to be an effective set of tools in market segmentation.

Model-based inverse regression for mixture data

  • Choi, Changhwan;Park, Chongsun
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.1
    • /
    • pp.97-113
    • /
    • 2017
  • This paper proposes a method for sufficient dimension reduction (SDR) of mixture data. We consider mixture data containing more than one component that have distinct central subspaces. We adopt an approach of a model-based sliced inverse regression (MSIR) to the mixture data in a simple and intuitive manner. We employed mixture probabilistic principal component analysis (MPPCA) to estimate each central subspaces and cluster the data points. The results from simulation studies and a real data set show that our method is satisfactory to catch appropriate central spaces and is also robust regardless of the number of slices chosen. Discussions about root selection, estimation accuracy, and classification with initial value issues of MPPCA and its related simulation results are also provided.

OPTIMAL RESTRICTIONS ON REGRESSION PARAMETERS FOR LINEAR MIXTURE MODEL

  • Park, Sung-Hyun;Ahn, Jung-Yeon
    • Proceedings of the Korean Society for Quality Management Conference
    • /
    • 1998.11a
    • /
    • pp.239-250
    • /
    • 1998
  • A method of finding optimal linear restriction on regression parameters in linear model for mixture experiments in the sense of minimizing integrated mean squared error is studied. We use the formulation of optimal restrictions on regression parameters for estimating responses proposed by Park(1981) by transforming mixture components to mathematically independent variables.

  • PDF

Joint penalization of components and predictors in mixture of regressions (혼합회귀모형에서 콤포넌트 및 설명변수에 대한 벌점함수의 적용)

  • Park, Chongsun;Mo, Eun Bi
    • The Korean Journal of Applied Statistics
    • /
    • v.32 no.2
    • /
    • pp.199-211
    • /
    • 2019
  • This paper is concerned with issues in the finite mixture of regression modeling as well as the simultaneous selection of the number of mixing components and relevant predictors. We propose a penalized likelihood method for both mixture components and regression coefficients that enable the simultaneous identification of significant variables and the determination of important mixture components in mixture of regression models. To avoid over-fitting and bias problems, we applied smoothly clipped absolute deviation (SCAD) penalties on the logarithm of component probabilities suggested by Huang et al. (Statistical Sinica, 27, 147-169, 2013) as well as several well-known penalty functions for coefficients in regression models. Simulation studies reveal that our method is satisfactory with well-known penalties such as SCAD, MCP, and adaptive lasso.

The EM algorithm for mixture regression with missing covariates (결측 공변량을 갖는 혼합회귀모형에서의 EM 알고리즘)

  • Kim, Hyungmin;Ham, Geonhee;Seo, Byungtae
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.7
    • /
    • pp.1347-1359
    • /
    • 2016
  • Finite mixtures of regression models provide an effective tool to explore a hidden functional relationship between a response variable and covariates. However, it is common in practice that data are not fully observed due to several reasons. In this paper, we derived an expectation-maximization (EM) algorithm to obtain the maximum likelihood estimator when some covariates are missing at random in the finite mixture of regression models. We conduct some simulation studies and we also provide some real data examples to show the validity of the derived EM algorithm.

The Analysis of the Number of Donations Based on a Mixture of Poisson Regression Model (포아송 분포의 혼합모형을 이용한 기부 횟수 자료 분석)

  • Kim In-Young;Park Su-Bum;Kim Byung-Soo;Park Tae-Kyu
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.1
    • /
    • pp.1-12
    • /
    • 2006
  • The aim of this study is to analyse a survey data on the number of charitable donations using a mixture of two Poisson regression models. The survey was conducted in 2002 by Volunteer 21, an nonprofit organization, based on Koreans, who were older than 20. The mixture of two Poisson distributions is used to model the number of donations based on the empirical distribution of the data. The mixture of two Poisson distributions implies the whole population is subdivided into two groups, one with lesser number of donations and the other with larger number of donations. We fit the mixture of Poisson regression models on the number of donations to identify significant covariates. The expectation-maximization algorithm is employed to estimate the parameters. We computed 95% bootstrap confidence interval based on bias-corrected and accelerated method and used then for selecting significant explanatory variables. As a result, the income variable with four categories and the volunteering variable (1: experience of volunteering, 0: otherwise) turned out to be significant with the positive regression coefficients both in the lesser and the larger donation groups. However, the regression coefficients in the lesser donation group were larger than those in larger donation group.

A Bayesian Method for Narrowing the Scope fo Variable Selection in Binary Response t-Link Regression

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • v.29 no.4
    • /
    • pp.407-422
    • /
    • 2000
  • This article is concerned with the selecting predictor variables to be included in building a class of binary response t-link regression models where both probit and logistic regression models can e approximately taken as members of the class. It is based on a modification of the stochastic search variable selection method(SSVS), intended to propose and develop a Bayesian procedure that used probabilistic considerations for selecting promising subsets of predictor variables. The procedure reformulates the binary response t-link regression setup in a hierarchical truncated normal mixture model by introducing a set of hyperparameters that will be used to identify subset choices. In this setup, the most promising subset of predictors can be identified as that with highest posterior probability in the marginal posterior distribution of the hyperparameters. To highlight the merit of the procedure, an illustrative numerical example is given.

  • PDF