• 제목/요약/키워드: Mixture regression model

검색결과 108건 처리시간 0.023초

Analysis of Variables Affecting on Customer Loyalty by Market Segments for the Korean Open Air Market Using Mixture Regression Model (Mixture Regression Model을 이용한 재래시장의 세분집단별 고객충성도에 미치는 영향 변수 분석)

  • Kim, Jong-Kook;Park, Youn-Jae;Park, Ju-Young;Choi, Ja-Young
    • Journal of Distribution Research
    • /
    • 제12권4호
    • /
    • pp.1-25
    • /
    • 2007
  • The purpose of this study is to provide the strategic implication of the Korean open air market by examining the factors affecting customer loyalty for various market segments as their competitive environment becomes more turbulent. We have undertaken empirical research that uses the methodology of a mixture regression modeling, as a way to ascertain the determinants of customer loyalty toward the Korean open air market, which should form the base of strategy for each segment. We construct a mixture regression model which uses perceived the Korean open air market value dimensions as explanatory variables, an income as a covariate variable, and a customer loyalty as a dependent variable. The analysis of results show that customers are statistically divided into four segments: 'Accessibility'(33.7%), 'Price'(29.7%), 'Shopping environment,'(22.0%), and 'Merchandising,'(14.5%) groups. The findings also showed that parameter estimates are different for each group, which indicates that the sensitivity to changes in the Korean traditional market perceived value and the income variable affecting customer loyalty vary among segments.

  • PDF

A comparison of models for the quantal response on tumor incidence data in mixture experiments (계수적 반응을 갖는 종양 억제 혼합물 실험에서 모형 비교)

  • Kim, Jung Il
    • Journal of the Korean Data and Information Science Society
    • /
    • 제28권5호
    • /
    • pp.1021-1026
    • /
    • 2017
  • Mixture experiments are commonly encountered in many fields including food, chemical and pharmaceutical industries. In mixture experiments, measured response depends on the proportions of the components present in the mixture and not on the amount of the mixture. Statistical analysis of the data from mixture experiments has mainly focused on a continuous response variable. In the example of quantal response data in mixture experiments, however, the tumor incidence data have been analyzed in Chen et al. (1996) to study the effects of 3 dietary components on the expression of mammary gland tumor. In this paper, we compared the logistic regression models with linear predictors such as second degree Scheffe polynomial model, Becker model and Akay model in terms of classification accuracy.

Practical designs for mixture component-process experiments (실용적인 혼합물 성분 공정변수 실험설계)

  • Lim, Yong-B.
    • Journal of Korean Society for Quality Management
    • /
    • 제39권3호
    • /
    • pp.400-411
    • /
    • 2011
  • Process variables are factors in an experiment that are not mixture components but could affect the blending properties of the mixture ingredients. For example, the effectiveness of an etching solution which is measured as an etch rate is not only a function of the proportions of the three acids that are combined to form the mixture, but also depends on the temperature of the solution and the agitation rate. Efficient designs for the mixture components-process variables experiments depend on the mixture components-process variables model which is called a combined model. We often use the product model between the canonical polynomial model for the mixture and process variables model as a combined model. In this paper we propose three starting models for the mixture components-process variables experiments. One of the starting model we are considering is the model which includes product terms up to cubic order interactions between mixture effects and the linear & pure quadratic effect of the process variables from the product model. In this paper, we propose a method for finding robust designs and practical designs with respect to D-, G-, and I-optimality for the various starting combined models and then, we find practically efficient and robust designs for estimating the regression coefficients for those models. We find the prediction capability of those recommended designs in the case of three components and three process variables to be good by checking FDS(Fraction of Design Space) plots.

Influence of Merchandise Composition on the Competitiveness for the Korean Open Air Market (재래시장의 상품구성이 재래시장 활성화에 미치는 영향)

  • Park, Ju-Young
    • Proceedings of the Korean DIstribution Association Conference
    • /
    • 한국유통학회 2007년도 추계학술대회 발표논문집
    • /
    • pp.155-178
    • /
    • 2007
  • The purpose of this study is to provide the strategic implication of the Korean open air market by examining the factors affecting their competitiveness. I have undertaken empirical research that uses the methodology of a mixture regression modeling, as a way to ascertain the determinants of competitiveness for the Korean open air market. I construct a mixture regression model which uses the proportions of merchandise categories as explanatory variables and the number of visitors as a dependent variable. The analysis of results show that competitive and non-competitive markets have different proportions of merchandise categories. The finding shows that stock farm products and home appliances are major influencers on the number of visitors in neighborhood markets. The finding also presents that stock farm products and processed foods are major influencers on the number of visitors in small & medium-sized city markets.

  • PDF

Bayesian Analysis of a Zero-inflated Poisson Regression Model: An Application to Korean Oral Hygienic Data (영과잉 포아송 회귀모형에 대한 베이지안 추론: 구강위생 자료에의 적용)

  • Lim, Ah-Kyoung;Oh, Man-Suk
    • The Korean Journal of Applied Statistics
    • /
    • 제19권3호
    • /
    • pp.505-519
    • /
    • 2006
  • We consider zero-inflated count data, which is discrete count data but has too many zeroes compared to the Poisson distribution. Zero-inflated data can be found in various areas. Despite its increasing importance in practice, appropriate statistical inference on zero-inflated data is limited. Classical inference based on a large number theory does not fit unless the sample size is very large. And regular Poisson model shows lack of St due to many zeroes. To handle the difficulties, a mixture of distributions are considered for the zero-inflated data. Specifically, a mixture of a point mass at zero and a Poisson distribution is employed for the data. In addition, when there exist meaningful covariates selected to the response variable, loglinear link is used between the mean of the response and the covariates in the Poisson distribution part. We propose a Bayesian inference for the zero-inflated Poisson regression model by using a Markov Chain Monte Carlo method. We applied the proposed method to a Korean oral hygienic data and compared the inference results with other models. We found that the proposed method is superior in that it gives small parameter estimation error and more accurate predictions.

Modelling and packed bed column studies on adsorptive removal of phosphate from aqueous solutions by a mixture of ground burnt patties and red soil

  • Rout, Prangya R.;Dash, Rajesh R.;Bhunia, Puspendu
    • Advances in environmental research
    • /
    • 제3권3호
    • /
    • pp.231-251
    • /
    • 2014
  • The present study examines the phosphate adsorption potential and behavior of mixture of Ground Burnt Patties (GBP), a solid waste generated from cooking fuel used in earthen stoves and Red Soil (RS), a natural substance in fixed bed column mode operation. The characterization of adsorbent was done by Proton Induced X-ray Emission (PIXE), and Proton Induced ${\gamma}$-ray Emission (PIGE) methods. The FTIR spectroscopy of spent adsorbent reveals the presence of absorbance peak at $1127cm^{-1}$ which appears due to P = O stretching, thus confirming phosphate adsorption. The effects of bed height (10, 15 and 20 cm), flow rate (2.5, 5 and 7.5 mL/min) and initial phosphate concentration (5 and 15 mg/L) on breakthrough curves were explored. Both the breakthrough and exhaustion time increased with increase in bed depth, decrease in flow rate and influent concentration. Thomas model, Yoon-Nelson model and Modified Dose Response model were used to fit the column adsorption data using nonlinear regression analysis while Bed Depth Service Time model followed linear regression analysis under different experimental condition to evaluate model parameters that are useful in scale up of the process. The values of correlation coefficient ($R^2$) and the Sum of Square Error (SSE) revealed the Modified Dose Response model as the best fitted model to the experimental data. The adsorbent mixture responded effectively to the desorption and reusability experiment. The results of this finding advocated that mixture of GBP and RS can be used as a low cost, highly efficient adsorbent for phosphate removal from aqueous solution.

The Analysis of the Number of Donations Based on a Mixture of Poisson Regression Model (포아송 분포의 혼합모형을 이용한 기부 횟수 자료 분석)

  • Kim In-Young;Park Su-Bum;Kim Byung-Soo;Park Tae-Kyu
    • The Korean Journal of Applied Statistics
    • /
    • 제19권1호
    • /
    • pp.1-12
    • /
    • 2006
  • The aim of this study is to analyse a survey data on the number of charitable donations using a mixture of two Poisson regression models. The survey was conducted in 2002 by Volunteer 21, an nonprofit organization, based on Koreans, who were older than 20. The mixture of two Poisson distributions is used to model the number of donations based on the empirical distribution of the data. The mixture of two Poisson distributions implies the whole population is subdivided into two groups, one with lesser number of donations and the other with larger number of donations. We fit the mixture of Poisson regression models on the number of donations to identify significant covariates. The expectation-maximization algorithm is employed to estimate the parameters. We computed 95% bootstrap confidence interval based on bias-corrected and accelerated method and used then for selecting significant explanatory variables. As a result, the income variable with four categories and the volunteering variable (1: experience of volunteering, 0: otherwise) turned out to be significant with the positive regression coefficients both in the lesser and the larger donation groups. However, the regression coefficients in the lesser donation group were larger than those in larger donation group.

Use of Factor Analyzer Normal Mixture Model with Mean Pattern Modeling on Clustering Genes

  • Kim Seung-Gu
    • Communications for Statistical Applications and Methods
    • /
    • 제13권1호
    • /
    • pp.113-123
    • /
    • 2006
  • Normal mixture model(NMM) frequently used to cluster genes on microarray gene expression data. In this paper some of component means of NMM are modelled by a linear regression model so that its design matrix presents the pattern between sample classes in microarray matrix. This modelling for the component means by given design matrices certainly has an advantage that we can lead the clusters that are previously designed. However, it suffers from 'overfitting' problem because in practice genes often are highly dimensional. This problem also arises when the NMM restricted by the linear model for component-means is fitted. To cope with this problem, in this paper, the use of the factor analyzer NMM restricted by linear model is proposed to cluster genes. Also several design matrices which are useful for clustering genes are provided.

The Reanalysis of the Donation Data Using the Zero-Inflated Possion Regression (0이 팽창된 포아송 회귀모형을 이용한 기부회수 자료의 재분석)

  • Kim, In-Young;Park, Tae-Kyu;Kim, Byung-Soo
    • The Korean Journal of Applied Statistics
    • /
    • 제22권4호
    • /
    • pp.819-827
    • /
    • 2009
  • Kim et al. (2006) analyzed the donation data surveyed by Voluneteer 21 in year 2002 at South Korea using a Poisson regression based on the mixture of two Poissons and detected significant variables for affecting the number of donations. However, noting the large deviation between the predicted and the actual frequencies of zero, we developed in this note a Poisson regression model based on a distribution in which zero inflated Poisson was added to the mixture of two Poissons. Thus the population distribution is now a mixture of three Poissons in which one component is concentrated on zero mass. We used the EM algorithm for estimating the regression parameters and detected the same variables with Kim et al's for significantly affecting the response. However, we could estimate the proportion of the fixed zero group to be 0.201, which was the characteristic of this model. We also noted that among two significant variables, the income and the volunteer experience(yes, no), the second variable could be utilized as a strategric variable for promoting the donation.

Tree Size Distribution Modelling: Moving from Complexity to Finite Mixture

  • Ogana, Friday Nwabueze;Chukwu, Onyekachi;Ajayi, Samuel
    • Journal of Forest and Environmental Science
    • /
    • 제36권1호
    • /
    • pp.7-16
    • /
    • 2020
  • Tree size distribution modelling is an integral part of forest management. Most distribution yield systems rely on some flexible probability models. In this study, a simple finite mixture of two components two-parameter Weibull distribution was compared with complex four-parameter distributions in terms of their fitness to predict tree size distribution of teak (Tectona grandis Linn f) plantations. Also, a system of equation was developed using Seemingly Unrelated Regression wherein the size distributions of the stand were predicted. Generalized beta, Johnson's SB, Logit-Logistic and generalized Weibull distributions were the four-parameter distributions considered. The Kolmogorov-Smirnov test and negative log-likelihood value were used to assess the distributions. The results show that the simple finite mixture outperformed the four-parameter distributions especially in stands that are bimodal and heavily skewed. Twelve models were developed in the system of equation-one for predicting mean diameter, seven for predicting percentiles and four for predicting the parameters of the finite mixture distribution. Predictions from the system of equation are reasonable and compare well with observed distributions of the stand. This simplified mixture would allow for wider application in distribution modelling and can also be integrated as component model in stand density management diagram.