DOI QR코드

DOI QR Code

내재된 인자회귀모형의 베이지안 분석법

Bayesian analysis of latent factor regression model

  • 경민정 (덕성여자대학교 정보통계학과)
  • Kyung, Minjung (Department of Statistics, Duksung Women's University)
  • 투고 : 2020.02.04
  • 심사 : 2020.06.17
  • 발행 : 2020.08.31

초록

선형모형에서 두개 이상의 설명변수들 사이에 존재하는 다중공선성 문제를 변수들 간에 내재되어 있는 공통의 구조인 인자를 구성하고, 인자들을 회귀변수로 사용하여 해결하는 인자회귀모형에 대하여 논의한다. 무한개로 가정 가능한 내재된 인자 중 유의미한 인자적재행렬을 구성하기 위하여 벌점모수의 값이 큰 LASSO 사전분포를 적용하는 베이지안 추정법을 사용한다. 결정된 인자적재행렬과 다른 모수들의 추정값을 각 설명변수의 선형모수로 역변환 하여, 새로운 관측값에 대한 예측 모형으로도 사용한다. 제안한 방법을 제품 서비스 관리 자료에 적용하여 정해진 인자의 개수에 대한 인자가 일반적인 공통인자회귀모형과 동일한 결과를 나타냄을 확인하였고, 일반적인 공통인자회귀모형과 비교를 위해 계산한 평균 제곱 오차값이 더 작다는 것을 알 수 있었다.

We discuss latent factor regression when constructing a common structure inherent among explanatory variables to solve multicollinearity and use them as regressors to construct a linear model of a response variable. Bayesian estimation with LASSO prior of a large penalty parameter to construct a significant factor loading matrix of intrinsic interests among infinite latent structures. The estimated factor loading matrix with estimated other parameters can be inversely transformed into linear parameters of each explanatory variable and used as prediction models for new observations. We apply the proposed method to Product Service Management data of HBAT and observe that the proposed method constructs the same factors of general common factor analysis for the fixed number of factors. The calculated MSE of predicted values of Bayesian latent factor regression model is also smaller than the common factor regression model.

키워드

참고문헌

  1. Aguilar, O. and West, M. (2000). Bayesian dynamic factor models and portfolio allocation, Journal of Business and Economic Statistics, 18, 338-357. https://doi.org/10.2307/1392266
  2. Bartlett, M. S. (1951). The effect of standardization on a 2 approximation in factor analysis, Biometrika, 38, 337-344 https://doi.org/10.1093/biomet/38.3-4.337
  3. Bhattacharya, A. and Dunson, D. B. (2011). Sparse Bayesian infinite factor models, Biometrika, 98, 291-306. https://doi.org/10.1093/biomet/asr013
  4. Hair, J. F., Black, W. C., Babin, B. J., and Anderson, R. E. (2014). Multivariate Data Analysis : Pearson New International Edition (7th ed), Pearson.
  5. Hoerl, A. E. and Kennard, R. W. (1950). Ridge regression: Biased estimation for nonorthogonal problems, Technometrics, 12, 55-67. https://doi.org/10.1080/00401706.1970.10488634
  6. Hotelling, H. (1957). The relations of the newer multivariate statistical methods to factor analysis, British Journal of Statistical Psychology, 10, 69-79. https://doi.org/10.1111/j.2044-8317.1957.tb00179.x
  7. Jeffers, J. N. R. (1967). Two case studies in the application of principal component analysis, Applied Statistics, 16, 225-236. https://doi.org/10.2307/2985919
  8. Kendall, M. G. (1957). A Course in Multivariate Analysis, Griffin, London.
  9. Kyung, M., Gill, J., Ghosh, M., and Casella, G. (2010). Penalized regression, standard errors, and Bayesian lassos, Bayesian Analysis, 5, 369-412. https://doi.org/10.1214/10-BA607
  10. Park, T. and Casella, G. (2008). The Bayesian lasso, Journal of the American Statistical Association, 103, 681-686. https://doi.org/10.1198/016214508000000337
  11. Raftery, A. E. (1995). Bayesian model selection in social research, Sociological Methodology, 25, 111-163. https://doi.org/10.2307/271063
  12. Tibshirani, R. (1996). Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society. Series B, 58, 267-288.
  13. West, M. (2003). Bayesian factor regression models in the "Large p, Small n" paradigm, in: J.M. Bernardo, M. Bayarri, J. Berger, A. Dawid, D. Heckerman, A. Smith, M. West (Eds.), Bayesian Statistics, Vol. 7, OxfordUniversity Press, Oxford, 723-732