DOI QR코드

DOI QR Code

A Study on the Use of Cluster Analysis for Multivariate and Multipurpose Stratification

군집분석을 이용한 다목적 조사의 층화에 관한 연구

  • Park, Jin-Woo (Department of Applied Statistics, The University of Suwon) ;
  • Yun, Seok-Hoon (Department of Applied Statistics, The University of Suwon) ;
  • Kim, Jin-Heum (Department of Applied Statistics, The University of Suwon) ;
  • Jeong, Hyeong-Chul (Department of Applied Statistics, The University of Suwon)
  • 박진우 (수원대학교 통계정보학과) ;
  • 윤석훈 (수원대학교 통계정보학과) ;
  • 김진흠 (수원대학교 통계정보학과) ;
  • 정형철 (수원대학교 통계정보학과)
  • Published : 2007.07.31

Abstract

This paper considers several stratification strategies for multivariate and multipurpose survey with several quantitative stratification variables. We propose three methods of stratification based on, respectively, the method of cumulative frequency square root which is the most popular one in univariate stratification, cluster analysis, and factor analysis followed by cluster analysis. We then compare the efficiency of those methods using the Dong-Eup-Myun data of the holding numbers of farming machines, extracted from the 2001 Agricultural Census. It turned out that the method based on cluster analysis with factor analysis would be a relatively satisfactory strategy.

본 연구는 여러 가지의 양적변수들을 조사하는 다목적, 다변량조사 표본설계에서 층화 문제를 다룬다. 다변량 층화변수를 사용하는 층화 방법으로 일변량 층화변수가 있을 때 사용하는 누적도수제곱근법을 독립적으로 여러 층화변수에 적용하는 방법, 군집분석을 이용하는 방법, 인자분석과 군집분석을 함께 이용하는 방법 등 세 가지 방법을 제시한다. 한편, 2001년 농업총조사 자료에 나타난 동 읍 면의 농기계별 보유대수 정보를 층화변수로 활용하여 세 가지 층화 방안의 효율을 실증적으로 비교하게 되는데 그 결과 인자분석과 군집분석을 함께 고려한 층화방법이 비교적 효율적인 것으로 나타났다.

Keywords

References

  1. 박진우 (2000). A sampling design of the agricultural machine estimated sales survey, The Korean Communications in Statistics, 8, 375-382
  2. 통계청 (2001). <2000 농어업총조사>, CD
  3. 한국통계학회 (2006). <농기계 모니터링 시스템 표본설계>, 학술연구용역보고서
  4. Bryant, E. C., Hartley, H. O. and Jessen, R. J. (1960). Design and estimation in two-way stratification, Journal of the American Statistical Association, 55, 105-124 https://doi.org/10.2307/2282182
  5. Dalenius, T. and Hodges, J. L. (1959). Minimum variance stratification, Journal of the American Statistical Association, 54, 88-101 https://doi.org/10.2307/2282141
  6. Eckman, G. (1959). An Approximation useful in univariate stratification, The Annals of Mathematical Statistics, 30, 219-229 https://doi.org/10.1214/aoms/1177706377
  7. Golder, P. A. and Yeomans, K. A. (1973). The use of cluster analysis for stratification, Applied Statistics, 22, 213-219 https://doi.org/10.2307/2346922
  8. Gunning, P. and Horgan, J. M. (2004). A new algorithm for the construction of stratum boundaries in skewed populations, Survey Methodology, 30, 159-166
  9. Hedlin, D. (2000). A procedure for stratification by an extended Eckman rule, Journal of Official Statistics, 16, 15-29
  10. Jarque, C. M. (1981). A solution to the problem of optimum stratification in multivariate sampling, Applied Statistics, 30, 163-169 https://doi.org/10.2307/2346387
  11. Lavallee, P. and Hidiroglou, M. (1988). On the stratification of skewed populations, Survey Methodology, 14, 33-43
  12. Lu, W. and Sitter, R. R. (2002). Multi-way stratification by linear programming made practical, Survey Methodology, 28, 199-207
  13. Rivest, L. P. (2002). A generalization of the Lavallee-Hidiroglou algorithm for stratification in business surveys, Survey Methodology, 28, 191-198
  14. Sitter, R. R. and Skinner, C. J. (1994). Multi-way stratification by linear programming, Survey Methodology, 20, 65-73
  15. Ward, J. H. (1963). Hierachical grouping to optimize an objective function, Journal of the American Statistical Association, 58, 236-244 https://doi.org/10.2307/2282967