• Title/Abstract/Keywords: Complex sample design

160 search results (processing time: 0.019 s)

Comparative Analysis of Unweighted Sample Design and Complex Sample Design Related to the Exploration of Potential Risk Factors of Dysphonia

  • 변해원
    • 한국산학기술학회논문지
    • /
    • Vol. 13, No. 5
    • /
    • pp.2251-2258
    • /
    • 2012
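  • This study compared three approaches to exploring potential risk factors: an unweighted analysis that treats the data as a simple random sample (unweighted sample design), a single-sample analysis with frequency weights (frequency weighted sample design), and a complex sample analysis that applies the weights together with stratification (complex sample design). It examined whether the results of the three approaches differ statistically. The data source was the otorhinolaryngological examination data of the 2009 Korea National Health and Nutrition Examination Survey. The Pearson chi-square test and the Rao-Scott chi-square test were used for the analysis. In the single-sample analysis with frequency weights only, every variable was over-predicted as a significant risk factor, and the unweighted analysis and the complex sample analysis differed in both significance levels and results. When national statistical data are used, complex sample analysis that applies the weights together with the stratification and cluster variables is needed for the results to be representative of the whole population. Moreover, applying frequency weights alone carries a high risk of over-interpreting the results, so particular caution is required.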
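The over-prediction reported for frequency weights can be illustrated with a small sketch. It assumes a hypothetical 2x2 table and a hypothetical constant weight of 100 (neither comes from the paper), and shows only the mechanical effect: multiplying every cell count by a weight multiplies the Pearson chi-square statistic by the same factor, while the underlying sample is unchanged.

```python
def pearson_chi_square(table):
    """Pearson chi-square statistic for a two-way table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    return stat


# Hypothetical observed counts (risk factor present/absent by outcome yes/no).
sample = [[10, 20], [20, 10]]

# Hypothetical frequency weight: each respondent "stands for" 100 people.
weight = 100
weighted = [[count * weight for count in row] for row in sample]

print(pearson_chi_square(sample))    # statistic from the actual sample size
print(pearson_chi_square(weighted))  # same table treated as 100x more observations
```

A design-based test such as the Rao-Scott test avoids this inflation because it uses the weights for estimation but bases the variance on the actual sample design.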

Complex sample design effects and inference for Korea National Health and Nutrition Examination Survey data

  • 정진은
    • Journal of Nutrition and Health
    • /
    • Vol. 45, No. 6
    • /
    • pp.600-612
    • /
    • 2012
  • Nutritional researchers worldwide use large-scale sample survey methods to study nutritional health epidemiology and service utilization in general, non-clinical populations. This article reviews important statistical methods and software for descriptive and multivariate analysis of data collected in sample surveys such as national health and nutrition examination surveys. A comparative analysis of data from the Korea National Health and Nutrition Examination Survey (KNHANES) illustrates analytical procedures and design effects for survey estimates of population statistics, model parameters, and test statistics. The article focuses on three points: approaches to analyzing complex sample survey data, the software tools available for these analyses, and the correct survey analysis methods needed to interpret survey data. Recent developments in software tools for complex sample survey data are covered, and empirical examples illustrate the impact of sample design effects on parameter estimates, test statistics, and significance probabilities (p-values) in univariate and multivariate analyses.

Effect of complex sample design on Pearson test statistic for homogeneity

  • 허순영;정영애
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 23, No. 4
    • /
    • pp.757-764
    • /
    • 2012
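  • Categorical survey data based on a complex sample design do not satisfy the conditions required for the usual Pearson chi-square test. Nevertheless, many survey studies apply complex sample designs yet still report conventional Pearson test results. Through an empirical analysis of homogeneity tests for categorical data from a complex sample design, this study compared the conventional Pearson test with the Wald test, which is an unbiased test, and with a Pearson test that uses proportion estimates reflecting the sample design. The conventional Pearson test produced much larger statistic values and severely smaller p-values than the tests that reflect the sample design. Among the tests that reflect the complex sample design, the comparison between the case where the variance of the estimator is known and the case where it is unknown was found to depend on the number of categories and on the mean and standard deviation of the eigenvalues of the design effect matrix.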

Empirical Analysis on Rao-Scott First Order Adjustment for Two Population Homogeneity test Based on Stratified Three-Stage Cluster Sampling with PPS

  • Heo, Sunyeong
    • 통합자연과학논문집
    • /
    • Vol. 7, No. 3
    • /
    • pp.208-213
    • /
    • 2014
  • Nationwide and large-scale sample surveys generally use complex sample designs. The traditional Pearson chi-square test is not appropriate for categorical data from complex samples. Rao and Scott suggested an adjustment to the Pearson chi-square test that uses the average of the eigenvalues of the design matrix of cell probabilities. This study compares the efficiency of the Rao-Scott first-order adjusted test with that of the Wald test for homogeneity between two populations, using data from the 2009 Gyeongnam regional education offices' customer satisfaction survey (2009 GREOCSS). The 2009 GREOCSS data were collected by stratified three-stage cluster sampling with probability proportional to size. The empirical results show that, for the 2009 GREOCSS data, the Rao-Scott adjusted test statistic, which uses only the variances of the cell probabilities, is very close to the Wald test statistic, which uses their full covariance matrix. However, caution is needed when using the Rao-Scott first-order adjusted statistic in place of the Wald test, because its efficiency decreases as the relative variance of the eigenvalues of the design matrix increases, especially when the number of degrees of freedom is small.
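The first-order adjustment described in this abstract can be sketched as follows. This is a minimal illustration, not the paper's code: it assumes the eigenvalues of the design matrix (or estimates of them) are already available, and the function names are hypothetical. The statistic is the Pearson statistic divided by the mean eigenvalue, and the relative variance of the eigenvalues is the diagnostic the abstract warns about.

```python
def rao_scott_first_order(pearson_stat, eigenvalues):
    """Divide the Pearson statistic by the mean design-matrix eigenvalue."""
    if not eigenvalues:
        raise ValueError("at least one eigenvalue is required")
    mean_eig = sum(eigenvalues) / len(eigenvalues)
    if mean_eig <= 0:
        raise ValueError("mean eigenvalue must be positive")
    return pearson_stat / mean_eig


def relative_variance(eigenvalues):
    """Squared coefficient of variation of the eigenvalues.

    A value near 0 means the eigenvalues are nearly equal, which is the
    situation in which the first-order adjustment works well.
    """
    d = len(eigenvalues)
    mean_eig = sum(eigenvalues) / d
    return sum((e - mean_eig) ** 2 for e in eigenvalues) / (d * mean_eig ** 2)


# Hypothetical inputs: a Pearson statistic of 12.0 and two eigenvalues.
adjusted = rao_scott_first_order(12.0, [1.5, 2.5])
spread = relative_variance([1.5, 2.5])
print(adjusted, spread)
```

The adjusted statistic is referred to a chi-square distribution with the usual degrees of freedom; when the relative variance is large, a second-order correction or the Wald test is the safer choice.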

Influencing factors of using Korean Medicine services - focusing on the 2017 Korean Medicine Utilization Survey

  • 임진웅;이기재
    • 대한한의학회지
    • /
    • Vol. 42, No. 1
    • /
    • pp.12-25
    • /
    • 2021
  • Objectives: The aim of this study was to investigate factors influencing the use of Korean medicine services (KMS) using the 2017 Korean Medicine Utilization Survey (KMUS). Methods: Demographic statistics of the survey were summarized, and factors influencing KMS experience and the intention to visit KMS were analyzed using a logistic regression model that accounts for the complex sample design. Influencing factors were specified based on Andersen's behavioral model of health care utilization and on factors associated with individual perceptions of KMS. For comparison, the survey data were also analyzed using an ordinary logistic regression model that ignores the complex sample design. Results: In the logistic regression analysis, sex, age, health condition, presence of chronic disease, degree of knowledge about Korean Medicine, and view of herbal medicine safety were statistically significant for both KMS experience and the intention to visit KMS. Marital status was statistically significant for KMS experience, while family income and view of the cost of KMS were statistically significant for the intention to visit KMS. Conclusion: Individual perceptions of KMS and enabling components should be considered when establishing KMS policies. In addition, future studies analyzing the KMUS need to take the complex sample design features of the survey into account to avoid statistically misleading results.

A study on design effect models for complex sample survey

  • 박인호
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 25, No. 3
    • /
    • pp.523-531
    • /
    • 2014
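  • Design effects are widely used to plan new sample designs and to evaluate the efficiency of the design components applied in existing sample surveys. In this study, the design effect model proposed by Gabler et al. (2006) was applied to the 2013 Food Consumption Behavior Survey (식품소비행태조사), which uses a stratified two-stage cluster sampling design. Based on the survey results, the usefulness and adequacy of the sample design model are discussed.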

Effect of Bias on the Pearson Chi-squared Test for Two Population Homogeneity Test

  • Heo, Sunyeong
    • 통합자연과학논문집
    • /
    • Vol. 5, No. 4
    • /
    • pp.241-245
    • /
    • 2012
  • Categorical data collected under a complex sample design are not suitable for the standard multinomial-based Pearson chi-squared test because the observations are not independent and identically distributed. This study investigates how bias in the point estimator of a population proportion and in its variance estimator affects the standard Pearson chi-squared test statistic when the sample is collected under a complex sampling scheme. The effect is examined for the test of homogeneity between two populations. The standard Pearson test statistic can be partitioned into two parts: the first is a weighted sum of ${\chi}^2_1$ variables with the eigenvalues of the design matrix as weights, and the second is an additional term arising from the biases of the point estimator and its variance estimator. The empirical analysis shows that even when the bias of the point estimator is small, the Pearson test statistic is greatly inflated because the variance of the point estimator is underestimated. As for the relation between the design-based variance estimator and its design matrix, the larger the average of the eigenvalues of the design matrix, the larger the share of the Pearson test statistic accounted for by the first component.

A sample survey design for service satisfaction evaluation of regional education offices

  • 허순영;장덕준
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 21, No. 4
    • /
    • pp.669-679
    • /
    • 2010
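  • The sample design for the user satisfaction survey of regional education offices was based on the sample size of the 2009 Gyeongnam regional education offices customer satisfaction survey in Gyeongsangnam-do and was tailored to the evaluation of city (si) and county (gun) level education offices. Unlike the district (gu) level education offices of large cities, provincial si and gun education offices vary widely in the number of students, the number of schools, and the number of students per class. Considering time and cost, the design keeps the overall sample size small while securing the minimum sample size needed to evaluate each si and gun education office. Gyeongsangnam-do has 10 si areas and 10 gun areas. To secure the minimum sample size needed to evaluate the gun education offices, which have relatively few students, the minimum sample required for each regional evaluation was allocated first and the remainder was allocated in proportion to the number of classes in each region. Sample schools were stratified by region and school establishment type and selected with probability proportional to the number of classes. Within the sample schools, the students to be surveyed were selected by two-stage cluster sampling. Weights were calculated to correct for the differing sampling rates across regions. The survey data are analyzed with the weights applied, using weighted means, weighted totals, and similar estimators, and variances can be estimated with the balanced repeated replication, jackknife, or linearization methods provided by statistical software.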
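The allocation rule in this abstract (a minimum sample for every region first, then the remainder in proportion to the number of classes) can be sketched as below. The function name, the largest-remainder rounding, and the numbers in the usage lines are assumptions made for illustration; the paper's actual sample sizes and rounding rule are not given in the abstract.

```python
def allocate_sample(total, minimum, class_counts):
    """Give each region `minimum` units, then split the rest by class counts.

    Fractional shares are rounded with the largest-remainder method
    (an assumption) so the allocations always sum to `total`.
    """
    regions = list(class_counts)
    remainder = total - minimum * len(regions)
    if remainder < 0:
        raise ValueError("total is too small to give every region the minimum")
    total_classes = sum(class_counts.values())
    if total_classes <= 0:
        raise ValueError("class counts must sum to a positive number")

    # Proportional share of the remainder for each region (may be fractional).
    quotas = {r: remainder * class_counts[r] / total_classes for r in regions}
    allocation = {r: minimum + int(quotas[r]) for r in regions}

    # Hand out the units lost to rounding, largest fractional part first.
    leftover = total - sum(allocation.values())
    by_fraction = sorted(regions, key=lambda r: quotas[r] - int(quotas[r]), reverse=True)
    for r in by_fraction[:leftover]:
        allocation[r] += 1
    return allocation


# Hypothetical example: 100 units, a minimum of 10 per region, two regions.
print(allocate_sample(100, 10, {"A": 30, "B": 10}))
```

Because small regions receive more than their proportional share, sampling rates differ across regions, which is why the abstract goes on to compute weights.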

Unbiased Balanced Half-Sample Variance Estimation in Stratified Two-stage Sampling

  • Kim, Kyu-Seong
    • Journal of the Korean Statistical Society
    • /
    • Vol. 27, No. 4
    • /
    • pp.459-469
    • /
    • 1998
  • The balanced half-sample (BHS) method is a simple variance estimation method for complex sampling designs. Because it is simple and flexible, it has been widely used in large-scale sample surveys. However, the usual BHS method overestimates the true variance under without-replacement sampling and two-stage cluster sampling. Focusing on this point, we propose an unbiased BHS variance estimator for stratified two-stage cluster sampling and describe how to implement it. Finally, a partially balanced half-sample design is explained as a tool for reducing the number of replications required by the proposed estimator.
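For orientation, the usual BHS estimator that this abstract takes as its starting point can be sketched as follows. This is the standard method, not the unbiased estimator the paper proposes. The sketch assumes two primary sampling units per stratum, a stratified mean as the estimator, and half-samples chosen from a Sylvester-type Hadamard matrix; all names and numbers are hypothetical.

```python
def sylvester_hadamard(order):
    """Return a Hadamard matrix whose size is the first power of 2 >= order."""
    h = [[1]]
    while len(h) < order:
        h = [row + row for row in h] + [row + [-x for x in row] for row in h]
    return h


def bhs_variance(stratum_weights, psu_pairs):
    """Usual BHS variance estimate of a stratified mean.

    stratum_weights: population share W_h of each stratum.
    psu_pairs: (y_h1, y_h2), the two PSU means observed in each stratum.
    """
    n_strata = len(psu_pairs)
    hadamard = sylvester_hadamard(n_strata + 1)

    # Full-sample estimate: weighted sum of the stratum means.
    full = sum(w * (a + b) / 2 for w, (a, b) in zip(stratum_weights, psu_pairs))

    total = 0.0
    for row in hadamard:
        # Column 0 is all ones, so strata use columns 1..n_strata.
        # +1 selects the first PSU of a stratum, -1 selects the second.
        replicate = sum(
            w * (pair[0] if row[h + 1] == 1 else pair[1])
            for h, (w, pair) in enumerate(zip(stratum_weights, psu_pairs))
        )
        total += (replicate - full) ** 2
    return total / len(hadamard)


# Hypothetical data: two equally sized strata, two PSU means in each.
print(bhs_variance([0.5, 0.5], [(10, 14), (20, 26)]))
```

The sketch contains no finite population correction and no within-PSU variance component, which is consistent with the abstract's remark that the usual method overestimates the variance under without-replacement and two-stage sampling.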


Test of Homogeneity Based on Complex Survey Data: Discussion Based on Power of Test

  • Heo, Sun-Yeong;Yi, Su-Cheol
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 16, No. 3
    • /
    • pp.609-620
    • /
    • 2005
  • In secondary analysis of categorical data, situations often arise in which the estimated cell variances are available but the full variance-covariance matrix is not. In such cases researchers are often inclined to use Pearson-type test statistics for homogeneity. However, for a complex sample the observed cell proportions do not follow a multinomial distribution, and the Pearson-type test statistic is generally not asymptotically chi-square distributed. This paper evaluates the power of the Wald test, the Pearson-type test, and the first-order corrected Pearson-type test for homogeneity. The resulting power curves indicate that as the misspecification effect increases, the inflation of the significance level and the loss of power of the Pearson-type test become more severe.
