• 제목/요약/키워드: Stratified multistage sample survey

검색결과 15건 처리시간 0.023초

반복조사에서 설계요소를 반영한 표본수 결정 (Sample size determination using design effect formula for repeated surveys)

  • 박인호;황현길
    • 응용통계연구
    • /
    • 제32권4호
    • /
    • pp.643-652
    • /
    • 2019
  • 본 연구에서는 반복조사의 표본재설계에서 설계요소를 반영한 표본수 결정 방법을 제안하였다. 제안된 방법은 다단추출과 층화다단추출 등에 적용할 수 있으며 시점간 모집단 구성 변화, 집락효과, 표본할당 등의 주된 설계요소가 갖는 표본오차에 대한 영향력을 구분하여 반영하므로 보다 전략적인 표본수 결정이 가능할 수 있다.

Variance estimation for distribution rate in stratified cluster sampling with missing values

  • Heo, Sunyeong
    • Journal of the Korean Data and Information Science Society
    • /
    • 제28권2호
    • /
    • pp.443-449
    • /
    • 2017
  • Estimation of population proportion like the distribution rate of LED TV and the prevalence of a disease are often estimated based on survey sample data. Population proportion is generally considered as a special form of population mean. In complex sampling like stratified multistage sampling with unequal probability sampling, the denominator of mean may be random variable and it is estimated like ratio estimator. In this research, we examined the estimation of distribution rate based on stratified multistage sampling, and determined some numerical outcomes using stratified random sample data with about 25% of missing observations. In the data used for this research, the survey weight was determined by deterministic way. So, the weights are not random variable, and the population distribution rate and its variance estimator can be estimated like population mean estimation. When the weights are not random variable, if one estimates the variance of proportion estimator using ratio method, then the variances may be inflated. Therefore, in estimating variance for population proportion, we need to examine the structure of data and survey design before making any decision for estimation methods.

다단추출 표본설계의 층효율성 연구 (Measuring stratification effects for multistage sampling)

  • 김태훈;이기재;박인호
    • 응용통계연구
    • /
    • 제36권4호
    • /
    • pp.337-347
    • /
    • 2023
  • 표본설계는 개체 혹은 집락을 층으로 나눈후 층별로 독립적으로 표본추출하는 층화추출을 종종 채택한다. 층화 전략은 크게 층구분과 표본할당으로 구성되는데 이는 조사연구에서 반복적으로 고려되는 중요한 주제이다. 조사연구에서는 층화다단추출 방식의 복합표본설계를 채택하고 있지만 층효과 혹은 층효율성과 관련하여서 표본론 교재들에서 주로 단순추출에 대해서 다루어지고 있다. 본 연구는 이단추출에 대한 기존 층효율성 측도를 살펴보며 설계효과모형을 적용한 추가적인 층효율성 측도들을 제안하였다. 제안된 측도들을 활용하여 제4기 국민환경기초조사의 고등학교 대상 표본설계의 층화전략에 대해 평가하였다.

Chi-squared Tests for Homogeneity based on Complex Sample Survey Data Subject to Misclassification Error

  • Heo, Sunyeong
    • Communications for Statistical Applications and Methods
    • /
    • 제9권3호
    • /
    • pp.853-864
    • /
    • 2002
  • In the analysis of categorical data subject to misclassification errors, the observed cell proportions are adjusted by a misclassification probabilities and estimates of variances are adjusted accordingly. In this case, it is important to determine the extent to which misclassification probabilities are homogeneous within a population. This paper considers methods to evaluate the power of chi-squared tests for homogeneity with complex survey data subject to misclassification errors. Two cases are considered: adjustment with homogeneous misclassification probabilities; adjustment with heterogeneous misclassification probabilities. To estimate misclassification probabilities, logistic regression method is considered.

Measurement Error Variance Estimation Based on Complex Survey Data with Subsample Re-Measurements

  • Heo, Sunyeong;Eltinge, John L.
    • Communications for Statistical Applications and Methods
    • /
    • 제10권2호
    • /
    • pp.553-566
    • /
    • 2003
  • In many cases, the measurement error variances may be functions of the unknown true values or related covariates. This paper considers design-based estimators of the parameters of these variance functions based on the within-unit sample variances. This paper devotes to: (1) define an error scale factor $\delta$; (2) develop estimators of the parameters of the linear measurement error variance function of the true values under large-sample and small-error conditions; (3) use propensity methods to adjust survey weights to account for possible selection effects at the replicate level. The proposed methods are applied to medical examination data from the U.S. Third National Health and Nutrition Examination Survey (NHANES III).

설계효과모형을 통한 설계요소의 유용성 이해 (Understanding Complex Design Features via Design Effect Models)

  • 박인호
    • 응용통계연구
    • /
    • 제28권6호
    • /
    • pp.1217-1225
    • /
    • 2015
  • 조사자료분석에 있어서 표본추정량에 대해 설계요소가 갖는 효율성은 단순확률추출과 비교한 복잡표본설계의 의한 표본추출이 주는 분산의 상대적 크기인 설계효과를 통해 평가할 수 있다. 설계효과의 유용성은 복잡설계요소의 함수형태로 표현될 수 있을때 극대화될 수 있다. 본 연구에서는 층화다단추출의 표본설계에서 적용될 수 있는 설계효과모형을 제시하였다. 제시된 설계효과모형은 기존 다단추출을 위한 Gabler 등 (1999, 2006)의 모형을 일반화한 것으로 층구조, 표본할당, 집락추출 및 불균등가중치 등의 설계요소들이 정도수준에 갖는 영향력을 함수식으로 명확히 나타내주고 있다. 이를 활용하면 사전에 기술된 추정정도를 얻기 위해 설정한 표본크기가 줄 수 있는 설계효과를 예측하는데 활용할 수 있다. 또한 사후적으로 표본설계의 개별 설계요소들이 표본추정량에 대해 갖는 효율성을 평가하는데 활용될 수 있다.

Power Analysis for Tests Adjusted for Measurement Error

  • 허순영
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 한국데이터정보과학회 2003년도 춘계학술대회
    • /
    • pp.1-14
    • /
    • 2003
  • In man cases, the measurement error variances may be functions of the unknown true values or related covariate. In some cases, the measurement error variances increase in proportion to the value of predictor. This paper develops estimators of the parameters of a linear measurement error variance function under stratified multistage random sampling design and additional conditions. Also, this paper evaluates and compares the power of an asymptotically unbiased test with that of an asymptotically biased test. The proposed method are applied to blood sample measurements from the U.S. Third National Health and Nutrition Examination Survey(NHANES III)

  • PDF

청소년 건강증진교육을 위한 비만여부에 따른 당뇨병 관련 건강행태 (Health Behavior Factors Related Type 2 Diabetes by Obesity for Health Promotion in Adolescents)

  • 백경원;전기홍
    • 한국학교보건학회지
    • /
    • 제21권2호
    • /
    • pp.61-73
    • /
    • 2008
  • Purpose: Several health behavior factors affect the incidence of type 2 diabetes. Especially, obesity, which causes insulin resistance, is the most important determinant of diabetes. Therefore, we expect the risk factors associated with insulin resistance and type 2 diabetes are affected by obesity and, additionally, the related factors with diabetes caused by obesity can be controlled. Methods: This study used data collected from the 2001 Korea National Health and Nutrition Examination Survey (KNHANES). A stratified multistage probability sampling method was applied and the final sample included 5,500 subjects over 30 years old who had completed necessary health examinations and health behaviors survey. Results: The risk factors associated with type 2 diabetes are affected by obesity. According to logistic regression model stratified by body mass index (BMI) and sex, abdominal obesity and age were the significant risk factors of diabetes regardless of sex and BMI. However, drinking, smoking, total energy consumption, and protein consumption were risk factors for women with normal BMI, while carbohydrate consumption was a risk factor for man with normal BMI. Sleeping hours affected diabetes for women with obesity and fiber consumption was a risk factor for both women and men with obesity. In addition, statistically the family history of diabetes was a significant risk factor only in the group with normal weight, not in the group with obesity. Conclusion: The study results will provide information for implementing a regional initiative of type 2 diabetes prevention by BMI.

이중 추출 자료를 이용한 측정오차분산의 추정 (Measurement Error Variance Estimation Based on Subsample Re-measurements)

  • 허순영
    • 한국조사연구학회:학술대회논문집
    • /
    • 한국조사연구학회 2003년도 춘계학술발표대회
    • /
    • pp.34-41
    • /
    • 2003
  • 많은 경우, 측정오차분산은 알려지지 않은 참값 또는 참값과 연관된 공변수들의 함수로 표현될 수 있다 이 논문은 단위 당 반복측정에 기초한 단위 내 표본분산을 이용한 선형측정오차분산의 추정에 관한 연구이다 이 논문은 다음의 내용을 포함한다: (1) 측정오차의 크기를 나타내는 상수 $\delta$의 추정; (2) 유한모집단으로부터의 복합표본, 작은 측정오차라는 조건하에 선형측정오차분산의 추정; (3) 부표본에 포함될 확률을 설명하기 위한 성향틴헝 추정 미국의 제3차 건강영양조사자료를 사용하여 이상의 결과들을 이용한 경험적 분석을 실행하였다.

  • PDF

한국인의 혈중 망간농도와 공기중 망간농도의 관련성 (Associations between Airborne Manganese and Blood Manganese in the Korean General Population according to KNHANES 2008-2009)

  • 정경식;이종대;김용배
    • 한국환경과학회지
    • /
    • 제22권12호
    • /
    • pp.1589-1598
    • /
    • 2013
  • The objective of this study was to evaluate associations between airborne manganese and blood manganese in a general population of South Korean adults. The concentrations of airborne manganese in total suspended particulate (TSP) were calculated from data obtained from ambient air-monitoring stations (AAMSs) located in South Korea. Blood manganese data obtained Korean National Health and Nutrition Examination Survey (KNHANES) using a rolling sampling design involving a complex, stratified, multistage, probability cluster survey of a representative sample of the non-institutionalized civilian population of South Korea. Airborne manganese geometric means was 46.10 $ng/m^3$, blood manganese geometric means were 1.19 ${\mu}g/d{\ell}$ for male and 1.40 ${\mu}g/d{\ell}$ for female. In multiple linear regression analysis of log transformed blood manganeseas a continuous variable on airborne manganese, after adjusting for covariates including gender, age, job, smoking and drinking status, education level, BMI (body mass index). Airborne manganese was positively associated with blood manganese with statistical significance. The present study confirms that airborne manganese is a possible contributor to the increase of blood manganese in the adult general population.