• 제목/요약/키워드: Two-stage cluster sampling

검색결과 51건 처리시간 0.029초

Unbiased Balanced Half-Sample Variance Estimation in Stratified Two-stage Sampling

  • Kim, Kyu-Seong
    • Journal of the Korean Statistical Society
    • /
    • 제27권4호
    • /
    • pp.459-469
    • /
    • 1998
  • Balanced half sample method is a simple variance estimation method for complex sampling designs. Since it is simple and flexible, it has been widely used in large scale sample surveys. However, the usual BHS method overestimate the true variance in without replacement sampling and two-stage cluster sampling. Focusing on this point , we proposed an unbiased BHS variance estimator in a stratified two-stage cluster sampling and then described an implementation method of the proposed estimator. Finally, partially BHS design is explained as a tool of reducing the number of replications of the proposed estimator.

  • PDF

층화 2-단 표본 추출시 최적 집락의 크기 결정 (A Optimal Cluster Size in Stratified Two-Stage Cluster Sampling)

  • 신민웅;신기일
    • 응용통계연구
    • /
    • 제13권2호
    • /
    • pp.207-224
    • /
    • 2000
  • 모집단을 집략화하여 층화 2-단 표본 추출을 할 때에 일반적으로 집락의 크기는 정해져 있다. 그러나 집락이 아파트 단지 등과 같은 경우에 집락의 크기는 큰 차이를 보인다. 이 경우 집락을 합치거나 또는 분할할 필요가 생긴다. 대 표본조사(large sample survey)에서 행정상 또는 조사 편의상 동질의 원소들이 집락화 되어 있고 집락의 크기를 결정할 필요가 있을 경우가 고려되었으며 본 논문에서는 집락의 최적크기를 결정하는 문제를 다루었다. 또한 주어진 비용 하에서 최적의 일차 추출 단위 수와 최적의 이차 추출 단위 수를 구하였다.

  • PDF

A composite estimator for stratified two stage cluster sampling

  • Lee, Sang Eun;Lee, Pu Reum;Shin, Key-Il
    • Communications for Statistical Applications and Methods
    • /
    • 제23권1호
    • /
    • pp.47-55
    • /
    • 2016
  • Stratified cluster sampling has been widely used for effective parameter estimations due to reductions in time and cost. The probability proportional to size (PPS) sampling method is used when the number of cluster element are significantly different. However, simple random sampling (SRS) is commonly used for simplicity if the number of cluster elements are almost the same. Also it is known that the ratio estimator produces a good performance when the total number of population elements is known. However, the two stage cluster estimator should be used if the total number of elements in population is neither known nor accurate. In this study we suggest a composite estimator by combining the ratio estimator and the two stage cluster estimator to obtain a better estimate under a certain population circumstance. Simulation studies are conducted to compare the superiority of the suggested estimator with two other estimators.

이단계 집락추출에서의 표본크기에 대한 연구 (A Study of Sample Size for Two-Stage Cluster Sampling)

  • 송종호;제해성;박민규
    • 응용통계연구
    • /
    • 제24권2호
    • /
    • pp.393-400
    • /
    • 2011
  • 조사비용과 시간과 같은 현실적인 제약하에서 관측단위 (observation unit)의 집합인 집락(cluster)율 추출하는 집락추출법은 대부분의 대형조사(large scale survey) 에서 흔히 사용된다. 특별히 집락내의 관측단위가 매우 유사한 경우, 집락 내의 모든 관측치를 조사하는 대신 일부를 추출하여 조사하는 이단계 집락 추출법이 선호된다. 이단계 집락추출법의 적용시 집락인 1차추출단위 (Primary Sampling Unit; PSU)와 관측단위인 2차추출단위(Secondary Sampling Unit; SSU)의 표본수 결정은 주어진 비용과 표본으로부터 계산되어지는 통계량의 정도에 의존한다. 본 연구에서는 기존의 1차추출단위의 크기가 동일하다는 가정하에서 유도된 최적 PSU와 SSU 표본크기 산출과정을 일반화하여 1차추출단위의 크기가 같지 않을 경우의 최적 표본크기를 유도하고 그 결과를 제 4차 퇴원환자조사를 위한 표본추출 방안에 적용하여 기존방법과 비교하였으며 이를 바탕으로 제 7차 퇴원환자조사를 위한 표본크기를 제안하였다.

An Additive Quantitative Randomized Response Model by Cluster Sampling

  • Lee, Gi-Sung
    • 응용통계연구
    • /
    • 제25권3호
    • /
    • pp.447-456
    • /
    • 2012
  • For a sensitive survey in which the population is comprised of several clusters with a quantitative attribute, we present an additive quantitative randomized response model by cluster sampling that adapts a two-stage cluster sampling instead of a simple random sample based on Himmelfarb-Edgell's additive quantitative attribute model and Gjestvang-Singh's one. We also derive optimum values for the number of 1st stage clusters and the optimum values of observation units in a 2nd stage cluster under the condition of minimizing the variance given constant cost. We can see that Himmelfarb-Edgell's model is more efficient than Gjestvang-Singh's model under the condition of cluster sampling.

이단계표본추출을 이용한 소결핵병 유병률 추정 (Two-stage Sampling for Estimation of Prevalence of Bovine Tuberculosis)

  • 박선일
    • 한국임상수의학회지
    • /
    • 제28권4호
    • /
    • pp.422-426
    • /
    • 2011
  • For a national survey in which wide geographic region or an entire country is targeted, multi-stage sampling approach is widely used to overcome the problem of simple random sampling, to consider both herd- and animallevel factors associated with disease occurrence, and to adjust clustering effect of disease in the population in the calculation of sample size. The aim of this study was to establish sample size for estimating bovine tuberculosis (TB) in Korea using stratified two-stage sampling design. The sample size was determined by taking into account the possible clustering of TB-infected animals on individual herds to increase the reliability of survey results. In this study, the country was stratified into nine provinces (administrative unit) and herd, the primary sampling unit, was considered as a cluster. For all analyses, design effect of 2, between-cluster prevalence of 50% to yield maximum sample size, and mean herd size of 65 were assumed due to lack of information available. Using a two-stage sampling scheme, the number of cattle sampled per herd was 65 cattle, regardless of confidence level, prevalence, and mean herd size examined. Number of clusters to be sampled at a 95% level of confidence was estimated to be 296, 74, 33, 19, 12, and 9 for desired precision of 0.01, 0.02, 0.03, 0.04, 0.05, and 0.06, respectively. Therefore, the total sample size with a 95% confidence level was 172,872, 43,218, 19,224, 10,818, 6,930, and 4,806 for desired precision ranging from 0.01 to 0.06. The sample size was increased with desired precision and design effect. In a situation where the number of cattle sampled per herd is fixed ranging from 5 to 40 with a 5-head interval, total sample size with a 95% confidence level was estimated to be 6,480, 10,080, 13,770, 17,280, 20.925, 24,570, 28,350, and 31,680, respectively. The percent increase in total sample size resulting from the use of intra-cluster correlation coefficient of 0.3 was 22.2, 32.1, 36.3, 39.6, 41.9, 42.9, 42,2, and 44.3%, respectively in comparison to the use of coefficient of 0.2.

Optimal Allocations in Two-Stage Cluster Sampling

  • Koh, Bong-Sung
    • Communications for Statistical Applications and Methods
    • /
    • 제6권3호
    • /
    • pp.749-754
    • /
    • 1999
  • The cost is known to be proportional to the size of sample. We consider a cost function of the form Cost=c1np+c2npmq where c1, c2 p, and q are all positive constants. This cost function is to be used in finding an optimal allocation in two-stage cluster sampling. The optimal allocations of n and m gives the properties of uniqueness under some conditions and of monotonicity with p>0 when q=1.

  • PDF

Variance estimation of a double expanded estimator for two-phase sampling

  • Mingue Park
    • Communications for Statistical Applications and Methods
    • /
    • 제30권4호
    • /
    • pp.403-410
    • /
    • 2023
  • Two-Phase sampling, which was first introduced by Neyman (1938), has various applications in different forms. Variance estimation for two-phase sampling has been an important research topic because conventional variance estimators used in most softwares are not working. In this paper, we considered a variance estimation for two-phase sampling in which stratified two-stage cluster sampling designs are used in both phases. By defining a conditionally unbiased estimator of an approximate variance estimator, which is calculable when all elements in the first phase sample are observed, we propose an explicit form of variance estimator of the double expanded estimator for a two-phase sample. A small simulation study shows the proposed variance estimator has a negligible bias with small variance. The suggested variance estimator is also applicable to other linear estimators of the population total or mean if appropriate residuals are defined.

우리나라 당뇨병의 역학적 규모와 당뇨병 관리현황 파악을 위한 표본설계의 평가 (An Evaluation of Sampling Design for Estimating an Epidemiologic Volume of Diabetes and for Assessing Present Status of Its Control in Korea)

  • 이지성;김재용;백세현;박이병;이준영
    • Journal of Preventive Medicine and Public Health
    • /
    • 제42권2호
    • /
    • pp.135-142
    • /
    • 2009
  • Objectives : An appropriate sampling strategy for estimating an epidemiologic volume of diabetes has been evaluated through a simulation. Methods : We analyzed about 250 million medical insurance claims data submitted to the Health Insurance Review & Assessment Service with diabetes as principal or subsequent diagnoses, more than or equal to once per year, in 2003. The database was re-constructed to a 'patient-hospital profile' that had 3,676,164 cases, and then to a 'patient profile' that consisted of 2,412,082 observations. The patient profile data was then used to test the validity of a proposed sampling frame and methods of sampling to develop diabetic-related epidemiologic indices. Results : Simulation study showed that a use of a stratified two-stage cluster sampling design with a total sample size of 4,000 will provide an estimate of 57.04%(95% prediction range, 49.83 - 64.24%) for a treatment prescription rate of diabetes. The proposed sampling design consists, at first, stratifying the area of the nation into "metropolitan/city/county" and the types of hospital into "tertiary/secondary/primary/clinic" with a proportion of 5:10:10:75. Hospitals were then randomly selected within the strata as a primary sampling unit, followed by a random selection of patients within the hospitals as a secondly sampling unit. The difference between the estimate and the parameter value was projected to be less than 0.3%. Conclusions : The sampling scheme proposed will be applied to a subsequent nationwide field survey not only for estimating the epidemiologic volume of diabetes but also for assessing the present status of nationwide diabetes control.

2단 크기비례 계통추출법의 분산추정량 효율성 비교 (Efficiency of Variance Estimators for Two-stage PPS Systematic Sampling)

  • 김영원;김예니;한혜은;곽은선
    • 응용통계연구
    • /
    • 제26권6호
    • /
    • pp.1033-1041
    • /
    • 2013
  • 본 논문에서는 크기비례 계통추출법에서 적용할 수 있는 다양한 분산추정 방법들을 정리하고 각 분산추정 방법들의 통계적 특성에 대해서 논의하였다. 이론적으로 하나의 계통표본을 가지고 비편향 분산추정량을 구하는 것은 불가능 하지만 실제 표본자료 분석에 있어서 어떤 대안이 있을 수 있는지 살펴보고, 다양한 분산추정 방법들의 성질을 상대편향 및 상대평균제곱오차 관점에서 비교해 보았다. 또한 우리나라 가구나 사업체 표본설계에서 흔히 발생하는 2단 크기비례 계통추출 표본에서 적용 가능한 효과적인 분산추정 방법을 알아보기 위해 2008년 사업체근로실태조사 자료의 근로자 평균임금과 2011년 식품원료소비실태조사 자료의 가구당 연평균 쌀 소비량의 분산 추정 문제를 기초로 모의실험을 수행하였다.