• 제목/요약/키워드: unequal probability sampling.

검색결과 7건 처리시간 0.022초

Regression Estimators with Unequal Selection Probabilities on Two Successive Occasions

  • Kim, Kyu-Seong
    • Journal of the Korean Statistical Society
    • /
    • 제25권1호
    • /
    • pp.25-37
    • /
    • 1996
  • In this paper, we propose regression estimators based on a partial replacement sampling scheme over two successive occasions and derive the minimum variances of them. PPSWR, RHC, $\pi$PS and PPSWOR schemes are considered to select unequal probability samples on two occasions. Simulation results over four populations are given for comparison of composite estimators and regression estimators.

  • PDF

시계열 계속 표본조사에서 불균등확률 추출법 연구 (A study on unequal probability sampling over two successive occasions in time series)

  • 박홍래;이계오
    • 응용통계연구
    • /
    • 제6권1호
    • /
    • pp.145-162
    • /
    • 1993
  • 본 논문에서는 반복적 계속 표본조사에서 일부의 표본을 교체하는 2회 계속조사의 표본 추 출법들을 요약하고 앞 조사시기의 관찰값을 확률측도로 이용한 RHC(Rao-Hartley-Cochran) 유형의 불균등 확률추출법을 제안하였다. 제안된 추출법과 기존의 확률추출법의 비교를 위 하여 둘째 조사시기의 모평균 추정량과 그의 분산을 유도하였으며, 제안된 추출법의 상대 효율은 이론적인 측면과 수치적 시뮬레이션 방법으로 비교 분석되었다. 시뮬레이션 비교를 위하여 한 특별한 시계열 모형을 가정하고 이를 사용하여 인위적인 모집단을 생성하였으며 이 모집단에서 각 추출법에 해당되는 표본을 컴퓨터로 추출하여 각각의 추정치를 계산하여 비교한 결과에서 RHC 유형의 새로 제안된 추출법의 분산과 편차가 일반적으로 적음을 보 였다.

  • PDF

확률비례추출법에 의한 확률화응답기법에 관한 연구 (A Study on the Randomized Response Technique by PPS Sampling)

  • 이기성
    • 응용통계연구
    • /
    • 제19권1호
    • /
    • pp.69-80
    • /
    • 2006
  • 본 연구에서는 매우 민감한 조사에서 모집단이 집락의 크기가 서로 다른 여러 개의 집락으로 구성되어 있을 때, 집락의 크기에 비례하게 추출확률을 부여하는 확률비례추출법(probability proportional to size : pps)을 이용한 확률화응답기법을 제안하고자 한다. 민감한 속성에 대한 모수의 추정치와 분산 및 분산추정량을 구하여 이론적 체계를 구축하고, 확률비례추출법에 의한 확률화응답기법과 등확률 2단계 추출법에 의한 확률화응답기법의 효율성을 비교해 보고자 한다. 또한, 실제조사를 통해 제안한 확률비례추출법에 의한 확률화응답기법에 대한 실용화의 타당성을 검토하고자 한다.

Adjusting sampling bias in case-control genetic association studies

  • Seo, Geum Chu;Park, Taesung
    • Journal of the Korean Data and Information Science Society
    • /
    • 제25권5호
    • /
    • pp.1127-1135
    • /
    • 2014
  • Genome-wide association studies (GWAS) are designed to discover genetic variants such as single nucleotide polymorphisms (SNPs) that are associated with human complex traits. Although there is an increasing interest in the application of GWAS methodologies to population-based cohorts, many published GWAS have adopted a case-control design, which raise an issue related to a sampling bias of both case and control samples. Because of unequal selection probabilities between cases and controls, the samples are not representative of the population that they are purported to represent. Therefore, non-random sampling in case-control study can potentially lead to inconsistent and biased estimates of SNP-trait associations. In this paper, we proposed inverse-probability of sampling weights based on disease prevalence to eliminate a case-control sampling bias in estimation and testing for association between SNPs and quantitative traits. We apply the proposed method to a data from the Korea Association Resource project and show that the standard estimators applied to the weighted data yield unbiased estimates.

Variance estimation for distribution rate in stratified cluster sampling with missing values

  • Heo, Sunyeong
    • Journal of the Korean Data and Information Science Society
    • /
    • 제28권2호
    • /
    • pp.443-449
    • /
    • 2017
  • Estimation of population proportion like the distribution rate of LED TV and the prevalence of a disease are often estimated based on survey sample data. Population proportion is generally considered as a special form of population mean. In complex sampling like stratified multistage sampling with unequal probability sampling, the denominator of mean may be random variable and it is estimated like ratio estimator. In this research, we examined the estimation of distribution rate based on stratified multistage sampling, and determined some numerical outcomes using stratified random sample data with about 25% of missing observations. In the data used for this research, the survey weight was determined by deterministic way. So, the weights are not random variable, and the population distribution rate and its variance estimator can be estimated like population mean estimation. When the weights are not random variable, if one estimates the variance of proportion estimator using ratio method, then the variances may be inflated. Therefore, in estimating variance for population proportion, we need to examine the structure of data and survey design before making any decision for estimation methods.

Empirical Bayes Posterior Odds Ratio for Heteroscedastic Classification

  • Kim, Hea-Jung
    • Journal of the Korean Statistical Society
    • /
    • 제16권2호
    • /
    • pp.92-101
    • /
    • 1987
  • Our interest is to access in some way teh relative odds or probability that a multivariate observation Z belongs to one of k multivariate normal populations with unequal covariance matrices. We derived the empirical Bayes posterior odds ratio for the classification rule when population parameters are unknown. It is a generalization of the posterior odds ratio suggested by Gelsser (1964). The classification rule does not have complicated distribution theory which a large variety of techniques from the sampling viewpoint have. The proposed posterior odds ratio is compared to the Gelsser's posterior odds ratio through a Monte Carlo study. The results show that the empiricla Bayes posterior odds ratio, in general, performs better than the Gelsser's. Especially, for large dimension of Z and small training sample, the performance is prominent.

  • PDF

A NOTE ON PROTECTION OF PRIVACY IN RANDOMIZED RESPONSE DEVICES

  • SAHA AMITAVA
    • Journal of the Korean Statistical Society
    • /
    • 제34권4호
    • /
    • pp.297-309
    • /
    • 2005
  • We consider 'efficiency versus privacy-protection' problem concerned with several well-known randomized response (RR) devices to estimate pro­portion of people bearing a stigmatizing characteristic in a community. The literature of RR on respondent's privacy protection discusses only about response specific jeopardy measures. We propose a measure of jeopardy that is independent of the RR offered by the interviewee and recommend it for using as a technical characteristic of the RR device. For ensuring better cooperation from the interviewees this new measure that depends only on the design parameters of the RR devices may be disclosed to the respondents before producing the RR by implementing the randomization device.