• Title/Summary/Keyword: 초모집단모형

Search Result 9, Processing Time 0.028 seconds

A study on non-response bias adjusted estimation in business survey (사업체조사에서의 무응답 편향보정 추정에 관한 연구)

  • Chung, Hee Young;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.1
    • /
    • pp.11-23
    • /
    • 2020
  • Sampling design should provide statistics to meet a given accuracy while saving cost and time. However, a large number of non-responses are occurring due to the deterioration of survey circumstances, which significantly reduces the accuracy of the survey results. Non-responses occur for a variety of reasons. Chung and Shin (2017, 2019) and Min and Shin (2018) found that the accuracy of estimation is improved by removing the bias caused by non-response when the response rate is an exponential or linear function of variable of interests. For that case they assumed that the error of the super population model follows normal distribution. In this study, we proposed a non-response bias adjusted estimator in the case where the error of a super population model follows the gamma distribution or the log-normal distribution in a business survey. We confirmed the superiority of the proposed estimator through simulation studies.

On the Relative Efficiency of Alternative Estimators in RHC Sampling Scheme

  • 홍기학;이기성;손창균
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2004.11a
    • /
    • pp.61-64
    • /
    • 2004
  • 대규모 표본조사와 관련해서 관심변수와 보조변수간의 약한 상관관계를 고려한 Amahia et at.(1989)의 대체추정방법을 RHC(Rao, Hartley and Cochran)추출방법에 적용해서 Rao추정량과 효율성을 비교하였다.

  • PDF

Efficient Estimation of the Mean for Populations with a Linear Trend : An Extension of Systematic Sampling (선형추세를 갖는 모집단에 대한 효율적인 모평균 추정 : 계통추출의 확장)

  • 김혁주;석은양
    • The Korean Journal of Applied Statistics
    • /
    • v.13 no.2
    • /
    • pp.457-476
    • /
    • 2000
  • In this study, we have proposed a sampling method and an estimation method for efficiently estimating the mean of a population which has a linear trend. These methods involve drawing a sample by the so-called "centered balanced systematic sampling", which is an extension of systematic sampling, and then estimating the population mean with an adjusted estimator, not with the sample mean itself. We used the concept of interpolation in determining the adjusted estimator.\Ve compared the efficiency of the proposed estimator with those of the estimators from existing methods, under the expected mean square error criterion based on the infinite superpopulation model introduced by Cochran(1946). The proposed method is for use in the case when the sample size n(2 5) is an odd number and k(the reciprocal of the sampling fraction) is an even number. A good result was also obtained in an example using computer simulation. simulation.

  • PDF

Minimum Variance Estimation for the Power Allocation in Stratified Sampling

  • Son, Chang-Gyun;Hong, Gi-Hak;Lee, Gi-Seong
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2002.11a
    • /
    • pp.185-189
    • /
    • 2002
  • 본 논문에서는 초 모집단 모형 하에서 HT 추정량의 분산의 하한에 관계된 층화추정량의 효율성에 대해 다루었다. 특별히 Dalenius-Hodges 층화와 표본배분방법 중 멱배분(power allocation)을 적용했을 때 최소분산 성질에 대해 살펴보았다.

  • PDF

A study on the determination of substrata using the information of exponential response rate by simulation studies (모의실험을 기반으로 지수형 응답률 보정을 위한 세부 층 결정에 관한 연구)

  • Min, Joo-Won;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.5
    • /
    • pp.621-636
    • /
    • 2018
  • Research on the application of informative sampling technique has been conducted in order to reduce the influence of non-response. Chung and Shin (Korean Journal of Applied Statistics, 30, 993-1004, 2017) showed that the estimation accuracy improved when using exponential response rate information for the parameter estimation if the distribution of errors included in the super population model follows normal distribution. However this method divides the stratum into equally spaced substrata to obtain the sample weight of the informative sampling technique and shows that the accuracy of the estimation improves as the number of substrata increases. In this study, with the given number of total sample size, the optimal substratum boundary points are calculated using equal space, quantile, and LH algorithm; consequently, the results using those methods are compared through simulation. We also studied the criteria to determine the number of substrata and substratum boundaries that can be used in practice with various types of auxiliary variable distributions.

A comparison of alternative estimators in view of the Rao-Hartley-Cochran sampling scheme

  • Hong, Ki-Hak;Lee, Gi-Sung;Son, Chang-Kyoon
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2006.04a
    • /
    • pp.181-187
    • /
    • 2006
  • In this paper we suggest a new alternative estimator for the characteristics that are pooly correlated with the selection probabilities by applying the Amahia et al.(1989)'s estimator to Rao-Hartley-Cochran sampling scheme and compare it with that of Rao(1966)'s under a super-population model.

  • PDF

Features of sample concepts in the probability and statistics chapters of Korean mathematics textbooks of grades 1-12 (초.중.고등학교 확률과 통계 단원에 나타난 표본개념에 대한 분석)

  • Lee, Young-Ha;Shin, Sou-Yeong
    • Journal of Educational Research in Mathematics
    • /
    • v.21 no.4
    • /
    • pp.327-344
    • /
    • 2011
  • This study is the first step for us toward improving high school students' capability of statistical inferences, such as obtaining and interpreting the confidence interval on the population mean that is currently learned in high school. We suggest 5 underlying concepts of 'discretion of contingency and inevitability', 'discretion of induction and deduction', 'likelihood principle', 'variability of a statistic' and 'statistical model', those are necessary to appreciate statistical inferences as a reliable arguing tools in spite of its occasional erroneous conclusions. We assume those 5 concepts above are to be gradually developing in their school periods and Korean mathematics textbooks of grades 1-12 were analyzed. Followings were found. For the right choice of solving methodology of the given problem, no elementary textbook but a few high school textbooks describe its difference between the contingent circumstance and the inevitable one. Formal definitions of population and sample are not introduced until high school grades, so that the developments of critical thoughts on the reliability of inductive reasoning could not be observed. On the contrary of it, strong emphasis lies on the calculation stuff of the sample data without any inference on the population prospective based upon the sample. Instead of the representative properties of a random sample, more emphasis lies on how to get a random sample. As a result of it, the fact that 'the random variability of the value of a statistic which is calculated from the sample ought to be inherited from the randomness of the sample' could neither be noticed nor be explained as well. No comparative descriptions on the statistical inferences against the mathematical(deductive) reasoning were found. Few explanations on the likelihood principle and its probabilistic applications in accordance with students' cognitive developmental growth were found. It was hard to find the explanation of a random variability of statistics and on the existence of its sampling distribution. It is worthwhile to explain it because, nevertheless obtaining the sampling distribution of a particular statistic, like a sample mean, is a very difficult job, mere noticing its existence may cause a drastic change of understanding in a statistical inference.

  • PDF

Estimation using informative sampling technique when response rate follows exponential function of variable of interest (응답률이 관심변수의 지수함수를 따를 경우 정보적 표본설계 기법을 이용한 모수추정)

  • Chung, Hee Young;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.6
    • /
    • pp.993-1004
    • /
    • 2017
  • A stratified sampling method is generally used with a sample selected using the same sample weight in each stratum in order to improve the accuracy of the sampling survey estimation. However, the weight should be adjusted to reflect the response rate if the response rate is affected by the value of the variable of interest. It may be also more effective to adjust the weights by subdividing the stratum rather than using the same weight if the variable of interest has a linear relationship with the continuous auxiliary variables. In this study, we propose a method to increase the accuracy of estimation using an informative sampling design technique when the response rate is an exponential function of the variable of interest and the variable of interest has a linear relationship with the auxiliary variable. Simulation results show the superiority of the proposed method.

Choosing clusters for two-stage household surveys (가구조사를 위한 이단추출 표본설계에서의 집락선택)

  • Park, Inho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.2
    • /
    • pp.363-372
    • /
    • 2016
  • Two-stage sample designs are commonly used for household surveys in Korea using as clusters the enumeration districts (EDs). Since clustering decomposes the population variation into within- and between-cluster variations, the sample sizes allocated in stages can affect the overall precision. Alternative clusters are often considered due to diverse reasons such as the EDs' limitation in size, being out-of-date, and in-assessibility to their household lists. In addition, the EDs are currently under development by the Statistics Korea as an joint effort toward their transition from the traditional practice to the register census from 2015. We present an approach for evaluating the difference in the precision of the mean estimators of the sets of the cluster units in between a hierachical and nested form, where the design effect is used to reflect the effect of the clustering and the sample allocation. We also demonstrate our approach using the U.S. Census counts from the year 2000 for Anne Arundel County in Maryland. Our research shows that the within-cluster variance can be significantly different for survey variables and thus the choice of cluster units and the associated sample allocation scheme should reflect the corresponding variance decomposition due to clustering.