• Title/Summary/Keyword: 정보적 표본설계

Search Result 102, Processing Time 0.022 seconds

표본배분에 관한 소고

  • 김종호
    • Communications for Statistical Applications and Methods
    • /
    • v.3 no.3
    • /
    • pp.299-302
    • /
    • 1996
  • 표본조사에 있어서 층화추출법은 모집단에 관한 예비정보를 필요로 하고 있다. 조사자가 표본설계시 층화와 표본배분의 문제를 막연히 추상적으로 처리함으로 생기는 오류를 줄이기 위해서 다원적 입장에서 모집단에 대한 예비 정보를 정확하게 파악하고 이용해야 층화추출법의 효율을 올릴 수 있음을 지적하고 있다.

  • PDF

A study to improve the accuracy of the naive propensity score adjusted estimator using double post-stratification method (나이브 성향점수보정 추정량의 정확성 향상을 위한 이중 사후층화 방법 연구)

  • Leesu Yeo;Key-Il Shin
    • The Korean Journal of Applied Statistics
    • /
    • v.36 no.6
    • /
    • pp.547-559
    • /
    • 2023
  • Proper handling of nonresponse in sample survey improves the accuracy of the parameter estimation. Various studies have been conducted to properly handle MAR (missing at random) nonresponse or MCAR (missing completely at random) nonresponse. When nonresponse occurs, the PSA (propensity score adjusted) estimator is commonly used as a mean estimator. The PSA estimator is known to be unbiased when known sample weights and properly estimated response probabilities are used. However, for MNAR (missing not at random) nonresponse, which is affected by the value of the study variable, since it is very difficult to obtain accurate response probabilities, bias may occur in the PSA estimator. Chung and Shin (2017, 2022) proposed a post-stratification method to improve the accuracy of mean estimation when MNAR nonresponse occurs under a non-informative sample design. In this study, we propose a double post-stratification method to improve the accuracy of the naive PSA estimator for MNAR nonresponse under an informative sample design. In addition, we perform simulation studies to confirm the superiority of the proposed method.

Measuring stratification effects for multistage sampling (다단추출 표본설계의 층효율성 연구)

  • Taehoon Kim;KeeJae Lee;Inho Park
    • The Korean Journal of Applied Statistics
    • /
    • v.36 no.4
    • /
    • pp.337-347
    • /
    • 2023
  • Sampling designs often use stratified sampling, where elements or clusters of the study population are divided into strata and an independent sample is chosen from each stratum. The stratification strategy consists of stratification and sample allocation, which are important issues that are repeatedly considered in survey sampling. Although a stratified multistage sample design is often used in practice, the literature tends to discuss simple sampling in terms of stratum effects or stratum efficiency. This study examines an existing stratum efficiency measure for two-stage sampling and further proposes additional stratum efficiency measures using the design effect model. The proposed measures are used to evaluate the stratification strategy of the sample design for high school students of the 4th Korean National Environmental Health Survey (KoNEHS).

Effect of complex sample design on Pearson test statistic for homogeneity (복합표본자료에서 동질성검정을 위한 피어슨 검정통계량의 효과)

  • Heo, Sun-Yeong;Chung, Young-Ae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.4
    • /
    • pp.757-764
    • /
    • 2012
  • This research is for comparison of test statistics for homogeneity when the data is collected based on complex sample design. The survey data based on complex sample design does not satisfy the condition of independency which is required for the standard Pearson multinomial-based chi-squared test. Today, lots of data sets ara collected by complex sample designs, but the tests for categorical data are conducted using the standard Pearson chi-squared test. In this study, we compared the performance of three test statistics for homogeneity between two populations using data from the 2009 customer satisfaction evaluation survey to the service from Gyeongsangnam-do regional offices of education: the standard Pearson test, the unbiasedWald test, and the Pearsontype test with survey-based point estimates. Through empirical analyses, we fist showed that the standard Pearson test inflates the values of test statistics very much and the results are not reliable. Second, in the comparison of Wald test and Pearson-type test, we find that the test results are affected by the number of categories, the mean and standard deviation of the eigenvalues of design matrix.

Estimation using informative sampling technique when response rate follows exponential function of variable of interest (응답률이 관심변수의 지수함수를 따를 경우 정보적 표본설계 기법을 이용한 모수추정)

  • Chung, Hee Young;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.6
    • /
    • pp.993-1004
    • /
    • 2017
  • A stratified sampling method is generally used with a sample selected using the same sample weight in each stratum in order to improve the accuracy of the sampling survey estimation. However, the weight should be adjusted to reflect the response rate if the response rate is affected by the value of the variable of interest. It may be also more effective to adjust the weights by subdividing the stratum rather than using the same weight if the variable of interest has a linear relationship with the continuous auxiliary variables. In this study, we propose a method to increase the accuracy of estimation using an informative sampling design technique when the response rate is an exponential function of the variable of interest and the variable of interest has a linear relationship with the auxiliary variable. Simulation results show the superiority of the proposed method.

A sample design for the survey on goodwill in retail properties (상가권리금 현황조사를 위한 표본설계 연구)

  • Kim, Dal Ho;Woo, Namkyo;Jo, Junwoo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.6
    • /
    • pp.1443-1452
    • /
    • 2016
  • In this paper, we study a sample design for survey on goodwill in retail properties to provide a protecting policy for small traders and tenants, to use basic data for a dispute case related to goodwill. Since goodwill in retail properties is occurred by individual rent company, we use the census on establishments from the Statistics Korea as population. First of all, we consider preferentially seven metropolitan cities in which there are more than half of population. Total sample size is decided as 8,000. We allocate the sample size for markets as stratum in each city using proportional formula and the sample size for industrial classifications in each market using root proportional formula. Also we compute survey weights and calculate estimators, standard errors and interval of estimators for each characteristic such as type of establishments and market in seven metropolitan cities.

A study on the determination of substrata using the information of exponential response rate by simulation studies (모의실험을 기반으로 지수형 응답률 보정을 위한 세부 층 결정에 관한 연구)

  • Min, Joo-Won;Shin, Key-Il
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.5
    • /
    • pp.621-636
    • /
    • 2018
  • Research on the application of informative sampling technique has been conducted in order to reduce the influence of non-response. Chung and Shin (Korean Journal of Applied Statistics, 30, 993-1004, 2017) showed that the estimation accuracy improved when using exponential response rate information for the parameter estimation if the distribution of errors included in the super population model follows normal distribution. However this method divides the stratum into equally spaced substrata to obtain the sample weight of the informative sampling technique and shows that the accuracy of the estimation improves as the number of substrata increases. In this study, with the given number of total sample size, the optimal substratum boundary points are calculated using equal space, quantile, and LH algorithm; consequently, the results using those methods are compared through simulation. We also studied the criteria to determine the number of substrata and substratum boundaries that can be used in practice with various types of auxiliary variable distributions.

국가 생물자원 정보화 현황 및 전망

  • Kim, Chan-Hoe;Byeon, Bong-Gyu
    • Proceedings of the Korean Society of Applied Entomolohy Conference
    • /
    • 2006.09a
    • /
    • pp.45-60
    • /
    • 2006
  • 최근 국제적으로 생물자원의 중요성이 재인식됨에 따라 이들에 대한 체계적이고 안정적인 정보의 구축 및 활용이 국가경쟁력의 척도로 간주할 정도로 중요성이 커지고 있는 실정이다. 또한, 우리나라는 국토의 64%가 산림으로 되어 있어서 다양한 생물종이 서식하는데 매우 적합한 조건을 갖추고 있다. 이에 따라 산림청과 국립수목원은 국가식물자원정보시스템에 이어 곤충정보에 대한 정보화 사업을 2001년부터 추진해오고 있으며 이 결과로 현재 33만여 점에 곤충표본정보가 구축되기에 이르렀다. 현재까지 국내 유수 곤충표본보유기관 22개 기관이 본 사업에 참여하여 다양하고 내실 있는 정보구축이 이루어져 왔으며 이를 통해 명실 공히 국가적인 곤충자원정보망을 갖추게 되었다. 곤충DB의 국가적인 사업이 추진된 지 6년차에 이르는 2006년도에는 정보통신부의 지원을 받아 14개 표본보유기관이 참여하여 곤충표본DB구축을 중점 추진할 계획이다. 2006년도의 목표량은 곤충표본정보 17,000점으로 기존구축자료 이외에 추가가 필요한 표본정보위주로 구축될 예정이다. 또한, 이미 구축된 곤충표본자원의 활용도와 가치를 높이기 위해서는 더욱 다양하고 많은 자료의 표본을 추가하여 DB화를 추진함은 물론 이들 각각의 정보들을 식물정보와 연계하여 분석 가능토록 하고 GIS시스템을 도입하여 명실공히 국가적인 곤충자원정보의 종합관리가 될 수 있도록 추진하고 있다. 본 사업이 충실히 수행될 경우 국가 주요 생물자원 중 하나인 곤충정보의 DB확대 구축을 통한 전체적인 현황파악 및 체계적인 관리가 가능해 질 것이며, 이와 관련하여 정보화적인 측면, 경제적 측면, 사화 문화적 측면에서 다양한 효과가 기대되며 앞으로도 이에 대한 내실 있는 운영을 위해서는 정부차원의 종합적인 지원 및 관리가 요구되는 시점으로 판단된다.pm3.42$, 저층수 $23.43\pm3.38$이었으며, 전반적으로 해역별 수질기준 I등급 내지는 II등급을 유지하고 있었고, 공간적으로는 외해측으로 갈수록 외해수와 혼합 확산되어 양호한 수질을 나타내었다. 장기적인 변동특성은 세그룹으로 구분되어진다.기 실험결과 용출용매로 증류수와 해수를 이용했을 때, 제강 슬래그에서 용출되는 납, 구리, 카드뮴, 수은의 용출 경향의 차이를 확인할 수 있었고 이에 따라서, 납, 구리, 카드뮴의 용출 유해성은 낮기 때문에 해양구조물로의 제강슬래그 유효이용은 적합할 것으로 판단되었다.im80%$로 계산되었다. 열형광선량계로 측정된 방사선량은 각각 1.8, 1.2, 0.8, 1.2, 0.8 (70 cm 거리) cGy로 측정되었으며, 환자의 복부 표면에서의 서베이메터를 이용한 측정량은 10.9 mR/h였다. 차폐구조물의 사용 시 전체 치료 동안에 태아선량은 약 1 cGy 정도로 평가되었다. 결론 : AAPM Report No.50의 자료에 따르면, 임산부의 방사선 치료 시 태아의 방사선 피폭선량은 5 cGy 이하일 경우에 방사선 피폭에 따른 태아의 위험이 거의 없는 것으로 제시되고 있다. 본원에서 차폐 구조물을 설치하였을 경우에 측정된 태아선량은 약 1 cGy로 측정되었고, 고안된 차폐구조물은 태아에 도달하는 방사선량을 감소시키기에 적합한 설계임이 입증되었다. 아니라 일반종합병원에서도 CTX-M형 ESBL 생성 E. coli와 K. pneumoniae가 존재하며 확산 중임을 시사한다. 앞으로 CTX-M형 ESBL의 만연과 변종 CTX-M형 ESBL의 출연을 감시하기 위한 정기적인 연구와 조사가 필요한 것으로 생각한다., A2-1, B1-1, B2-1의 경우, 강우 일수 감소 이전과 연 유출량 변화는 거의

  • PDF

Multivariate Stratification Method for the Multipurpose Sample Survey : A Case Study of the Sample Design for Fisher Production Survey (다목적 표본조사를 위한 다변량 층화 : 어업비계통생산량조사를 위한 표본설계 사례)

  • Park, Jin-Woo;Kim, Young-Won;Lee, Seok-Hoon;Shin, Ji-Eun
    • Survey Research
    • /
    • v.9 no.1
    • /
    • pp.69-85
    • /
    • 2008
  • Stratification is a feature of the majority of field sample design. This paper considers the multivariate stratification strategy for multipurpose sample survey with several auxiliary variables. In a multipurpose survey, stratification procedure is very complicated because we have to simultaneously consider the efficiencies of stratification for several variables of interest. We propose stratification strategy based on factor analysis and cluster analysis using several stratification variables. To improve the efficiency of stratification, we first select the stratification variables by factor analysis, and then apply the K-means clustering algorithm to the formation of strata. An application of the stratification strategy in the sampling design for the Fisher Production Survey is discussed, and it turns out that the variances of estimators are significantly less than those obtained by simple random sampling.

  • PDF

Choosing clusters for two-stage household surveys (가구조사를 위한 이단추출 표본설계에서의 집락선택)

  • Park, Inho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.2
    • /
    • pp.363-372
    • /
    • 2016
  • Two-stage sample designs are commonly used for household surveys in Korea using as clusters the enumeration districts (EDs). Since clustering decomposes the population variation into within- and between-cluster variations, the sample sizes allocated in stages can affect the overall precision. Alternative clusters are often considered due to diverse reasons such as the EDs' limitation in size, being out-of-date, and in-assessibility to their household lists. In addition, the EDs are currently under development by the Statistics Korea as an joint effort toward their transition from the traditional practice to the register census from 2015. We present an approach for evaluating the difference in the precision of the mean estimators of the sets of the cluster units in between a hierachical and nested form, where the design effect is used to reflect the effect of the clustering and the sample allocation. We also demonstrate our approach using the U.S. Census counts from the year 2000 for Anne Arundel County in Maryland. Our research shows that the within-cluster variance can be significantly different for survey variables and thus the choice of cluster units and the associated sample allocation scheme should reflect the corresponding variance decomposition due to clustering.