• Title/Summary/Keyword: sampling bias

Efficient Use of Auxiliary Variables in Estimating Finite Population Variance in Two-Phase Sampling

  • Singh, Housila P.;Singh, Sarjinder;Kim, Jong-Min
    • Communications for Statistical Applications and Methods / v.17 no.2 / pp.165-181 / 2010
  • This paper presents some chain ratio-type estimators for estimating the finite population variance using two auxiliary variables in a two-phase sampling setup. Expressions for the biases and mean squared errors of the suggested classes of estimators are given. The asymptotic optimum estimators (AOEs) in each class are identified, along with their approximate mean squared error formulae. The theoretical and empirical properties of the suggested classes of estimators are investigated. The simulation study uses a real dataset on pulmonary disease, available on the CD accompanying the book by Rosner (2005).

Dynamic displacement estimation by fusing biased high-sampling rate acceleration and low-sampling rate displacement measurements using two-stage Kalman estimator

  • Kim, Kiyoung;Choi, Jaemook;Koo, Gunhee;Sohn, Hoon
    • Smart Structures and Systems / v.17 no.4 / pp.647-667 / 2016
  • In this paper, dynamic displacement is estimated with high accuracy by blending high-sampling-rate acceleration data with low-sampling-rate displacement measurements using a two-stage Kalman estimator. In Stage 1, the two-stage Kalman estimator first approximates the dynamic displacement. Then, the estimator in Stage 2 estimates the bias with high accuracy and refines the displacement estimate from Stage 1. In previous Kalman-filter-based displacement estimation techniques, the estimation accuracy can deteriorate due to (1) the discontinuities produced when the estimate is adjusted by a displacement measurement and (2) slow convergence at the beginning of estimation. To resolve these drawbacks, the previous techniques adopt smoothing, which uses additional future measurements in the estimation. However, smoothing requires more computational time and resources and hampers real-time estimation. The proposed technique addresses the drawbacks of the previous techniques without smoothing. The performance of the proposed technique is verified under various dynamic loading, sampling rate and noise level conditions via a series of numerical simulations and experiments. Its performance is also compared with that of the existing Kalman-filter-based techniques.

Generalized Composite Estimators and Mean Squared Errors for l/G Rotation Design (l/G 교체표본디자인에서의 일반화복합추정량과 평균제곱오차에 관한 연구)

  • 김기환;박유성;남궁재은
    • The Korean Journal of Applied Statistics / v.17 no.1 / pp.61-73 / 2004
  • Rotation sampling designs may be classified into two categories. The first type uses the same sample unit for the entire life of the survey. The second type uses the sample unit only a fixed number of times. In both types of design, the entire sample is partitioned into a finite number (= G) of rotation groups. This paper generalizes the first type of design. Since the generalized design can be identified by only the G rotation groups and the recall level l, we denote this rotation system as the l/G rotation design. Under the l/G rotation design, the variance and mean squared error (MSE) of the generalized composite estimator are derived, incorporating two types of bias and an exponentially decaying correlation pattern. Comparing the MSEs of some selected l/G designs, we investigate design efficiency, the design gap effect, and the effects of correlation and bias.

Model Performance Evaluation and Bias Correction Effect Analysis for Forecasting PM2.5 Concentrations (PM2.5 예보를 위한 모델 성능평가와 편차보정 효과 분석)

  • Ghim, Young Sung;Choi, Yongjoo;Kim, Soontae;Bae, Chang Han;Park, Jinsoo;Shin, Hye Jung
    • Journal of Korean Society for Atmospheric Environment / v.33 no.1 / pp.11-18 / 2017
  • The performance of a modeling system consisting of WRF model v3.3 and CMAQ model v4.7.1 for forecasting $PM_{2.5}$ concentrations was evaluated for the period May 2012 through December 2014. Twenty-four-hour averages of $PM_{2.5}$ and its major components obtained through filter sampling at the Bulgwang intensive measurement station were used for comparison. The mean predicted $PM_{2.5}$ concentration over the entire period was 68% of the mean measured value. Predicted concentrations of the major components were underestimated, except for $NO_3^-$. The model performance for $PM_{2.5}$ generally tended to degrade as the concentration level increased. However, the mean fractional bias (MFB) for high concentrations above the $80^{th}$ percentile fell within the criteria, the level of accuracy acceptable for standard model applications. Among the three bias correction methods, ratio adjustment was generally the most effective in improving performance. Albeit under limited test conditions, this analysis demonstrated that the effect of bias correction was larger when the predicted values deviated more from the measured values.
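
For reference, the mean fractional bias named in this abstract is commonly defined as MFB = (2/N) Σ (P_i - O_i) / (P_i + O_i), where P_i is a predicted and O_i an observed concentration. The Python sketch below computes it and applies one simple form of ratio adjustment. The function names, and the choice to scale forecasts by the ratio of observed to predicted totals over a training window, are assumptions made for illustration and are not necessarily the procedure used in the paper.

```python
def mean_fractional_bias(predicted, observed):
    """Return MFB = (2/N) * sum((P - O) / (P + O)); bounded in [-2, 2] for positive data."""
    if len(predicted) != len(observed) or len(predicted) == 0:
        raise ValueError("predicted and observed must be non-empty and equal in length")
    n = len(predicted)
    return 2.0 / n * sum((p - o) / (p + o) for p, o in zip(predicted, observed))


def ratio_adjust(predicted, train_predicted, train_observed):
    """Scale forecasts by (sum of observed) / (sum of predicted) over a training window.

    Assumption: this is one plausible reading of "ratio adjustment".
    """
    ratio = sum(train_observed) / sum(train_predicted)
    return [p * ratio for p in predicted]


# Hypothetical numbers: forecasts that sit at 68% of the measurements.
observed = [10.0, 20.0]
predicted = [6.8, 13.6]
raw_mfb = mean_fractional_bias(predicted, observed)
adjusted = ratio_adjust(predicted, predicted, observed)
adjusted_mfb = mean_fractional_bias(adjusted, observed)
```

With a purely multiplicative bias, as in these made-up numbers, the adjustment removes the bias entirely; real forecasts would leave some residual.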

GENERAL FAMILIES OF CHAIN RATIO TYPE ESTIMATORS OF THE POPULATION MEAN WITH KNOWN COEFFICIENT OF VARIATION OF THE SECOND AUXILIARY VARIABLE IN TWO PHASE SAMPLING

  • Singh, Housila P.;Singh, Sarjinder;Kim, Jong-Min
    • Journal of the Korean Statistical Society / v.35 no.4 / pp.377-395 / 2006
  • In this paper we suggest a family of chain estimators of the population mean $\bar{Y}$ of a study variate $y$ using two auxiliary variates in two-phase (double) sampling, assuming that the coefficient of variation of the second auxiliary variable is known. It is well known that chain estimators are traditionally formulated when the population mean $\bar{X}_1$ of one of the two auxiliary variables, say $x_1$, is not known but the population mean $\bar{X}_2$ of the other auxiliary variate $x_2$ is available, and $x_1$ has a higher degree of positive correlation with the study variate $y$ than $x_2$ has with $y$, $x_2$ being closely related to $x_1$. Here the classes are constructed when the population mean $\bar{X}_1$ of $x_1$ is not known and the coefficient of variation $C_{x_2}$ of $x_2$ is known instead of the population mean $\bar{X}_2$. Asymptotic expressions for the bias and mean square error (MSE) of the suggested family are obtained. An asymptotic optimum estimator (AOE) is also identified, together with its MSE formula. The optimum sizes of the preliminary and final samples are derived under a linear cost function. An empirical study is carried out to show the superiority of the constructed estimator over others.
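
As background for the "traditional" formulation mentioned in this abstract, the classical chain ratio estimator of $\bar{Y}$ can be sketched as follows. The primed notation is an assumption made here: $\bar{x}_1'$ and $\bar{x}_2'$ denote means from the larger first-phase (preliminary) sample, while $\bar{y}$ and $\bar{x}_1$ denote means from the second-phase (final) sample. This is the standard textbook form that uses a known $\bar{X}_2$, not the family proposed in the paper, which relies on a known $C_{x_2}$ instead.

```latex
\bar{y}_{\mathrm{chain}}
  = \bar{y}\,
    \frac{\bar{x}_1'}{\bar{x}_1}\,
    \frac{\bar{X}_2}{\bar{x}_2'}
```

The first ratio corrects $\bar{y}$ using the first-phase estimate of the unknown $\bar{X}_1$, and the second ratio in turn corrects that estimate using the known $\bar{X}_2$.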

Korean women wage analysis using selection models (표본 선택 모형을 이용한 국내 여성 임금 데이터 분석)

  • Jeong, Mi Ryang;Kim, Mijeong
    • Journal of the Korean Data and Information Science Society / v.28 no.5 / pp.1077-1085 / 2017
  • In this study, we identify the major factors affecting Korean women's wages by analysing data from the 2015 Korea Labor Panel Survey (KLIPS). In general, wage data are difficult to analyse because random sampling is infeasible. The Heckman sample selection model is the most widely used method for analysing data subject to sample selection. Heckman proposed two kinds of selection models: one estimated by maximum likelihood and the other the Heckman two-stage model. The Heckman two-stage model is known to be robust to the normality assumption on the bivariate error terms. Recently, Marchenko and Genton (2012) proposed the Heckman selection-t model, which generalizes the Heckman two-stage model, and concluded that the selection-t model is more robust to the error assumptions. Employing the two models, we analysed the data and compared the results.

Objective Bayesian Estimation of Two-Parameter Pareto Distribution (2-모수 파레토분포의 객관적 베이지안 추정)

  • Son, Young Sook
    • The Korean Journal of Applied Statistics / v.26 no.5 / pp.713-723 / 2013
  • An objective Bayesian estimation procedure for the two-parameter Pareto distribution is presented under the reference prior and the noninformative prior. Bayesian estimators are obtained by Gibbs sampling. Each step of the Gibbs sampler first generates the shape parameter from a gamma distribution and then generates the scale parameter by the adaptive rejection sampling algorithm. A numerical study shows that the proposed objective Bayesian estimation outperforms other estimation methods in terms of simulated bias and mean squared error.

Mean estimation of small areas using penalized spline mixed-model under informative sampling

  • Chytrasari, Angela N.R.;Kartiko, Sri Haryatmi;Danardono, Danardono
    • Communications for Statistical Applications and Methods / v.27 no.3 / pp.349-363 / 2020
  • The penalized spline is a suitable nonparametric approach for estimating the mean model in small areas. However, published applications of the approach under informative sampling are uncommon. We propose a semiparametric mixed model using penalized splines under informative sampling to estimate small-area means. The response variable is explained in terms of a mean model, an informative sample effect, an area random effect and a unit error. We approximate the mean model by a penalized spline and use a penalized spline function of the inclusion probability to account for the informative sample effect. We determine the best unbiased estimators of the model coefficients and derive the restricted maximum likelihood estimators of the variance components. A simulation study shows a decrease in the average absolute bias produced by the proposed model. The root mean square error also decreased, except in some quadratic cases. Using a linear or a quadratic penalized spline to approximate the function of the inclusion probability makes no significant difference to the distribution of the root mean square error, except for a few smaller samples.

Efficiency of Variance Estimators for Two-stage PPS Systematic Sampling (2단 크기비례 계통추출법의 분산추정량 효율성 비교)

  • Kim, Young-Won;Kim, Yeny;Han, Hye-Eun;Kwak, Eun-Sun
    • The Korean Journal of Applied Statistics / v.26 no.6 / pp.1033-1041 / 2013
  • In this paper, we investigate several variance estimators for PPS systematic sampling. Unfortunately, there is no unbiased variance estimator for a systematic sample, because systematic sampling can be regarded as a random selection of one cluster. This study provides guidance on which variance estimator may be more appropriate than others in several circumstances. We judge the efficiency of variance estimators for systematic sampling based on their relative biases and relative mean square errors. We also investigate variance estimation for two-stage systematic sampling through simulation studies based on the Food Raw Material Consumption Survey and the Establishment Labor Force Survey, in order to reflect the two-stage PPS systematic sample design commonly used for establishment and household surveys in Korea.
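
To make the selection mechanism concrete, here is a minimal Python sketch of single-stage PPS (probability proportional to size) systematic selection: unit sizes are cumulated, a random start is drawn within one sampling interval, and every interval-th point along the cumulated scale picks a unit. The function name and the example sizes are hypothetical, and the sketch covers only the selection step, not the variance estimators compared in the paper.

```python
import random


def pps_systematic_sample(sizes, n, rng=None):
    """Select n unit indices by PPS systematic sampling.

    A unit whose size is at least the sampling interval can be hit more
    than once, so indices may repeat in the result.
    """
    if n <= 0 or not sizes or any(s <= 0 for s in sizes):
        raise ValueError("need n > 0 and a non-empty list of positive sizes")
    rng = rng or random.Random()
    total = float(sum(sizes))
    interval = total / n
    start = rng.random() * interval      # random start in [0, interval)
    selected = []
    idx = 0
    cum = sizes[0]                       # cumulative size through unit idx
    for k in range(n):
        point = start + k * interval     # k-th selection point
        # Advance to the unit whose cumulative range contains the point.
        while point >= cum and idx < len(sizes) - 1:
            idx += 1
            cum += sizes[idx]
        selected.append(idx)
    return selected


# Hypothetical example: four equal-sized units, sample all four.
equal_sample = pps_systematic_sample([1, 1, 1, 1], 4, random.Random(0))
```

Because a single random start determines the whole sample, the design behaves like drawing one cluster at random, which is why the abstract notes that no unbiased variance estimator exists.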

An estimation procedure with updated sample (패널조사에서 표본 변경을 고려한 추정)

  • 박진우
    • The Korean Journal of Applied Statistics / v.10 no.2 / pp.367-374 / 1997
  • In panel surveys it is necessary to manage both the sampling frame and the sample units across time. When the sample is updated to reflect changes in its frame, the update should be incorporated in the estimation procedure. This paper derives the bias of the conventional estimator caused by neglecting the change in the sample, and provides a bias-adjusted estimator with its variance.
