• Title/Summary/Keyword: Sampling distribution of sample mean

Search Result 50, Processing Time 0.022 seconds

Variance estimation for distribution rate in stratified cluster sampling with missing values

  • Heo, Sunyeong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.2
    • /
    • pp.443-449
    • /
    • 2017
  • Estimation of population proportion like the distribution rate of LED TV and the prevalence of a disease are often estimated based on survey sample data. Population proportion is generally considered as a special form of population mean. In complex sampling like stratified multistage sampling with unequal probability sampling, the denominator of mean may be random variable and it is estimated like ratio estimator. In this research, we examined the estimation of distribution rate based on stratified multistage sampling, and determined some numerical outcomes using stratified random sample data with about 25% of missing observations. In the data used for this research, the survey weight was determined by deterministic way. So, the weights are not random variable, and the population distribution rate and its variance estimator can be estimated like population mean estimation. When the weights are not random variable, if one estimates the variance of proportion estimator using ratio method, then the variances may be inflated. Therefore, in estimating variance for population proportion, we need to examine the structure of data and survey design before making any decision for estimation methods.

A Study of Using the Terminology of Sampling Error and Sampling Distribution (표집오차(sampling error)와 표집분포(sampling distribution)의 용어 사용에 관한 연구)

  • Kim, Yung-Hwan
    • Journal of the Korean School Mathematics Society
    • /
    • v.9 no.3
    • /
    • pp.309-316
    • /
    • 2006
  • This study examined the ambiguous using the terminology of statistics at mathematics textbook of highschool in Korea and proposed the correct using of sampling error and sampling distribution of sample mean with consistency. And this paper proposed that the concept of error have to teach in context of sampling action in school mathematics.

  • PDF

A Bayesian Multiple Testing of Detecting Differentially Expressed Genes in Two-sample Comparison Problem

  • Oh Hyun-Sook;Yang Wan-Youn
    • Communications for Statistical Applications and Methods
    • /
    • v.13 no.1
    • /
    • pp.39-47
    • /
    • 2006
  • The Bayesian approach to multiple testing procedure for one sample testing problem proposed by Scott and Berger (2003) is extended to two-sample comparison problem in microarray experiments. The prior distribution of each gene's mean for one sample is given conditionally on the corresponding gene's mean for the other sample. Posterior distributions of interesting parameters are derived and estimated based on an importance sampling method. A simulated example is given for illustration.

A Bulk Sampling Plan for Reliability Assurance (벌크재료의 신뢰성보증을 위한 샘플링검사 방식)

  • Kim, Dong-Chul;Kim, Jong-Gurl
    • Journal of the Korea Safety Management & Science
    • /
    • v.9 no.2
    • /
    • pp.123-134
    • /
    • 2007
  • This paper focuses on the in-house reliability assurance plan for the bulk materials of each company. The reliability assurance needs in essence a long time and high cost for testing the materials. In order to reduce the time and cost, accelerated life test is adopted. The bulk sampling technique was used for acceptance. Design parameters might be total sample size(segments and increments}, stress level and so on. We focus on deciding the sample size by minimizing the asymptotic variance of test statistics as well as satisfying the consumer's risk. In bulk sampling, we also induce the sample size by adapting the normal life time distribution model when the variable of the lognormal life time distribution is transformed and adapted to the model. In addition, the sample size for both the segments and increments can be induced by minimizing the asymptotic variance of test statistics of the segments and increments with consumer's risk met. We can assure the reliability of the mean life and B100p life time of the bulk materials by using the calculated minimum sample size.

The Design and Implementation to Teach Sampling Distributions with the Statistical Inferences (통계적 추론에서의 표집분포 개념 지도를 위한 시뮬레이션 소프트웨어 설계 및 구현)

  • Lee, Young-Ha;Lee, Eun-Ho
    • School Mathematics
    • /
    • v.12 no.3
    • /
    • pp.273-299
    • /
    • 2010
  • The purpose of the study is designing and implementing 'Sampling Distributions Simulation' to help students to understand concepts of sampling distributions. This computer simulation is developed to help students understand sampling distributions more easily. 'Sampling Distributions Simulation' consists of 4 sessions. 'The first session - Confidence level and confidence intervals - includes checking if the intended confidence level is actually achieved by the real relative frequency for the obtained sample confidence intervals containing population mean. This will give the students clearer idea about confidence level and confidence intervals in addition to the role of sampling distribution of the sample means among those. 'The second session - Sampling Distributions - helps understand sampling distribution of the sample means, through the simulation method to make comparison between the histogram of sampling distributions and that of the population. The third session - The Central Limit Theorem - includes calculating the means of the samples taken from a population which follows a uniform distribution or follows a Bernoulli distribution and then making the histograms of those means. This will provides comprehension of the central limit theorem, which mentions about the sampling distribution of the sample means when the sample size is very large. The forth session - the normal approximation to the binomial distribution - helps understand the normal approximation to the binomial distribution as an alternative version of central limit theorem. With the practical usage of the shareware 'Sampling Distributions Simulation', we expect students to have a new vision on the sampling distribution and to get more emphasis on it. With the sound understandings on the sampling distributions, more accurate and profound statistical inferences are expected. And the role of the sampling distribution in the inferences should be more deeply appreciated.

  • PDF

Estimation in the exponential distribution under progressive Type I interval censoring with semi-missing data

  • Shin, Hyejung;Lee, Kwangho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.6
    • /
    • pp.1271-1277
    • /
    • 2012
  • In this paper, we propose an estimation method of the parameter in an exponential distribution based on a progressive Type I interval censored sample with semi-missing observation. The maximum likelihood estimator (MLE) of the parameter in the exponential distribution cannot be obtained explicitly because the intervals are not equal in length under the progressive Type I interval censored sample with semi-missing data. To obtain the MLE of the parameter for the sampling scheme, we propose a method by which progressive Type I interval censored sample with semi-missing data is converted to the progressive Type II interval censored sample. Consequently, the estimation procedures in the progressive Type II interval censored sample can be applied and we obtain the MLE of the parameter and survival function. It will be shown that the obtained estimators have good performance in terms of the mean square error (MSE) and mean integrated square error (MISE).

Modified Ranked Ordering Set Samples for Estimating the Population Mean

  • Kim, Hyun-Gee;Kim, Dong-Hee
    • Communications for Statistical Applications and Methods
    • /
    • v.14 no.3
    • /
    • pp.641-648
    • /
    • 2007
  • We propose the new sampling method, called modified ranked ordering set sampling (MROSS). Kim and Kim (2003) suggested the sign test using the ranked ordering set sampling (ROSS), and showed that the asymptotic relative efficiency (ARE) of ROSS against RSS for sign test increases as sample size does. We propose the estimator for the population mean using MROSS. The relative precision (RP) of estimator of the population mean using MROSS method with respect to the usual estimator using modified RSS is higher, and when the underlying distribution is skewed, the bias of the proposed estimator is smaller than that of several ranked set sampling estimators.

Mean estimation of small areas using penalized spline mixed-model under informative sampling

  • Chytrasari, Angela N.R.;Kartiko, Sri Haryatmi;Danardono, Danardono
    • Communications for Statistical Applications and Methods
    • /
    • v.27 no.3
    • /
    • pp.349-363
    • /
    • 2020
  • Penalized spline is a suitable nonparametric approach in estimating mean model in small area. However, application of the approach in informative sampling in a published article is uncommon. We propose a semiparametric mixed-model using penalized spline under informative sampling to estimate mean of small area. The response variable is explained in terms of mean model, informative sample effect, area random effect and unit error. We approach the mean model by penalized spline and utilize a penalized spline function of the inclusion probability to account for the informative sample effect. We determine the best and unbiased estimators for coefficient model and derive the restricted maximum likelihood estimators for the variance components. A simulation study shows a decrease in the average absolute bias produced by the proposed model. A decrease in the root mean square error also occurred except in some quadratic cases. The use of linear and quadratic penalized spline to approach the function of the inclusion probability provides no significant difference distribution of root mean square error, except for few smaller samples.

Design wind speed prediction suitable for different parent sample distributions

  • Zhao, Lin;Hu, Xiaonong;Ge, Yaojun
    • Wind and Structures
    • /
    • v.33 no.6
    • /
    • pp.423-435
    • /
    • 2021
  • Although existing algorithms can predict wind speed using historical observation data, for engineering feasibility, most use moment methods and probability density functions to estimate fitted parameters. However, extreme wind speed prediction accuracy for long-term return periods is not always dependent on how the optimized frequency distribution curves are obtained; long-term return periods emphasize general distribution effects rather than marginal distributions, which are closely related to potential extreme values. Moreover, there are different wind speed parent sample types; how to theoretically select the proper extreme value distribution is uncertain. The influence of different sampling time intervals has not been evaluated in the fitting process. To overcome these shortcomings, updated steps are introduced, involving parameter sensitivity analysis for different sampling time intervals. The extreme value prediction accuracy of unknown parent samples is also discussed. Probability analysis of mean wind is combined with estimation of the probability plot correlation coefficient and the maximum likelihood method; an iterative estimation algorithm is proposed. With the updated steps and comparison using a Monte Carlo simulation, a fitting policy suitable for different parent distributions is proposed; its feasibility is demonstrated in extreme wind speed evaluations at Longhua and Chuansha meteorological stations in Shanghai, China.

A Time Truncated Two-Stage Group Sampling Plan for Weibull Distribution

  • Aslam, Muhammad;Jun, Chi-Hyuck;Rasool, Mujahid;Ahmad, Munir
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.1
    • /
    • pp.89-98
    • /
    • 2010
  • In this paper, a two-stage group sampling plan based on the time truncated life test is proposed for the Weibull distribution. The design parameters such as the number of groups and the acceptance number in each stage are determined by satisfying the producer's and consumer's risks simultaneously when the group size and the test duration are specified. The acceptable reliability level is expressed by the ratio of the true mean life to the specified life. It was demonstrated from the comparison with single-stage group sampling plans that the proposed plan can reduce the average sample number or improve the operating characteristics.