• Title/Abstract/Keywords: Sampling distribution of sample mean

Variance estimation for distribution rate in stratified cluster sampling with missing values

  • Heo, Sunyeong
    • Journal of the Korean Data and Information Science Society / Vol. 28, No. 2 / pp. 443-449 / 2017
  • Population proportions, such as the distribution rate of LED TVs or the prevalence of a disease, are often estimated from survey sample data. A population proportion is generally treated as a special form of the population mean. In complex sampling, such as stratified multistage sampling with unequal selection probabilities, the denominator of the mean may be a random variable, so the mean is estimated as a ratio estimator. In this research, we examined the estimation of a distribution rate under stratified multistage sampling and computed numerical results using stratified random sample data with about 25% missing observations. In the data used for this research, the survey weights were determined deterministically. The weights are therefore not random variables, and the population distribution rate and its variance can be estimated in the same way as a population mean. When the weights are not random variables, estimating the variance of the proportion estimator with the ratio method may inflate the variance. Therefore, before choosing a variance estimation method for a population proportion, we need to examine the structure of the data and the survey design.
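The point about fixed weights can be illustrated with the textbook stratified estimator of a proportion. The sketch below is an assumption-laden illustration, not the paper's procedure: the function name `stratified_proportion`, the input layout, and the choice to simply drop missing responses before estimation are all hypothetical.

```python
def stratified_proportion(strata):
    """Estimate a population proportion from a stratified random sample.

    `strata` is a list of (N_h, y_h) pairs (hypothetical layout):
    N_h is the stratum population size, y_h the observed 0/1 responses
    with missing values already dropped (an assumption of this sketch).
    The stratum weights W_h = N_h / N are fixed numbers, not random.
    """
    N = sum(N_h for N_h, _ in strata)
    p_hat = 0.0
    var_hat = 0.0
    for N_h, y_h in strata:
        n_h = len(y_h)
        p_h = sum(y_h) / n_h          # stratum sample proportion
        W_h = N_h / N                 # deterministic stratum weight
        f_h = n_h / N_h               # sampling fraction
        p_hat += W_h * p_h
        # Standard variance estimator for a stratified sample proportion
        var_hat += W_h**2 * (1 - f_h) * p_h * (1 - p_h) / (n_h - 1)
    return p_hat, var_hat


# Made-up illustrative data: two strata with 100 respondents each
strata = [
    (1000, [1] * 30 + [0] * 70),
    (3000, [1] * 20 + [0] * 80),
]
p_hat, var_hat = stratified_proportion(strata)
print(p_hat, var_hat ** 0.5)
```

Because the weights here are constants, the estimator is a weighted sum of stratum means and no ratio-type linearization is needed.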

A Study of Using the Terminology of Sampling Error and Sampling Distribution

  • 김응환
    • 한국학교수학회논문집 / Vol. 9, No. 3 / pp. 309-316 / 2006
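  • This paper surveys mathematics teachers about the semantic confusion and ambiguity in the statistical terms currently used in secondary school statistics education. For teaching the probability distribution of the sample mean, it proposes introducing the terms 표집오차 (sampling error) and 표집분포 (sampling distribution) of the sample mean in the context of sampling (표집, 표본추출) and using them consistently. In the definitions of terms and explanations of concepts in current middle and high school statistics, there are differences, semantic confusion, and a lack of consistency in definitions even among the 12 textbooks authorized by the Ministry of Education and the national textbook. This was found to cause serious misconceptions among the mathematics teachers who teach statistics and among their students, and the ambiguity was found to have a negative effect on the affective side, that is, on interest in and attitude toward statistics as a discipline. For effective teaching of the probability distribution of the sample mean, this study proposes using 표집오차 instead of 표본오차 and introducing the term 표집분포. It argues that accurate use of statistical terms is needed so that teachers and students form and understand the concepts correctly and so that consistency and sequence in statistics education are maintained.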

A Bayesian Multiple Testing of Detecting Differentially Expressed Genes in Two-sample Comparison Problem

  • Oh Hyun-Sook; Yang Wan-Youn
    • Communications for Statistical Applications and Methods / Vol. 13, No. 1 / pp. 39-47 / 2006
  • The Bayesian multiple testing procedure for the one-sample testing problem proposed by Scott and Berger (2003) is extended to the two-sample comparison problem in microarray experiments. The prior distribution of each gene's mean in one sample is specified conditionally on the corresponding gene's mean in the other sample. Posterior distributions of the parameters of interest are derived and estimated using an importance sampling method. A simulated example is given for illustration.

A Bulk Sampling Plan for Reliability Assurance

  • 김동철; 김종걸
    • 대한안전경영과학회지 / Vol. 9, No. 2 / pp. 123-134 / 2007
  • This paper focuses on an in-house reliability assurance plan for a company's bulk materials. Reliability assurance inherently requires long and costly material testing. To reduce the time and cost, an accelerated life test is adopted, and the bulk sampling technique is used for acceptance. The design parameters may include the total sample size (segments and increments), the stress level, and so on. We focus on determining the sample size by minimizing the asymptotic variance of the test statistic while satisfying the consumer's risk. In bulk sampling, we also derive the sample size by adopting the normal lifetime distribution model obtained by transforming the variable of the lognormal lifetime distribution. In addition, the sample sizes for both the segments and the increments can be derived by minimizing the asymptotic variance of the test statistics of the segments and increments with the consumer's risk met. Using the calculated minimum sample size, we can assure the reliability of the mean life and the B100p life of the bulk materials.

The Design and Implementation to Teach Sampling Distributions with the Statistical Inferences

  • 이영하; 이은호
    • 대한수학교육학회지:학교수학 / Vol. 12, No. 3 / pp. 273-299 / 2010
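  • The purpose of this paper is to design and implement a 'Sampling Distributions Simulation' so that high-school-level students can learn the concept of sampling distributions. The 'Sampling Distributions Simulation' consists of four lessons: Lesson 1, learning the meaning of confidence level and confidence intervals; Lesson 2, learning the meaning of sampling distributions; Lesson 3, learning the meaning of the central limit theorem; Lesson 4, learning the normal approximation to the binomial distribution. We expect this study to change students' awareness of the importance of sampling distributions and to improve their understanding. We also expect that lessons using the resulting program, 'Sampling Distributions Simulation', will improve statistical reasoning ability and that the role of sampling distributions in statistical inference will be fully understood.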
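A simulation of the kind described (Lessons 2 and 3) can be sketched in a few lines. This is only an illustrative sketch, not the authors' software: the function name `sample_means`, the exponential population, and the sample sizes are assumptions chosen to show a skewed population whose sample means concentrate around the population mean with spread close to sigma / sqrt(n).

```python
import random
import statistics


def sample_means(population, n, reps, seed=0):
    """Draw `reps` samples of size n (with replacement) and return their means."""
    rng = random.Random(seed)
    return [statistics.fmean(rng.choices(population, k=n)) for _ in range(reps)]


# Hypothetical skewed population: 10,000 draws from an exponential distribution
_rng = random.Random(1)
population = [_rng.expovariate(1.0) for _ in range(10_000)]
mu = statistics.fmean(population)
sigma = statistics.pstdev(population)

for n in (2, 10, 50):
    means = sample_means(population, n, reps=5000)
    # Compare the empirical spread of the sample means with sigma / sqrt(n)
    print(n, statistics.fmean(means), statistics.stdev(means), sigma / n**0.5)
```

Plotting a histogram of `means` for each n would show the shape moving from skewed toward bell-shaped, which is the central limit theorem behaviour the lessons aim at.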

Estimation in the exponential distribution under progressive Type I interval censoring with semi-missing data

  • Shin, Hyejung; Lee, Kwangho
    • Journal of the Korean Data and Information Science Society / Vol. 23, No. 6 / pp. 1271-1277 / 2012
  • In this paper, we propose a method for estimating the parameter of an exponential distribution based on a progressive Type I interval censored sample with semi-missing observations. The maximum likelihood estimator (MLE) of the parameter cannot be obtained explicitly because the intervals are not equal in length under the progressive Type I interval censored sample with semi-missing data. To obtain the MLE of the parameter for this sampling scheme, we propose a method by which a progressive Type I interval censored sample with semi-missing data is converted to a progressive Type II interval censored sample. Consequently, the estimation procedures for the progressive Type II interval censored sample can be applied, and we obtain the MLE of the parameter and of the survival function. It will be shown that the obtained estimators perform well in terms of the mean square error (MSE) and mean integrated square error (MISE).
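To see why unequal intervals rule out a closed form, it may help to look at the usual likelihood for progressive Type I interval censoring with complete counts and maximize it numerically. The sketch below is not the authors' Type II conversion method and does not model semi-missing data. The function names, the ternary search, and the example counts are assumptions made for illustration.

```python
from math import expm1, log


def log_likelihood(rate, bounds, failures, removals):
    """Exponential log-likelihood under progressive Type I interval censoring.

    bounds:   inspection times t_1 < ... < t_m (with t_0 = 0)
    failures: number of failures observed in each interval (t_{i-1}, t_i]
    removals: number of surviving units withdrawn at each t_i
    """
    ll = 0.0
    prev = 0.0
    for t, d, r in zip(bounds, failures, removals):
        if d > 0:
            # log(exp(-rate*prev) - exp(-rate*t)), written in a stable form
            ll += d * (-rate * prev + log(-expm1(-rate * (t - prev))))
        ll -= r * rate * t  # each withdrawn unit survived past t
        prev = t
    return ll


def mle_rate(bounds, failures, removals, lo=1e-9, hi=100.0, iters=200):
    """Maximize the (concave) log-likelihood over the rate by ternary search."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if log_likelihood(m1, bounds, failures, removals) < log_likelihood(
            m2, bounds, failures, removals
        ):
            lo = m1
        else:
            hi = m2
    return (lo + hi) / 2


# Made-up example with unequal interval lengths
print(mle_rate([1.0, 2.5, 3.0], failures=[12, 9, 3], removals=[2, 3, 21]))
```

With a single interval the maximizer has the closed form -ln(r / (d + r)) / t, which gives a convenient check of the numerical routine.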

Modified Ranked Ordering Set Samples for Estimating the Population Mean

  • Kim, Hyun-Gee; Kim, Dong-Hee
    • Communications for Statistical Applications and Methods / Vol. 14, No. 3 / pp. 641-648 / 2007
  • We propose a new sampling method called modified ranked ordering set sampling (MROSS). Kim and Kim (2003) suggested a sign test using ranked ordering set sampling (ROSS) and showed that the asymptotic relative efficiency (ARE) of ROSS against ranked set sampling (RSS) for the sign test increases with the sample size. We propose an estimator of the population mean using MROSS. The relative precision (RP) of the MROSS estimator of the population mean with respect to the usual estimator using modified RSS is higher, and when the underlying distribution is skewed, the bias of the proposed estimator is smaller than that of several ranked set sampling estimators.

Mean estimation of small areas using penalized spline mixed-model under informative sampling

  • Chytrasari, Angela N.R.; Kartiko, Sri Haryatmi; Danardono, Danardono
    • Communications for Statistical Applications and Methods / Vol. 27, No. 3 / pp. 349-363 / 2020
  • A penalized spline is a suitable nonparametric approach for estimating the mean model in small areas. However, published applications of the approach under informative sampling are uncommon. We propose a semiparametric mixed model using penalized splines under informative sampling to estimate small-area means. The response variable is explained in terms of a mean model, an informative sample effect, an area random effect, and a unit error. We approximate the mean model by a penalized spline and use a penalized spline function of the inclusion probability to account for the informative sample effect. We determine the best and unbiased estimators for the model coefficients and derive the restricted maximum likelihood estimators for the variance components. A simulation study shows a decrease in the average absolute bias produced by the proposed model. The root mean square error also decreased, except in some quadratic cases. Using a linear or a quadratic penalized spline for the function of the inclusion probability makes no significant difference in the distribution of the root mean square error, except for a few smaller samples.

Design wind speed prediction suitable for different parent sample distributions

  • Zhao, Lin; Hu, Xiaonong; Ge, Yaojun
    • Wind and Structures / Vol. 33, No. 6 / pp. 423-435 / 2021
  • Although existing algorithms can predict wind speed using historical observation data, for engineering feasibility, most use moment methods and probability density functions to estimate fitted parameters. However, extreme wind speed prediction accuracy for long-term return periods is not always dependent on how the optimized frequency distribution curves are obtained; long-term return periods emphasize general distribution effects rather than marginal distributions, which are closely related to potential extreme values. Moreover, there are different wind speed parent sample types; how to theoretically select the proper extreme value distribution is uncertain. The influence of different sampling time intervals has not been evaluated in the fitting process. To overcome these shortcomings, updated steps are introduced, involving parameter sensitivity analysis for different sampling time intervals. The extreme value prediction accuracy of unknown parent samples is also discussed. Probability analysis of mean wind is combined with estimation of the probability plot correlation coefficient and the maximum likelihood method; an iterative estimation algorithm is proposed. With the updated steps and comparison using a Monte Carlo simulation, a fitting policy suitable for different parent distributions is proposed; its feasibility is demonstrated in extreme wind speed evaluations at Longhua and Chuansha meteorological stations in Shanghai, China.
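As background for the moment-method baseline the abstract mentions, the sketch below fits a Gumbel (Type I extreme value) distribution to annual maximum wind speeds by the method of moments and computes the wind speed for a given return period. It is not the iterative algorithm proposed in the paper. The Gumbel choice, the function names, and the sample values are assumptions for illustration only.

```python
from math import log, pi, sqrt
from statistics import fmean, stdev

EULER_GAMMA = 0.5772156649015329


def gumbel_moments_fit(annual_maxima):
    """Method-of-moments estimates (location, scale) for a Gumbel distribution."""
    scale = stdev(annual_maxima) * sqrt(6) / pi
    loc = fmean(annual_maxima) - EULER_GAMMA * scale
    return loc, scale


def return_level(loc, scale, return_period_years):
    """Value exceeded on average once per `return_period_years` years."""
    non_exceedance = 1 - 1 / return_period_years
    return loc - scale * log(-log(non_exceedance))


# Made-up annual maximum wind speeds in m/s (not data from the paper)
maxima = [21.3, 24.8, 19.6, 27.1, 22.4, 25.9, 20.7, 23.5, 26.2, 22.0]
loc, scale = gumbel_moments_fit(maxima)
print(return_level(loc, scale, 50), return_level(loc, scale, 100))
```

Long return periods depend heavily on the fitted tail, which is why the choice of parent distribution and fitting method that the abstract discusses matters for the result.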

A Time Truncated Two-Stage Group Sampling Plan for Weibull Distribution

  • Aslam, Muhammad; Jun, Chi-Hyuck; Rasool, Mujahid; Ahmad, Munir
    • Communications for Statistical Applications and Methods / Vol. 17, No. 1 / pp. 89-98 / 2010
  • In this paper, a two-stage group sampling plan based on the time truncated life test is proposed for the Weibull distribution. The design parameters such as the number of groups and the acceptance number in each stage are determined by satisfying the producer's and consumer's risks simultaneously when the group size and the test duration are specified. The acceptable reliability level is expressed by the ratio of the true mean life to the specified life. It was demonstrated from the comparison with single-stage group sampling plans that the proposed plan can reduce the average sample number or improve the operating characteristics.
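The way such a plan is evaluated can be sketched as follows. The failure probability by the truncation time follows directly from the Weibull distribution once the test duration and the true mean are expressed as ratios to the specified life. The acceptance rule, however, is an assumed simplification based on pooled failure counts, since the abstract does not state the exact rule. All function and parameter names are hypothetical.

```python
from math import comb, exp, gamma


def failure_probability(shape, test_time_ratio, mean_ratio):
    """P(an item fails before t0) for Weibull lifetimes.

    test_time_ratio = t0 / mu0 (test duration over specified mean life)
    mean_ratio      = mu / mu0 (true mean life over specified mean life)
    Uses mean = scale * Gamma(1 + 1/shape).
    """
    z = test_time_ratio * gamma(1 + 1 / shape) / mean_ratio
    return 1 - exp(-(z**shape))


def binom_pmf(n, k, p):
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def acceptance_probability(p, r, g1, g2, c1a, c1r, c2a):
    """Lot acceptance probability under an ASSUMED two-stage rule.

    Stage 1: test g1 groups of r items; accept if failures <= c1a,
             reject if failures > c1r, otherwise go to stage 2.
    Stage 2: test g2 more groups; accept if total failures <= c2a.
    """
    n1, n2 = r * g1, r * g2
    accept = sum(binom_pmf(n1, x, p) for x in range(min(c1a, n1) + 1))
    for x1 in range(c1a + 1, min(c1r, n1) + 1):
        stage2 = sum(binom_pmf(n2, x2, p) for x2 in range(min(c2a - x1, n2) + 1))
        accept += binom_pmf(n1, x1, p) * stage2
    return accept


# Operating characteristic at several mean ratios (illustrative parameters)
for ratio in (1, 2, 4, 6):
    p = failure_probability(shape=2, test_time_ratio=0.5, mean_ratio=ratio)
    print(ratio, acceptance_probability(p, r=5, g1=2, g2=2, c1a=0, c1r=2, c2a=2))
```

Evaluating the acceptance probability at the acceptable and the specified mean ratios is how the producer's and consumer's risks would be checked when searching for design parameters.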