• Title/Summary/Keyword: balanced sample

Search Result 109, Processing Time 0.029 seconds

Unbiased Balanced Half-Sample Variance Estimation in Stratified Two-stage Sampling

  • Kim, Kyu-Seong
    • Journal of the Korean Statistical Society
    • /
    • v.27 no.4
    • /
    • pp.459-469
    • /
    • 1998
  • Balanced half sample method is a simple variance estimation method for complex sampling designs. Since it is simple and flexible, it has been widely used in large scale sample surveys. However, the usual BHS method overestimate the true variance in without replacement sampling and two-stage cluster sampling. Focusing on this point , we proposed an unbiased BHS variance estimator in a stratified two-stage cluster sampling and then described an implementation method of the proposed estimator. Finally, partially BHS design is explained as a tool of reducing the number of replications of the proposed estimator.

  • PDF

Study on the Size of Minimal Standardized Detectable Difference in Balanced Design of Experiments (실험계획법에서 최소 표준화 검출 가능 효과의 크기에 관한 연구)

  • 임용빈
    • Journal of Korean Society for Quality Management
    • /
    • v.26 no.4
    • /
    • pp.239-249
    • /
    • 1998
  • In balanced design of experiment sample size is determined to detect a certain standardized size of effect with power 1-$\beta$ at the level of significance $\alpha$. Tables (Marvin, et al (1970) and Lorenzen and Anderson(1993) and charts (Odeh and Fox(1991)) are available to determine the sample size in balanced design of experiments. To simplify those tables and charts simple MATLAB program is used to find the minimal standardized detectable difference $\delta$ when $\alpha , \beta$ and sample size are given.

  • PDF

THREE-WAY BALANCED MULTI-LEVEL ROTATION SAMPLING DESIGNS

  • Park, Y. S.;Kim, K. W.;Kim, N. Y.
    • Journal of the Korean Statistical Society
    • /
    • v.32 no.3
    • /
    • pp.245-259
    • /
    • 2003
  • The 2-way balanced one-level rotation design has been discussed (Park et al., 2001), where the 2-way balancing is done on interview time in monthly sample and rotation group. We extend it to 3-way balanced multi-level design to obtain more information of the same sample unit for one or more previous months. The 3-way balancing is accomplished not only on interview time in monthly sample and rotation group but also on recall time as well. The 3-way balancing eliminates or reduces any bias arising from unbalanced interview time, rotation group and recall time, and all rotation groups are equally represented in the monthly sample. We present the rule and rotation algorithm which guarantee the 3-way balancing. In particular, we specify the necessary and sufficient condition for the 3-way balanced multi-level rotation design.

An Estimator of Population Mean Based on Balanced Systematic Sampling When Both the Sample Size and the Reciprocal of the Sampling Fraction are Odd Numbers

  • Kim, Hyuk-Joo
    • Communications for Statistical Applications and Methods
    • /
    • v.14 no.3
    • /
    • pp.667-677
    • /
    • 2007
  • In this paper, we propose a method for estimating the mean of a population which has a linear trend, when both n, the sample size, and k, the reciprocal of the sampling fraction, are odd numbers. The proposed method, not having the drawbacks of centered systematic sampling, centered modified sampling and centered balanced sampling, consists of selecting a sample by balanced systematic sampling and estimating the population mean by using interpolation. We compare the efficiency of the proposed method and existing methods under the criterion of the expected mean square error based on the infinite superpopulation model.

Three-Way Balanced Multi-level Semi Rotation Sampling Designs

  • Park, You-Sung;Choi, Jai-Won;Kim, Kee-Whan
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2002.05a
    • /
    • pp.19-24
    • /
    • 2002
  • The two-way balanced one-level rotation design has been discussed (Park, Kim and Choi, 2001), where the two-way balancing is done on interview time in monthly sample and rotation group. We extend it to three-way balanced multi-level design under the most general rotation system. The three-way balancing is accomplished on interview time not only in monthly sample and rotation group but also in recall time. We present the necessary condition and rotation algorithm which guarantee the three-way balancing. We propose multi-level composite estimators (MCE) from this design and derive their variances and mean squared errors (MSE), assuming the correlation from the measurements of the same sample unit and three types of biases in monthly sample.

  • PDF

Image Classification using Class-Balanced Loss (Class-Balanced Loss를 이용한 이미지 분류)

  • Jihee Park;Wonjun Hwang
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.11a
    • /
    • pp.164-166
    • /
    • 2022
  • Long-tail problem은 class 별로 sample의 개수에 차이가 있어 성능에 안 좋은 영향을 미치는 것을 말한다. 본 논문에서는 cost-sensitive learning 중 Class-Balanced Loss를 이용해 성능을 개선하여 Long-tail problem을 해결하려고 한다. 먼저, balanced data set과 imbalanced data set의 성능 차이를 살펴보도록 할 것이다. 그 후, Class-Balanced Loss를 3가지 버전으로 이용해 그 성능을 측정하고 분석해 볼 것이다.

  • PDF

Estimation of the Parameter of a Bernoulli Distribution Using a Balanced Loss Function

  • Farsipour, N.Sanjari;Asgharzadeh, A.
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.3
    • /
    • pp.889-898
    • /
    • 2002
  • In decision theoretic estimation, the loss function usually emphasizes precision of estimation. However, one may have interest in goodness of fit of the overall model as well as precision of estimation. From this viewpoint, Zellner(1994) proposed the balanced loss function which takes account of both "goodness of fit" and "precision of estimation". This paper considers estimation of the parameter of a Bernoulli distribution using Zellner's(1994) balanced loss function. It is shown that the sample mean $\overline{X}$, is admissible. More general results, concerning the admissibility of estimators of the form $a\overline{X}+b$ are also presented. Finally, minimax estimators and some numerical results are given at the end of paper,at the end of paper.

Default Prediction for Real Estate Companies with Imbalanced Dataset

  • Dong, Yuan-Xiang;Xiao, Zhi;Xiao, Xue
    • Journal of Information Processing Systems
    • /
    • v.10 no.2
    • /
    • pp.314-333
    • /
    • 2014
  • When analyzing default predictions in real estate companies, the number of non-defaulted cases always greatly exceeds the defaulted ones, which creates the two-class imbalance problem. This lowers the ability of prediction models to distinguish the default sample. In order to avoid this sample selection bias and to improve the prediction model, this paper applies a minority sample generation approach to create new minority samples. The logistic regression, support vector machine (SVM) classification, and neural network (NN) classification use an imbalanced dataset. They were used as benchmarks with a single prediction model that used a balanced dataset corrected by the minority samples generation approach. Instead of using prediction-oriented tests and the overall accuracy, the true positive rate (TPR), the true negative rate (TNR), G-mean, and F-score are used to measure the performance of default prediction models for imbalanced dataset. In this paper, we describe an empirical experiment that used a sampling of 14 default and 315 non-default listed real estate companies in China and report that most results using single prediction models with a balanced dataset generated better results than an imbalanced dataset.

An Improved Sample Balanced Genetic Algorithm and Extreme Learning Machine for Accurate Alzheimer Disease Diagnosis

  • Sachnev, Vasily;Suresh, Sundaram
    • Journal of Computing Science and Engineering
    • /
    • v.10 no.4
    • /
    • pp.118-127
    • /
    • 2016
  • An improved sample balanced genetic algorithm and Extreme Learning Machine (iSBGA-ELM) was designed for accurate diagnosis of Alzheimer disease (AD) and identification of biomarkers associated with AD in this paper. The proposed AD diagnosis approach uses a set of magnetic resonance imaging scans in Open Access Series of Imaging Studies (OASIS) public database to build an efficient AD classifier. The approach contains two steps: "voxels selection" based on an iSBGA and "AD classification" based on the ELM. In the first step, the proposed iSBGA searches for a robust subset of voxels with promising properties for further AD diagnosis. The robust subset of voxels chosen by iSBGA is then used to build an AD classifier based on the ELM. A robust subset of voxels keeps a high generalization performance of AD classification in various scenarios and highlights the importance of the chosen voxels for AD research. The AD classifier with maximum classification accuracy is created using an optimal subset of robust voxels. It represents the final AD diagnosis approach. Experiments with the proposed iSBGA-ELM using OASIS data set showed an average testing accuracy of 87%. Experiments clearly indicated the proposed iSBGA-ELM was efficient for AD diagnosis. It showed improvements over existing techniques.

The Influence of Family Functioning on the Exposure of Domestic Violence in Children (가족기능이 자녀의 가정폭력 노출에 미치는 영향)

  • Kim, Kyeong-Shin;Kim, Jeong-Ran
    • Korean Journal of Human Ecology
    • /
    • v.13 no.5
    • /
    • pp.691-699
    • /
    • 2004
  • The purpose of this study was to analyze the relationship between family functioning and family violence. The SPSS 10.0 for Windows was used to analyze data obtained through 1,044 children who live in Gwangju Chonnam area. Major findings are as follows: 1. Wife abuse and child abuse showed significant difference according to family cohesion, family flexibility, and family-system types. 2. In family violence non-experience group, 'balanced family' was found in 36.2% of the sample and 'extreme family' in 7.8%. In family violence experience group, 'balanced family' was found in 12.7% of the sample and 'extreme family' in 15.4%. 3. There were significant negative correlations between family functioning and family violence. Family cohesion and family flexibility had significant negative correlation with wife abuse and child abuse.

  • PDF