• Title/Summary/Keyword: sample selection

Search Result 676, Processing Time 0.027 seconds

Statistical Problems Caused by Sample Censoring and Their Solutions -Focused on the application to consumer research- (표본중도절단에 따른 통계학적 문제와 교정방법에 관한 고찰 -소비자분야 연구에의 적용을 중심으로-)

  • 나명균
    • Journal of the Korean Home Economics Association
    • /
    • v.33 no.2
    • /
    • pp.19-27
    • /
    • 1995
  • This paper discusses the bias that results from using nonrandomly selectd samples of consumer research. A two stage system (maximum likelihood probit analysis and ordinary least square analysis) is a solution to sample selection bias. Empirical results show that correcting for sample selection bias improves the validity of consumer research results.

  • PDF

Supervised Rank Normalization with Training Sample Selection (학습 샘플 선택을 이용한 교사 랭크 정규화)

  • Heo, Gyeongyong;Choi, Hun;Youn, Joo-Sang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.20 no.1
    • /
    • pp.21-28
    • /
    • 2015
  • Feature normalization as a pre-processing step has been widely used to reduce the effect of different scale in each feature dimension and error rate in classification. Most of the existing normalization methods, however, do not use the class labels of data points and, as a result, do not guarantee the optimality of normalization in classification aspect. A supervised rank normalization method, combination of rank normalization and supervised learning technique, was proposed and demonstrated better result than others. In this paper, another technique, training sample selection, is introduced in supervised feature normalization to reduce classification error more. Training sample selection is a common technique for increasing classification accuracy by removing noisy samples and can be applied in supervised normalization method. Two sample selection measures based on the classes of neighboring samples and the distance to neighboring samples were proposed and both of them showed better results than previous supervised rank normalization method.

Sample selection approach using moving window for acoustic analysis of pathological sustained vowels according to signal typing

  • Lee, Ji-Yeoun
    • Phonetics and Speech Sciences
    • /
    • v.3 no.3
    • /
    • pp.99-108
    • /
    • 2011
  • The perturbation parameters like jitter, shimmer, and signal-to-noise ratio (SNR) are largely estimated in the particular segment from the subjective or whole portion of the given pathological voice signal although there are many possible regions to be able to analyze the voice signals. In this paper, the pathological voice signals were classified as type 1, 2, 3, or 4 according to narrow band spectrogram and the value differences of the perturbation parameters extracted in the subjective and entire portion tended to be getting bigger as from type 1 to type 4 signals. Therefore, sample selection method based on moving window to analyze type 2 and 3 signals as well as type 1 signals is proposed. Although type 3 signals cannot be analyzed using the perturbation analysis, the type 3 signals by selecting out the samples in which error count is less than 10 through moving window were analyzed. At present, there is no method to be able to analyze the type 4 signals. Future research will endeavor to determine the best way to evaluate such voices.

  • PDF

Bayesian estimation for finite population proportion under selection bias via surrogate samples

  • Choi, Seong Mi;Kim, Dal Ho
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.6
    • /
    • pp.1543-1550
    • /
    • 2013
  • In this paper, we study Bayesian estimation for the finite population proportion in binary data under selection bias. We use a Bayesian nonignorable selection model to accommodate the selection mechanism. We compare four possible estimators of the finite population proportions based on data analysis as well as Monte Carlo simulation. It turns out that nonignorable selection model might be useful for weekly biased samples.

A Novel Text Sample Selection Model for Scene Text Detection via Bootstrap Learning

  • Kong, Jun;Sun, Jinhua;Jiang, Min;Hou, Jian
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.2
    • /
    • pp.771-789
    • /
    • 2019
  • Text detection has been a popular research topic in the field of computer vision. It is difficult for prevalent text detection algorithms to avoid the dependence on datasets. To overcome this problem, we proposed a novel unsupervised text detection algorithm inspired by bootstrap learning. Firstly, the text candidate in a novel form of superpixel is proposed to improve the text recall rate by image segmentation. Secondly, we propose a unique text sample selection model (TSSM) to extract text samples from the current image and eliminate database dependency. Specifically, to improve the precision of samples, we combine maximally stable extremal regions (MSERs) and the saliency map to generate sample reference maps with a double threshold scheme. Finally, a multiple kernel boosting method is developed to generate a strong text classifier by combining multiple single kernel SVMs based on the samples selected from TSSM. Experimental results on standard datasets demonstrate that our text detection method is robust to complex backgrounds and multilingual text and shows stable performance on different standard datasets.

An Application of Heckman Two-step Procedure to Management Accounting and Firm Effectiveness: An Empirical Study from Vietnam

  • HUYNH, Quang Linh
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.9 no.2
    • /
    • pp.347-353
    • /
    • 2022
  • Using the Heckman two-step procedure, this study investigates the relationship between management accounting implementation and firm effectiveness. The research data for this study was acquired from 450 publicly traded companies in Vietnam; however, the final sample only includes 304 responses containing useful information. The reliability analysis was used to evaluate the acquired data to examine the qualities of constructs and the dimensions that make them up. Then, the Heckman two-step technique was performed to analyze the causal connection from the acceptance of management accounting to firm effectiveness allowing for the effect of environmental uncertainty and organizational characteristics on the likelihood of adopting management accounting. The empirical findings show that management accounting acceptance determines firm effectiveness; however, the research model on the relationship between management accounting adoption and firm effectiveness has a sample selection bias. The main conclusions of this study are that there is a difference in the effects of management accounting adoption on business effectiveness when sample selection bias is not taken into consideration. When potential sample selection bias is taken into account by integrating environmental uncertainty and organizational characteristics in the research model, the effect of adopting management accounting on company effectiveness becomes minor.

Prediction of Auditor Selection Using a Combination of PSO Algorithm and CART in Iran

  • Salehi, Mahdi;Kamalahmadi, Sharifeh;Bahrami, Mostafa
    • Journal of Distribution Science
    • /
    • v.12 no.3
    • /
    • pp.33-41
    • /
    • 2014
  • Purpose - The purpose of this study was to predict the selection of independent auditors in the companies listed on the Tehran Stock Exchange (TSE) using a combination of PSO algorithm and CART. This study involves applied research. Design, approach and methodology - The population consisted of all the companies listed on TSE during the period 2005-2010, and the sample included 576 data specimens from 95 companies during six consecutive years. The independent variables in the study were the financial ratios of the sample companies, which were analyzed using two data mining techniques, namely, PSO algorithm and CART. Results - The results of this study showed that among the analyzed variables, total assets, current assets, audit fee, working capital, current ratio, debt ratio, solvency ratio, turnover, and capital were predictors of independent auditor selection. Conclusion - The current study is practically the first to focus on this topic in the specific context of Iran. In this regard, the study may be valuable for application in developing countries.

A CONSISTENT AND BIAS CORRECTED EXTENSION OF AKAIKE'S INFORMATION CRITERION(AIC) : AICbc(k)

  • Kwon, Soon H.;Ueno, M.;Sugeno, M.
    • Journal of the Korean Society for Industrial and Applied Mathematics
    • /
    • v.2 no.1
    • /
    • pp.41-60
    • /
    • 1998
  • This paper derives a consistent and bias corrected extension of Akaike's Information Criterion (AIC), $AIC_{bc}$, based on Kullback-Leibler information. This criterion has terms that penalize the overparametrization more strongly for small and large samples than that of AIC. The overfitting problem of the asymptotically efficient model selection criteria for small and large samples will be overcome. The $AIC_{bc}$ also provides a consistent model order selection. Thus, it is widely applicable to data with small and/or large sample sizes, and to cases where the number of free parameters is a relatively large fraction of the sample size. Relationships with other model selection criteria such as $AIC_c$ of Hurvich, CAICF of Bozdogan and etc. are discussed. Empirical performances of the $AIC_{bc}$ are studied and discussed in better model order choices of a linear regression model using a Monte Carlo experiment.

  • PDF

Forecasting the Baltic Dry Index Using Bayesian Variable Selection (베이지안 변수선택 기법을 이용한 발틱건화물운임지수(BDI) 예측)

  • Xiang-Yu Han;Young Min Kim
    • Korea Trade Review
    • /
    • v.47 no.5
    • /
    • pp.21-37
    • /
    • 2022
  • Baltic Dry Index (BDI) is difficult to forecast because of the high volatility and complexity. To improve the BDI forecasting ability, this study apply Bayesian variable selection method with a large number of predictors. Our estimation results based on the BDI and all predictors from January 2000 to September 2021 indicate that the out-of-sample prediction ability of the ADL model with the variable selection is superior to that of the AR model in terms of point and density forecasting. We also find that critical predictors for the BDI change over forecasts horizon. The lagged BDI are being selected as an key predictor at all forecasts horizon, but commodity price, the clarksea index, and interest rates have additional information to predict BDI at mid-term horizon. This implies that time variations of predictors should be considered to predict the BDI.

Two-phase Adaptive Cluster Sampling with Unequal Probabilities Selection

  • Lee, Keejae
    • Journal of the Korean Statistical Society
    • /
    • v.27 no.3
    • /
    • pp.265-278
    • /
    • 1998
  • In this paper, we suggest two-phase adaptive cluster sampling schemes. The main feature of the two-phase sampling is that the information collected in the first phase sample is utilized in the selection of the second phase sample. The conventional two-phase sampling is, however, not sufficient to increase efficiency when the population of interest is rare and clustered. In the proposed sampling scheme, the first phase sample is selected with adaptive cluster sampling procedure and the second phase sample is selected by PPSWR and $\pi$PS sampling. We investigate unbiased estimators of population total and their variance for the proposed sampling schemes respectively. Finally we compare these suggested sampling schemes using numerical examples .

  • PDF