Search | Korea Science

Robust Variable Selection in Classification Tree

Jang Jeong Yee;Jeong Kwang Mo
- Proceedings of the Korean Statistical Society Conference
- /
- 2001.11a
- /
- pp.89-94
- /
- 2001
In this study we focus on variable selection in decision tree growing structure. Some of the splitting rules and variable selection algorithms are discussed. We propose a competitive variable selection method based on Kruskal-Wallis test, which is a nonparametric version of ANOVA F-test. Through a Monte Carlo study we note that CART has serious bias in variable selection towards categorical variables having many values, and also QUEST using F-test is not so powerful to select informative variables under heavy tailed distributions.
PDF

A Unit Root Test Based on Bootstrapping

Shin, Key-Il;Kang, Hee-Jeong
- Communications for Statistical Applications and Methods
- /
- v.3 no.1
- /
- pp.257-265
- /
- 1996
We consider nonstationary autoregressive autoregressive process with infinite variance of error. In the case of infinite cariance, the limiting distribution of the estimated coefficient is different from that under the finite cariance assumption. In this paper we show that the bootstrap method can be used to approximate the distribution of ordinary least squares estimator of the coefficient in the first order random walk process with infinite variance through some empirical studies and we suggest a test procedure based on bootstrap method for the unit root test.
PDF

A Study on Distribution Based on the Normalized Sample Lorenz Curve

Suk-Bok kang;Cho, Young-Suk
- Communications for Statistical Applications and Methods
- /
- v.8 no.1
- /
- pp.185-192
- /
- 2001
Using the Lorenz curve that is proved to be a powerful tool to measure the income inequality within a population of income receivers, we propose the normalized sample Lorenz curve for the goodness-of-fit test that is very important test in statistical analysis. For two hodgkin's disease data sets, we compare the Q-Q plot and the proposed normalized sample Lorenz curve.
PDF

Asymmetric Modeling in Beta-ARCH Processes

S. Y. Hwang;Kahng, Myung-Wook
- Journal of the Korean Statistical Society
- /
- v.31 no.4
- /
- pp.459-468
- /
- 2002
A class of asymmetric beta-ARCH processes is proposed and connections to traditional ARCH models are explained. Geometric ergodicity of the model is discussed. Conditional least squares as well as maximum likelihood estimators of parameters and their limit results are also presented. A test for symmetry of the model is studied with limiting power of test statistic given.
PDF KSCI

Test for Independence in Bivariate Pareto Model with Bivariate Random Censored Data

Cho, Jang-Sik;Kwon, Yong-Man;Choi, Seung-Bae
- Journal of the Korean Data and Information Science Society
- /
- v.15 no.1
- /
- pp.31-39
- /
- 2004
In this paper, we consider two components system which the lifetimes follow bivariate pareto model with bivariate random censored data. We assume that the censoring times are independent of the lifetimes of the two components. We develop large sample test for testing independence between two components. Also we present a simulation study which is the test based on asymptotic normal distribution in testing independence.
PDF

Improvement in Performing a Test for Additional Information (추가정보검정법 적용의 개선)

Kim, Myung-Geun
- The Korean Journal of Applied Statistics
- /
- v.18 no.3
- /
- pp.655-660
- /
- 2005
An iterative method that greatly reduces a burden of computation in performing Rao's test for additional information in two-group discriminant analysis is suggested. A numerical example is provided for illustration.
https://doi.org/10.5351/KJAS.2005.18.3.655 인용 PDF KSCI

Influence Analysis of the Liklihood Ratio Test in Multivariate Behrens-Fisher Problem

Jung, Kang-Mo;Kim, Myung-Geun
- Communications for Statistical Applications and Methods
- /
- v.6 no.3
- /
- pp.939-946
- /
- 1999
We propose methods for detecting influential observations that have a large influence on the likelihood ratio test statistic for the multivariate Behrens-Fisher problem. For this purpose we derive the influence curve and the derivative influence of the likelihood ratio test statistic. An illustrative example is given to show the effectiveness of the proposed methods on the identification of influential observations.
PDF

Error cause analysis of Pearson test statistics for k-population homogeneity test (k-모집단 동질성검정에서 피어슨검정의 오차성분 분석에 관한 연구)

Heo, Sunyeong
- Journal of the Korean Data and Information Science Society
- /
- v.24 no.4
- /
- pp.815-824
- /
- 2013
Traditional Pearson chi-squared test is not appropriate for the data collected by the complex sample design. When one uses the traditional Pearson chi-squared test to the complex sample categorical data, it may give wrong test results, and the error may occur not only due to the biased variance estimators but also due to the biased point estimators of cell proportions. In this study, the design based consistent Wald test statistics was derived for k-population homogeneity test, and the traditional Pearson chi-squared test statistics was partitioned into three parts according to the causes of error; the error due to the bias of variance estimator, the error due to the bias of cell proportion estimator, and the unseparated error due to the both bias of variance estimator and bias of cell proportion estimator. An analysis was conducted for empirical results of the relative size of each error component to the Pearson chi-squared test statistics. The second year data from the fourth Korean national health and nutrition examination survey (KNHANES, IV-2) was used for the analysis. The empirical results show that the relative size of error from the bias of variance estimator was relatively larger than the size of error from the bias of cell proportion estimator, but its degrees were different variable by variable.
https://doi.org/10.7465/jkdi.2013.24.4.815 인용 PDF KSCI

On the distribution-free tests for umbrella alternatives in a randomized block design (화률화 블록 계획법에서 우산형 대립가설에 대한 분포부관 검정법의 연구)

김동희;김영철
- The Korean Journal of Applied Statistics
- /
- v.5 no.1
- /
- pp.41-57
- /
- 1992
Distribution-free test for umbrella alternatives in a randomized block design is proposed and asymptotic properties test statistics and the asymptotic relatives efficiency (ARE) of the proposed test statistics with respect to the Puri's parametric method are investigated. For given peak points 2,3,4, with 4 blocks and 5 treatments, and with 3 blocks and 5 treatments : for given peak point 3, with 2 blocks and 4 treatments : from the small sample Monte Carlo Study, the empirical powers between the proposed test and Puri's test are compared. Throughout the simulation results, the proposed test statistic is efficient for the heavy tailed distributions.
PDF

Genetic association tests when a nuisance parameter is not identifiable under no association

Kim, Wonkuk;Kim, Yeong-Hwa
- Communications for Statistical Applications and Methods
- /
- v.24 no.6
- /
- pp.663-671
- /
- 2017
Some genetic association tests include an unidentifiable nuisance parameter under the null hypothesis of no association. When the mode of inheritance (MOI) is not specified in a case-control design, the Cochran-Armitage (CA) trend test contains an unidentifiable nuisance parameter. The transmission disequilibrium test (TDT) in a family-based association study that includes the unaffected also contains an unidentifiable nuisance parameter. The hypothesis tests that include an unidentifiable nuisance parameter are typically performed by taking a supremum of the CA tests or TDT over reasonable values of the parameter. The p-values of the supremum test statistics cannot be obtained by a normal or chi-square distribution. A common method is to use a Davies's upper bound of the p-value instead of an exact asymptotic p-value. In this paper, we provide a unified sine-cosine process expression of the CA trend test that does not specify the MOI and the TDT that includes the unaffected. We also present a closed form expression of the exact asymptotic formulas to calculate the p-values of the supremum tests when the score function can be written as a linear form in an unidentifiable parameter. We illustrate how to use the derived formulas using a pharmacogenetics case-control dataset and an attention deficit hyperactivity disorder family-based example.
https://doi.org/10.29220/CSAM.2017.24.6.663 인용 PDF KSCI

Search Result 6,465, Processing Time 0.033 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)