• Title/Summary/Keyword: Robust Statistics


Robust Discriminant Analysis using Minimum Disparity Estimators

  • 조미정;홍종선;정동빈
    • Proceedings of the Korean Statistical Society Conference / 2004.11a / pp.135-140 / 2004
  • Minimum disparity estimators, introduced by Lindsay and Basu (1994), are useful tools for analyzing real data. In this paper, we confirm through simulation that the minimum generalized negative exponential disparity estimator (MGNEDE) is efficient and robust under contaminated normal models compared with the maximum likelihood estimator (MLE) and the minimum blended weight Hellinger distance estimator (MBWHDE). In addition, by comparing the classification rates obtained when discrimination is carried out with the parameters estimated by each of the three estimators, we show that MGNEDE and MBWHDE can be used as alternatives to the MLE under contaminated normal models.

The Detection and Testing of Multiple Outliers in Linear Regression

  • Park, Jin-Pyo;Zamar, Ruben H.
    • Journal of the Korean Data and Information Science Society / v.15 no.4 / pp.921-934 / 2004
  • We consider the problem of identifying and testing outliers in linear regression. First, we consider scale-ratio tests of the null hypothesis of no outliers. A test based on the ratio of two residual scale estimates is proposed. We derive the asymptotic distribution of the test statistic and investigate the properties of the test. Next, we consider the problem of identifying the outliers. A forward procedure based on the suggested test is proposed and shown to perform fairly well. The forward procedure is unaffected by masking and swamping effects because the test statistic uses a robust scale estimate.

A Distribution-Free Rank Test for Ordered Alternatives in a Randomized Block Design

  • Kim, Dong-Hee;Song, Moon-Sup;Kim, Woo-Chul
    • Journal of the Korean Statistical Society / v.15 no.1 / pp.9-25 / 1986
  • In this paper we propose a distribution-free rank test for ordered alternatives in a randomized block design and investigate its properties. The proposed test extends the Page test to allow replications in each cell. Some asymptotic properties, including asymptotic relative efficiencies (AREs), are investigated. A small-sample Monte Carlo study was performed to compare the powers of the tests considered in this paper. The results show that the proposed test is robust and efficient in the case of equally spaced treatment effects.

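For orientation, the classical Page statistic that the entry above extends can be sketched as follows. This is the standard unreplicated Page L statistic, not the replicated version proposed in the paper; the function names `within_block_ranks` and `page_l_statistic` are illustrative choices, and treatments are assumed to be listed in the hypothesized increasing order.

```python
def within_block_ranks(block):
    """Return 1-based mid-ranks of the values in one block (ties share the average rank)."""
    order = sorted(range(len(block)), key=lambda i: block[i])
    ranks = [0.0] * len(block)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of observations tied with position i.
        while j + 1 < len(order) and block[order[j + 1]] == block[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for idx in order[i:j + 1]:
            ranks[idx] = average_rank
        i = j + 1
    return ranks


def page_l_statistic(blocks):
    """Page's L = sum over treatments j of j * R_j, where R_j is the rank sum of treatment j.

    `blocks` is a list of equal-length rows, one row per block, with columns
    ordered according to the hypothesized ordering of treatment effects.
    """
    k = len(blocks[0])
    rank_sums = [0.0] * k
    for block in blocks:
        for j, rank in enumerate(within_block_ranks(block)):
            rank_sums[j] += rank
    return sum((j + 1) * rank_sums[j] for j in range(k))


if __name__ == "__main__":
    data = [[1, 2, 3], [1, 3, 2], [2, 1, 3]]  # made-up illustration data
    print(page_l_statistic(data))
```

Large values of L support the ordered alternative; significance is usually judged against tables or a normal approximation, which this sketch leaves out.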

The Bivariate Kumaraswamy Weibull regression model: a complete classical and Bayesian analysis

  • Fachini-Gomes, Juliana B.;Ortega, Edwin M.M.;Cordeiro, Gauss M.;Suzuki, Adriano K.
    • Communications for Statistical Applications and Methods / v.25 no.5 / pp.523-544 / 2018
  • Bivariate distributions play a fundamental role in survival and reliability studies. We consider a regression model for bivariate survival times under right censoring, based on the bivariate Kumaraswamy Weibull distribution (Cordeiro et al., Journal of the Franklin Institute, 347, 1399-1429, 2010), to model the dependence of bivariate survival data. We describe some structural properties of the marginal distributions. The method of maximum likelihood and a Bayesian procedure are adopted to estimate the model parameters. We use diagnostic measures based on local influence and Bayesian case influence diagnostics to detect influential observations in the new model. We also show that the estimates in the bivariate Kumaraswamy Weibull regression model are robust to the presence of outliers in the data. In addition, we use some goodness-of-fit measures to evaluate the bivariate Kumaraswamy Weibull regression model. The methodology is illustrated by means of a real lifetime data set for kidney patients.

A Comparison of Methods for the Detection of Outliers in Multivariate Data

  • Hadi, Ali-S.;Joo, Hye-Seon;Son, Mun-S.
    • Communications for Statistical Applications and Methods / v.3 no.2 / pp.53-67 / 1996
  • Numerous classical as well as robust methods have been proposed in the literature for the detection of multiple outliers in multivariate data. The effectiveness and power of each of these methods have not been thoroughly investigated. In this paper we first reduce the vast number of outlier detection methods to a small number of viable ones. This reduction is based on previous work by other researchers and on some theoretical arguments. We then design and implement a Monte Carlo experiment for comparing these methods. The main goal of our study is to determine which methods are most powerful in detecting multiple outliers and in dealing with the masking and swamping problems. The results of the Monte Carlo study indicate that two of the methods seem to perform better than the others for the detection of multiple outliers in multivariate data.

Fused inverse regression with multi-dimensional responses

  • Cho, Youyoung;Han, Hyoseon;Yoo, Jae Keun
    • Communications for Statistical Applications and Methods / v.28 no.3 / pp.267-279 / 2021
  • Regression with multi-dimensional responses is quite common in the so-called big data era. In such regression, dimension reduction of the predictors is essential to relieve the curse of dimensionality caused by the high dimension of the responses. Sufficient dimension reduction provides effective tools for this reduction, but few sufficient dimension reduction methodologies exist for multivariate regression. To fill this gap, we propose two fused slice-based inverse regression methods. The proposed approaches are robust to the numbers of clusters or slices and improve the estimation results over existing methods by fusing many kernel matrices. Numerical studies are presented and compared with existing methods. A real data analysis confirms the practical usefulness of the proposed methods.

Variable Selection for Logistic Regression Model Using Adjusted Coefficients of Determination

  • Hong C. S.;Ham J. H.;Kim H. I.
    • The Korean Journal of Applied Statistics / v.18 no.2 / pp.435-443 / 2005
  • Coefficients of determination in logistic regression analysis are defined by various statistics, and their values are relatively smaller than those for the linear regression model. These coefficients of determination are not generally used to evaluate and diagnose logistic regression models. Liao and McGee (2003) proposed two adjusted coefficients of determination that are robust to the addition of inappropriate predictors and to variation in sample size. In this work, these adjusted coefficients of determination are applied to variable selection for the logistic regression model and compared with the results of other methods such as forward selection, backward elimination, stepwise selection, and the AIC statistic.

A study on target Sigma Level at R&D stage and robust limits for design margins

  • Ko, Seoung-gon
    • The Korean Journal of Applied Statistics / v.29 no.2 / pp.369-379 / 2016
  • The Sigma Level, proposed by Motorola Inc., is one of the many Process Capability Indices (PCIs) that have been presented since the 1970s. It is used to evaluate process capability and, unlike other PCIs, has the advantage of using the population probability distribution. However, it was originally designed for mass production and is inadequate for evaluating prototypes or early products in the R&D stage. For use in such cases, we propose an R&D target Sigma Level, derived by considering, from a statistical point of view, the 1.5 sigma shift in the traditional sigma level. We also explain how to find robust limits for design tolerance, because the sigma level or defect probability is useful for establishing economical tolerance limits at the R&D stage and in mass production.
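
As background for the entry above, the conventional relation between a sigma level and a one-sided defect probability under the 1.5 sigma shift can be sketched as below. This is the traditional convention only, not the R&D target Sigma Level proposed in the paper; the function names and the `SHIFT` constant are illustrative, and a normal process with a one-sided specification limit is assumed.

```python
from statistics import NormalDist

SHIFT = 1.5  # conventional long-term mean shift, in standard deviations (assumption)


def dpmo_from_sigma_level(sigma_level, shift=SHIFT):
    """Defects per million opportunities for a one-sided limit at `sigma_level`."""
    tail_probability = 1.0 - NormalDist().cdf(sigma_level - shift)
    return 1e6 * tail_probability


def sigma_level_from_dpmo(dpmo, shift=SHIFT):
    """Inverse mapping: sigma level implied by an observed DPMO."""
    return NormalDist().inv_cdf(1.0 - dpmo / 1e6) + shift


if __name__ == "__main__":
    for level in (3.0, 4.0, 5.0, 6.0):
        print(level, round(dpmo_from_sigma_level(level), 1))
```

Under this convention a sigma level of 6 corresponds to roughly 3.4 defects per million opportunities.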

An Alternative Study of the Determination of the Threshold for the Generalized Pareto Distribution

  • Yoon, Jeong-Yoen;Cho, Jae-Beom;Jun, Byoung-Cheol
    • The Korean Journal of Applied Statistics / v.24 no.5 / pp.931-939 / 2011
  • In practice, the threshold of a generalized Pareto distribution is determined by one of two subjective assessment methods: the mean extreme function graph (MEF-graph) or the Hill-graph. To remedy the subjectivity of these methods, we propose an alternative method for determining the threshold based on robust statistics. We compared the MEF-graph, the Hill-graph, and our method through value-at-risk (VaR) estimates on Korean stock market data from January 5, 1987 to August 3, 2009. The VaR based on the proposed method is not much different from that of the existing methods, and the standard deviation of the VaR for our method was the smallest. The results show that our method can be a promising alternative for determining the thresholds of generalized Pareto distributions.
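
To make the Hill-graph mentioned above concrete, the following sketch computes the classical Hill estimate for every candidate number k of upper order statistics; plotting the estimates against k gives the Hill-graph from which a threshold is usually read off by eye. It illustrates only the existing graphical tool, not the robust method proposed in the paper, and the function name `hill_estimates` is an illustrative choice.

```python
import math


def hill_estimates(sample):
    """Return (k, H_k) pairs, where H_k is the Hill estimate from the k largest values.

    H_k = (1/k) * sum_{i=1..k} log X_(i) - log X_(k+1), with X_(1) >= X_(2) >= ...
    Non-positive observations are dropped because the estimator uses logarithms.
    """
    ordered = sorted((x for x in sample if x > 0), reverse=True)
    logs = [math.log(x) for x in ordered]
    estimates = []
    running_sum = 0.0
    for k in range(1, len(logs)):
        running_sum += logs[k - 1]
        estimates.append((k, running_sum / k - logs[k]))
    return estimates


if __name__ == "__main__":
    losses = [12.0, 0.7, 3.1, 1.9, 25.4, 0.4, 5.6, 2.2]  # made-up illustration data
    for k, h in hill_estimates(losses):
        print(k, round(h, 3))
```

A threshold is typically chosen where the plotted estimates look stable, which is the subjective step the paper aims to replace.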

The Maximin Robust Design for the Uncertainty of Parameters of Michaelis-Menten Model

  • Kim, Youngil;Jang, Dae-Heung;Yi, Seongbaek
    • The Korean Journal of Applied Statistics / v.27 no.7 / pp.1269-1278 / 2014
  • Although the D-optimality criterion has become very popular for designing experiments for nonlinear models because of the theoretical foundation it provides, a critical issue is that the criterion depends on the unknown parameters of the nonlinear model. Some nonlinear models, however, turn out to be partially nonlinear in the sense that the optimal design depends on only a subset of the parameters. It has been widely believed that the maximin approach to finding a design robust to parameter uncertainty is not guaranteed to succeed in nonlinear models. The maximin approach can nevertheless succeed for partially nonlinear models, because the optimal design often depends on only one unknown parameter value, which is easier to handle than the full set of parameters. We study the maximin approach for the Michaelis-Menten model with respect to D- and $D_s$-optimality.