• 제목/요약/키워드: Outlier

검색결과 657건 처리시간 0.023초

Estimates for parameter changes in a uniform model with a generalized uniform outlier

  • Lee, Chang-Soo;Chang, Chu-Seock;Park, Yang-Woo
    • Journal of the Korean Data and Information Science Society
    • /
    • 제21권3호
    • /
    • pp.581-587
    • /
    • 2010
  • We shall propose several estimators for the scale parameter in a uniform distri-bution with a generalized uniform outlier when the scale parameter is a function of a known exposure level, and obtain expectations and variances for their proposed estima-tors. And we shall compare numerically efficiencies for proposed estimators of changed parameters of the scale in the small sample sizes.

Test for an Outlier in Multivariate Regression with Linear Constraints

  • Kim, Myung-Geun
    • Communications for Statistical Applications and Methods
    • /
    • 제9권2호
    • /
    • pp.473-478
    • /
    • 2002
  • A test for a single outlier in multivariate regression with linear constraints on regression coefficients using a mean shift model is derived. It is shown that influential observations based on case-deletions in testing linear hypotheses are determined by two types of outliers that are mean shift outliers with or without linear constraints, An illustrative example is given.

Procedures for Detecting Multiple Outliers in Linear Regression Using R

  • Kwon, Soon-Sun;Lee, Gwi-Hyun;Park, Sung-Hyun
    • 한국통계학회:학술대회논문집
    • /
    • 한국통계학회 2005년도 추계 학술발표회 논문집
    • /
    • pp.13-17
    • /
    • 2005
  • In recent years, many people use R as a statistics system. R is frequently updated by many R project teams. We are interested in the method of multiple outlier detection and know that R is not supplied the method of multiple outlier detection. In this talk, we review these procedures for detecting multiple outliers and provide more efficient procedures combined with direct methods and indirect methods using R.

  • PDF

치의학 연구에서 이상치의 처리 (Outlier detection in dental research)

  • 김기열
    • 대한치과의사협회지
    • /
    • 제55권9호
    • /
    • pp.604-616
    • /
    • 2017
  • In clinical dental research, errors occur in spite of careful study design and conduct. Data cleaning procedures intend to identify and correct these errors or at least to minimize their influence on study. Outlier is the one of these errors. Outlier detection is the first step in data analysis process which has a serious effect in the field of dental research. Hence, this paper aims to introduce the methods to detect the outliers and to examine their influences in statistical data analysis.

  • PDF

Jackknife Estimation in a Truncated Exponential Distribution with an Uniform Outlier

  • Lee, Chang-Soo;Chang, Chu-Seock;Park, Yang-Woo
    • Journal of the Korean Data and Information Science Society
    • /
    • 제17권3호
    • /
    • pp.1021-1028
    • /
    • 2006
  • We shall propose ML, ordinary jackknife and biased reducing estimators of the parameter in the right truncated exponential distribution with an unidentified uniform outlier when the truncated point is unknown and their biases and MSE's are compared numerically each other in the small sample sizes.

  • PDF

Outlier Detection Diagnostic based on Interpolation Method in Autoregressive Models

  • Cho, Sin-Sup;Ryu, Gui-Yeol;Park, Byeong-Uk;Lee, Jae-June
    • Journal of the Korean Statistical Society
    • /
    • 제22권2호
    • /
    • pp.283-306
    • /
    • 1993
  • An outlier detection diagnostic for the detection of k-consecutive atypical observations is considered. The proposed diagnostic is based on the innovational variance estimate utilizing both the interpolated and the predicted residuals. We adopt the interpolation method to construct the proposed diagnostic by replacing atypical observations. The perfomance of the proposed diagnositc is investigated by simulation. A real example is presented.

  • PDF

재무 시계열 자료의 이상치 탐지 기법 연구 (A Study on Outlier Detection Method for Financial Time Series Data)

  • 하명호;김삼용
    • 응용통계연구
    • /
    • 제23권1호
    • /
    • pp.41-47
    • /
    • 2010
  • 본 연구에서는 재무 시계열 자료를 분석하는데 있어 유용하게 쓰이는 이분산성 시계열 모형하에서 이상치 탐지 기법을 적용하여 그 효율성을 보이고자 한다. 먼저 GARCH 모형과 GARCH 모형하에서 이상치 탐지 기법에 대해 소개하고, 적용된 방법이 기존의 전통적인 이상치 탐지 방법보다 성능이 우수함을 시뮬레이션과 실제 KOSPI 자료에 적합시켜 입증하였다.

이상 트래픽 탐지를 위한 로버스트 추정 방법 비교 연구 (A Comparative Study of a Robust Estimate Method for Abnormal Traffic Detection)

  • 정재윤;김삼용
    • Communications for Statistical Applications and Methods
    • /
    • 제18권4호
    • /
    • pp.517-525
    • /
    • 2011
  • 본 연구는 이상치가 존재하는 자료에 적용될 수 있는 방법을 비교한 연구로서, 이분산 시계열 모형 하에서 로버스트 추정 방법의 효용성을 보이고자 한다. GARCH 모형하에서 이상치 탐지 기법과 GARCH 모형을기반한 로버스트 추정방법의 성능을 비교하였다. 실제 인터넷 트래픽 자료에 두 방법을 적용했을때, 로버스트 추정방법이 이상치 탐지 기법에 비해 덜 복잡하고 성능이 우수함을 입증하였다.

Variable Selection and Outlier Detection for Automated K-means Clustering

  • Kim, Sung-Soo
    • Communications for Statistical Applications and Methods
    • /
    • 제22권1호
    • /
    • pp.55-67
    • /
    • 2015
  • An important problem in cluster analysis is the selection of variables that define cluster structure that also eliminate noisy variables that mask cluster structure; in addition, outlier detection is a fundamental task for cluster analysis. Here we provide an automated K-means clustering process combined with variable selection and outlier identification. The Automated K-means clustering procedure consists of three processes: (i) automatically calculating the cluster number and initial cluster center whenever a new variable is added, (ii) identifying outliers for each cluster depending on used variables, (iii) selecting variables defining cluster structure in a forward manner. To select variables, we applied VS-KM (variable-selection heuristic for K-means clustering) procedure (Brusco and Cradit, 2001). To identify outliers, we used a hybrid approach combining a clustering based approach and distance based approach. Simulation results indicate that the proposed automated K-means clustering procedure is effective to select variables and identify outliers. The implemented R program can be obtained at http://www.knou.ac.kr/~sskim/SVOKmeans.r.

MULTIPLE OUTLIER DETECTION IN LOGISTIC REGRESSION BY USING INFLUENCE MATRIX

  • Lee, Gwi-Hyun;Park, Sung-Hyun
    • Journal of the Korean Statistical Society
    • /
    • 제36권4호
    • /
    • pp.457-469
    • /
    • 2007
  • Many procedures are available to identify a single outlier or an isolated influential point in linear regression and logistic regression. But the detection of influential points or multiple outliers is more difficult, owing to masking and swamping problems. The multiple outlier detection methods for logistic regression have not been studied from the points of direct procedure yet. In this paper we consider the direct methods for logistic regression by extending the $Pe\tilde{n}a$ and Yohai (1995) influence matrix algorithm. We define the influence matrix in logistic regression by using Cook's distance in logistic regression, and test multiple outliers by using the mean shift model. To show accuracy of the proposed multiple outlier detection algorithm, we simulate artificial data including multiple outliers with masking and swamping.