• Title/Summary/Keyword: 로버스트법

Search Result 40, Processing Time 0.016 seconds

On a robust analysis of variance based on winsorization (윈저화를 이용한 로버스트 분산분석)

  • 성내경
    • The Korean Journal of Applied Statistics
    • /
    • v.8 no.1
    • /
    • pp.119-131
    • /
    • 1995
  • Based on Monte-Carlo simulation results we propose a robust analysis of variance procedure by utilizing trimmed mean and Winsorized variance. We deal with mainly the one-way classification case. We evaluate the empirical distribution of a pseudo-F statistic based on symmetrically Winsorized sum of squares when the population is normally distributed.

  • PDF

Outlier Detection of Autoregressive Models Using Robust Regression Estimators (로버스트 추정법을 이용한 자기상관회귀모형에서의 특이치 검출)

  • Lee Dong-Hee;Park You-Sung;Kim Kee-Whan
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.2
    • /
    • pp.305-317
    • /
    • 2006
  • Outliers adversely affect model identification, parameter estimation, and forecast in time series data. In particular, when outliers consist of a patch of additive outliers, the current outlier detection procedures suffer from the masking and swamping effects which make them inefficient. In this paper, we propose new outlier detection procedure based on high breakdown estimators, called as the dual robust filtering. Empirical and simulation studies in the autoregressive model with orders p show that the proposed procedure is effective.

확률화 블록 계획법에서 우산형 대립가설에 대한 점근 분포 무관 검정법의 연구

  • 김동희;김현기;이주현
    • Communications for Statistical Applications and Methods
    • /
    • v.3 no.3
    • /
    • pp.83-92
    • /
    • 1996
  • 확률화 블록 계획법에서 우산형 대립가설에 대한 점근 분포 무관 검정법을 제시하고 제안된 검정통계량의 점근적 정규성과 모수적 방법 및 비모수적 방법의 점근상대효율을 관찰하였다. 검점통계량은 블록 효과를 추정하여 제거한 관측치의 전체 블록 순위를 사용하여 제안하였으며 제안된 검정통계량의 소표본 Monte Carlo 연구를 통해 실험 검정력을 비교하였다. 그 결과 본 논문에서 제안된 검정통계량이 꼬리가 두꺼운 분포에서는 전반적으로 우수하고 로버스트한 것으로 나타났다.

  • PDF

A comparison study of various robust regression estimators using simulation (시뮬레이션을 통한 다양한 로버스트 회귀추정량의 비교 연구)

  • Jang, Soohee;Yoon, Jungyeon;Chun, Heuiju
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.3
    • /
    • pp.471-485
    • /
    • 2016
  • Least squares (LS) regression is a classic method for regression that is optimal under assumptions of regression and usual observations. However, the presence of unusual data in the LS method leads to seriously distorted estimates. Therefore, various robust estimation methods are proposed to circumvent the limitations of traditional LS regression. Among these, there are M-estimators based on maximum likelihood estimation (MLE), L-estimators based on linear combinations of order statistics and R-estimators based on a linear combinations of the ordered residuals. In this paper, robust regression estimators with high breakdown point and/or with high efficiency are compared under several simulated situations. The paper analyses and compares distributions of estimates as well as relative efficiencies calculated from mean squared errors (MSE) in the simulation study. We conclude that MM-estimators or GR-estimators are a good choice for the real data application.

Principal Components Logistic Regression based on Robust Estimation (로버스트추정에 바탕을 둔 주성분로지스틱회귀)

  • Kim, Bu-Yong;Kahng, Myung-Wook;Jang, Hea-Won
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.3
    • /
    • pp.531-539
    • /
    • 2009
  • Logistic regression is widely used as a datamining technique for the customer relationship management. The maximum likelihood estimator has highly inflated variance when multicollinearity exists among the regressors, and it is not robust against outliers. Thus we propose the robust principal components logistic regression to deal with both multicollinearity and outlier problem. A procedure is suggested for the selection of principal components, which is based on the condition index. When a condition index is larger than the cutoff value obtained from the model constructed on the basis of the conjoint analysis, the corresponding principal component is removed from the logistic model. In addition, we employ an algorithm for the robust estimation, which strives to dampen the effect of outliers by applying the appropriate weights and factors to the leverage points and vertical outliers identified by the V-mask type criterion. The Monte Carlo simulation results indicate that the proposed procedure yields higher rate of correct classification than the existing method.

Robust spectral estimator from M-estimation point of view: application to the Korean housing price index (M-추정에 기반을 둔 로버스트 스펙트럴 추정량: 주택 가격 지수에 대한 응용)

  • Pak, Ro Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.3
    • /
    • pp.463-470
    • /
    • 2016
  • In analysing a time series on the frequency domain, the spectral estimator (or periodogram) is a very useful statistic to identify the periods of a time series. However, the spectral estimator is very sensitive in nature to outliers, so that the spectral estimator in terms of M-estimation has been studied by some researchers. Pak (2001) proposed an empirical method to choose a tuning parameter for the Huber's M-estimating function. In this article, we try to implement Pak's estimation proposal in the spectral estimator. We use the Korean housing price index as an example data set for comparing various M-estimating results.

An Alternative Study of the Determination of the Threshold for the Generalized Pareto Distribution (일반화 파레토 분포에서 임계치 결정에 대한 대안적 연구)

  • Yoon, Jeong-Yoen;Cho, Jae-Beom;Jun, Byoung-Cheol
    • The Korean Journal of Applied Statistics
    • /
    • v.24 no.5
    • /
    • pp.931-939
    • /
    • 2011
  • In practice, thresholds are determined by the two subjective assessment methods in a generalized pareto distribution of mean extreme function(MEF-graph) or Hill-graph. To remedy the problem of subjectiveness of these methods, we propose an alternative method to determine the threshold based on the robust statistics. We compared the MEF-graph, Hill-graph and our method through VaRs on the Korean stock market data from January 5, 1987 to August 3, 2009. As a result, the VaR based on the proposed method is not much different from the existing methods, and the standard deviation of VaR for our method was the smallest. The results show that our method can be a promising alternative to determine thresholds of the generalized pareto distributions.

A Robust Test for Location Parameters in Multivariate Data (다변량 자료에서 위치모수에 대한 로버스트 검정)

  • So, Sun-Ha;Lee, Dong-Hee;Jung, Byoung-Cheo
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.6
    • /
    • pp.1355-1364
    • /
    • 2009
  • This work propose a robust test for location parameters in multivariate data based on MVE and MCD with the affine equivariance and the high-breakdown properties. We consider the hypothesis testing satisfying high efficiency and high test power simultaneously to bring in the one-step reweighting procedure upon high-breakdown estimators, which generally suffer from the low efficiency and, as a result, usually used only in the exploratory analysis. Monte Carlo study shows that the suggested method retains nominal significance levels and higher testing power without regard to various population distributions than a Hotelling's $T^2$ test. In an example, a data set containing known outliers does not make an influence toward our proposal, while it renders a Hotelling's $T^2$ useless.

A Study on the Fatigue Strength and Life Distribution of Carbon Steel Using the Database System (데이터베이스 시스템을 이용한 탄소강의 피로강도 및 수명분포)

  • Kim, Jung Kyu;Moon, Joon Ho;Kim, Do Sik
    • Journal of Korean Society of Steel Construction
    • /
    • v.10 no.1 s.34
    • /
    • pp.37-45
    • /
    • 1998
  • The relational database system on fatigue strength was constructed, and the properties of fatigue life distribution were examined to analyze reliability and safety of metallic materials. Data manipulations were efficiently performed in relational fatigue strength database system using dependency diagram. Regardless of the distribution of fatigue strength, the proposed method, the Robust method and the complementary error function method using probability distribution, successfully estimated parameters of the 3-parameter Weibull distribution. The proposed criterion for estimating non-failure probability showed good results regardless of censoring time. The fatigue life distribution function described as a function of parameters of the Weibull distribution and applied stress ratio produced P-S-N characteristics reasonably.

  • PDF

Prediction Intervals for Nonlinear Time Series Models Using the Bootstrap Method (붓스트랩을 이용한 비선형 시계열 모형의 예측구간)

  • 이성덕;김주성
    • The Korean Journal of Applied Statistics
    • /
    • v.17 no.2
    • /
    • pp.219-228
    • /
    • 2004
  • In this paper we construct prediction intervals for nonlinear time series models using the bootstrap. We compare these prediction intervals to traditional asymptotic prediction intervals using quasi-score estimation function and M-quasi-score estimating function comprising bounded functions. Simulation results show that the bootstrap method leads to improved accuracy. The accuracy of the bootstrap is empirically demonstrated with the consumer price index.