• Title/Summary/Keyword: Robust 모형

Search Result 235, Processing Time 0.078 seconds

Algorithm for the Robust Estimation in Logistic Regression (로지스틱회귀모형의 로버스트 추정을 위한 알고리즘)

  • Kim, Bu-Yong;Kahng, Myung-Wook;Choi, Mi-Ae
    • The Korean Journal of Applied Statistics
    • /
    • v.20 no.3
    • /
    • pp.551-559
    • /
    • 2007
  • The maximum likelihood estimation is not robust against outliers in the logistic regression. Thus we propose an algorithm for the robust estimation, which identifies the bad leverage points and vertical outliers by the V-mask type criterion, and then strives to dampen the effect of outliers. Our main finding is that, by an appropriate selection of weights and factors, we could obtain the logistic estimates with high breakdown point. The proposed algorithm is evaluated by means of the correct classification rate on the basis of real-life and artificial data sets. The results indicate that the proposed algorithm is superior to the maximum likelihood estimation in terms of the classification.

Robust Estimation using Estimating Functions for Time Series Models (시계열모형에서 추정함수를 이용한 로버스트 추론방법)

  • 차경엽;김삼용;이성덕
    • The Korean Journal of Applied Statistics
    • /
    • v.12 no.2
    • /
    • pp.479-490
    • /
    • 1999
  • 선형시계열모형인 AR(1)모형과 비선형시계열모형인 RCA(1), ARCH(1)모형에서 이상치(Outlier)가 존재할 경우 최소제곱추정량과 M추정량간의 점근상대효율(Asymptotic Relative Efficiency: ARE)을 구하여 두 추정량의 로버스트 성질을 비교·분석하였다. 또한 여러 유계함수(Huber, Tukey, Andrews, Hampel)들을 M추정함수에 적용하여 각각의 유계함수들을 비교·분석하였다.

  • PDF

Efficient Outlier Detection of the Water Temperature Monitoring Data (수온 관측 자료의 효율적인 이상 자료 탐지)

  • Cho, Hongyeon;Jeong, Shin Taek;Ko, Dong Hui;Son, Kyeong-Pyo
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.26 no.5
    • /
    • pp.285-291
    • /
    • 2014
  • The statistical information of the coastal water temperature monitoring data can be biased because of outliers and missing intervals. Though a number of outlier detection methods have been developed, their applications are very limited to the in-situ monitoring data because of the assumptions of the a prior information of the outliers and no-missing condition, and the excessive computational time for some methods. In this study, the practical robust method is developed that can be efficiently and effectively detect the outliers in case of the big-data. This model is composed of these two parts, one part is the construction part of the approximate components of the monitoring data using the robust smoothing and data re-sampling method, and the other part is the main iterative outlier detection part using the detailed components of the data estimated by the approximate components. This model is tested using the two-years 5-minute interval water temperature data in Lake Saemangeum. It can be estimated that the outlier proportion of the data is about 1.6-3.7%. It shows that most of the outliers in the data are detected and removed with satisfaction by the model. In order to effectively detect and remove the outliers, the outlier detection using the long-span smoothing should be applied earlier than that using the short-span smoothing.

An Outlier Detection Method in Penalized Spline Regression Models (벌점 스플라인 회귀모형에서의 이상치 탐지방법)

  • Seo, Han Son;Song, Ji Eun;Yoon, Min
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.4
    • /
    • pp.687-696
    • /
    • 2013
  • The detection and the examination of outliers are important parts of data analysis because some outliers in the data may have a detrimental effect on statistical analysis. Outlier detection methods have been discussed by many authors. In this article, we propose to apply Hadi and Simonoff's (1993) method to penalized spline a regression model to detect multiple outliers. Simulated data sets and real data sets are used to illustrate and compare the proposed procedure to a penalized spline regression and a robust penalized spline regression.

Fuzzy Theil regression Model (Theil방법을 이용한 퍼지회귀모형)

  • Yoon, Jin Hee;Lee, Woo-Joo;Choi, Seung-Hoe
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.23 no.4
    • /
    • pp.366-370
    • /
    • 2013
  • Regression Analysis is an analyzing method of regression model to explain the statistical relationship between explanatory variable and response variables. This paper introduce Theil's method to find a fuzzy regression model which explain the relationship between explanatory variable and response variables. Theil's method is a robust method which is not sensive to outliers. Theil's method use medians of rate of increment based on randomly chosen pairs of each components of ${\alpha}$-level sets of fuzzy data in order to estimate the coefficients of fuzzy regression model. We propose an example to show Theil's estimator is robust than the Least squares estimator.

Analysis of Field Test Data using Robust Linear Mixed-Effects Model (로버스트 선형혼합모형을 이용한 필드시험 데이터 분석)

  • Hong, Eun Hee;Lee, Youngjo;Ok, You Jin;Na, Myung Hwan;Noh, Maengseok;Ha, Il Do
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.2
    • /
    • pp.361-369
    • /
    • 2015
  • A general linear mixed-effects model is often used to analyze repeated measurement experiment data of a continuous response variable. However, a general linear mixed-effects model can give improper analysis results when simultaneously detecting heteroscedasticity and the non-normality of population distribution. To achieve a more robust estimation, we used a heavy-tailed linear mixed-effects model for a more exact and reliable analysis conclusion than a general linear mixed-effects model. We also provide reliability analysis results for further research.

Multiple-Model Probabilistic Design of Repetitive Controllers (연속반복학습제어의 복수모형 확률설계기법)

  • Lee, Soo-Cheol
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.13 no.2
    • /
    • pp.1-7
    • /
    • 2008
  • This paper presents a method to design a repetitive controller that is robust to variations in the system parameters. The uncertain parameters are specified probabilistically by their probability distribution functions. Instead of working with the distribution functions directly, the repetitive controller is designed from a set of models that are generated from the specified probability functions. With this multiple-model design approach, any number of uncertain parameters that follow any type of distribution functions can be treated. furthermore, the controller is derived by minimizing a frequency-domain based cost function that produces monotonic convergence of the tracking error as a function of repetition number. Numerical illustrations show how the proposed multiple-model design method produces a repetitive controller that is significantly more robust than an optimal repetitive controller designed from a single nominal model of the system.

  • PDF

Parameter Calibration of Storage Function Model and Flood Forecasting (2) Comparative Study on the Flood Forecasting Methods (저류함수모형의 매개변수 보정과 홍수예측 (2) 홍수예측방법의 비교 연구)

  • Kim, Bum Jun;Song, Jae Hyun;Kim, Hung Soo;Hong, Il Pyo
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.26 no.1B
    • /
    • pp.39-50
    • /
    • 2006
  • The flood control offices of main rivers have used a storage function model to forecast flood stage in Korea and studies of flood forecasting actively have been done even now. On this account, the storage function model, which is used in flood control office, regression models and artificial neural network model are applied into flood forecasting of study watershed in this paper. The result obtained by each method are analyzed for the comparative study. In case of storage function model, this paper uses the representative parameters of the flood control offices and the optimized parameters. Regression coefficients are obtained by regression analysis and neural network is trained by backpropagation algorithm after selecting four events between 1995 to 2001. As a result of this study, it is shown that the optimized parameters are superior to the representative parameters for flood forecasting. The results obtained by multiple, robust, stepwise regression analysis, one of the regression methods, show very good forecasts. Although the artificial neural network model shows less exact results than the regression model, it can be efficient way to produce a good forecasts.

The Maximin Robust Design for the Uncertainty of Parameters of Michaelis-Menten Model (Michaelis-Menten 모형의 모수의 불확실성에 대한 Maximin 타입의 강건 실험)

  • Kim, Youngil;Jang, Dae-Heung;Yi, Seongbaek
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.7
    • /
    • pp.1269-1278
    • /
    • 2014
  • Despite the D-optimality criterion becomes very popular in designing an experiment for nonlinear models because of theoretical foundations it provides, it is very critical that the criterion depends on the unknown parameters of the nonlinear model. But some nonlinear models turned out to be partially nonlinear in sense that the optimal design depends on the subset of parameters only. It was a strong belief that the maximin approach to find a robust design to protect against the uncertainty of parameters is not guaranteed to be successful in nonlinear models. But the maximin approach could be a success for the partial nonlinear model, because often the optimal design depends on only one unknown value of parameter, easier to handle than the full parameters. We deal with maximin approach for Michaelis-Menten model with respect to D- and $D_s$-optimality.

Online abnormal events detection with online support vector machine (온라인 서포트벡터기계를 이용한 온라인 비정상 사건 탐지)

  • Park, Hye-Jung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.2
    • /
    • pp.197-206
    • /
    • 2011
  • The ability to detect online abnormal events in signals is essential in many real-world signal processing applications. In order to detect abnormal events, previously known algorithms require an explicit signal statistical model, and interpret abnormal events as statistical model abrupt changes. In general, maximum likelihood and Bayesian estimation theory to estimate well as detection methods have been used. However, the above-mentioned methods for robust and tractable model, it is not easy to estimate. More freedom to estimate how the model is needed. In this paper, we investigate a machine learning, descriptor-based approach that does not require a explicit descriptors statistical model, based on support vector machines are known to be robust statistical models and a sequential optimal algorithm online support vector machine is introduced.