• Title/Summary/Keyword: Robust 회귀분석

Search Result 75, Processing Time 0.024 seconds

Outlier Detection of Autoregressive Models Using Robust Regression Estimators (로버스트 추정법을 이용한 자기상관회귀모형에서의 특이치 검출)

  • Lee Dong-Hee;Park You-Sung;Kim Kee-Whan
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.2
    • /
    • pp.305-317
    • /
    • 2006
  • Outliers adversely affect model identification, parameter estimation, and forecast in time series data. In particular, when outliers consist of a patch of additive outliers, the current outlier detection procedures suffer from the masking and swamping effects which make them inefficient. In this paper, we propose new outlier detection procedure based on high breakdown estimators, called as the dual robust filtering. Empirical and simulation studies in the autoregressive model with orders p show that the proposed procedure is effective.

A Study on the Factors Determining Officetel Price in Busan (부산지역 오피스텔 가격 결정요인 분석)

  • Choi, Yeol;Kim, Hyeong Jun;Yeo, Jung Hoon
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.35 no.3
    • /
    • pp.725-735
    • /
    • 2015
  • The aim of this study is to specifically understand the officetel market by empirical analysis for the determining factors that affect determining the price of the officetel in Busan. In my opinion, it can help officetel providers to select the appropriate size and location that analysis for the factors determining officetel price with market price, and also it can help customers officetel to choice depending on the purpose. So I was conducting this study. In this study, I analyzes the factors determining the price of Officetel using a OLS linear regression, semi-log model, and a robust regression-Busan area Officetel Real Transaction Price as the dependent variable and factors representing the physical characteristics, locational characteristics and regional characteristics as independent variables.

An empirical study on the combined forecasts (결합예측에 관한 실증적 연구)

  • 이우리
    • The Korean Journal of Applied Statistics
    • /
    • v.1 no.2
    • /
    • pp.10-26
    • /
    • 1987
  • If the forecasts from different, sources are combined in some way, the resulting forecasts may be more accurate than any of the individual components. In this paper, the established procedures of combining forecasts are reviewed and the alternative procedures are suggested. By the results of empirical analysis from survey data, the method of combining forecasts using the restricted regression weights, the restricted robust regression weights, and mixed regression weights are robust. We can not find the most efficient combined forecasts in any case if we select the corresponding decision by preliminary analysis for the statistical properties of individual dorecasts, our results of combined forecast can became useful.

Robust tests for heteroscedasticity using outlier detection methods (이상치 탐지법을 이용한 강건 이분산 검정)

  • Seo, Han Son;Yoon, Min
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.3
    • /
    • pp.399-408
    • /
    • 2016
  • There is a need to detect heteroscedasticity in a regression analysis; however, it invalidates the standard inference procedure. The diagnostics on heteroscedasticity may be distorted when both outliers and heteroscedasticity exist. Available heteroscedasticity detection methods in the presence of outliers usually use robust estimators or separating outliers from the data. Several approaches have been suggested to identify outliers in the heteroscedasticity problem. In this article conventional tests on heteroscedasticity are modified by using a sequential outlier detection methods to separate outliers from contaminated data. The performance of the proposed method is compared with original tests by a Monte Carlo study and examples.

An Outlier Detection Method in Penalized Spline Regression Models (벌점 스플라인 회귀모형에서의 이상치 탐지방법)

  • Seo, Han Son;Song, Ji Eun;Yoon, Min
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.4
    • /
    • pp.687-696
    • /
    • 2013
  • The detection and the examination of outliers are important parts of data analysis because some outliers in the data may have a detrimental effect on statistical analysis. Outlier detection methods have been discussed by many authors. In this article, we propose to apply Hadi and Simonoff's (1993) method to penalized spline a regression model to detect multiple outliers. Simulated data sets and real data sets are used to illustrate and compare the proposed procedure to a penalized spline regression and a robust penalized spline regression.

A Robust Backpropagation Algorithm and It's Application (문자인식을 위한 로버스트 역전파 알고리즘)

  • Oh, Kwang-Sik;Kim, Sang-Min;Lee, Dong-No
    • Journal of the Korean Data and Information Science Society
    • /
    • v.8 no.2
    • /
    • pp.163-171
    • /
    • 1997
  • Function approximation from a set of input-output pairs has numerous applications in scientific and engineering areas. Multilayer feedforward neural networks have been proposed as a good approximator of nonlinear function. The back propagation(BP) algorithm allows multilayer feedforward neural networks to learn input-output mappings from training samples. It iteratively adjusts the network parameters(weights) to minimize the sum of squared approximation errors using a gradient descent technique. However, the mapping acquired through the BP algorithm may be corrupt when errorneous training data we employed. When errorneous traning data are employed, the learned mapping can oscillate badly between data points. In this paper we propose a robust BP learning algorithm that is resistant to the errorneous data and is capable of rejecting gross errors during the approximation process, that is stable under small noise perturbation and robust against gross errors.

  • PDF

An Analysis of the Determinants of Employment Productivity in Korean Transportation Industry Using Korea Labor and Income Panel Study (한국노동패널자료를 활용한 국내 운송업 고용생산성 결정요인 분석)

  • So, Ae-rim;Shin, Seung-sik
    • Journal of Korea Port Economic Association
    • /
    • v.35 no.1
    • /
    • pp.57-76
    • /
    • 2019
  • This study deals with the determinants of employment productivity of transportation labor, who are the main agents of the transportation industry that has made significant contributions to our country's industrial development. The study selected the determinants of employment productivity using the Korea Labor and Income Panel Study data, and analyzed the effects of various factors using panel logistic regression, panel OLS model, and panel robust regression. The results were as follows. First, a more positive effect was shown when employees held a regular job, had a "high level of education", "joining the labor union" and "experiencing vocational training". Second, in the case of job security, having a "high level of education" and "joining the labor union" showed a more positive effect; further, job security was higher for employees who worked in a "big company" or were "married". Third, in the case of higher income productivity, higher values of "age", "academic ability" and "company size" had a more positive effect, whereas larger values of "education" and "health condition except job training" had a negative one. Fourth, in the case of job satisfaction, "female", "joining the labor union" and having a higher "income" or "job security" led to higher satisfaction and a better "health condition compared to an average person". Further, a higher "overall life satisfaction" and "economic level" led to lower job satisfaction. The analysis of the determinants of employment productivity of transportation business and seeking for improvement plan is expected to improve the employment productivity in the transportation business.

Comparison of GEE Estimation Methods for Repeated Binary Data with Time-Varying Covariates on Different Missing Mechanisms (시간-종속적 공변량이 포함된 이분형 반복측정자료의 GEE를 이용한 분석에서 결측 체계에 따른 회귀계수 추정방법 비교)

  • Park, Boram;Jung, Inkyung
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.5
    • /
    • pp.697-712
    • /
    • 2013
  • When analyzing repeated binary data, the generalized estimating equations(GEE) approach produces consistent estimates for regression parameters even if an incorrect working correlation matrix is used. However, time-varying covariates experience larger changes in coefficients than time-invariant covariates across various working correlation structures for finite samples. In addition, the GEE approach may give biased estimates under missing at random(MAR). Weighted estimating equations and multiple imputation methods have been proposed to reduce biases in parameter estimates under MAR. This article studies if the two methods produce robust estimates across various working correlation structures for longitudinal binary data with time-varying covariates under different missing mechanisms. Through simulation, we observe that time-varying covariates have greater differences in parameter estimates across different working correlation structures than time-invariant covariates. The multiple imputation method produces more robust estimates under any working correlation structure and smaller biases compared to the other two methods.

Lasso Regression of RNA-Seq Data based on Bootstrapping for Robust Feature Selection (안정적 유전자 특징 선택을 위한 유전자 발현량 데이터의 부트스트랩 기반 Lasso 회귀 분석)

  • Jo, Jeonghee;Yoon, Sungroh
    • KIISE Transactions on Computing Practices
    • /
    • v.23 no.9
    • /
    • pp.557-563
    • /
    • 2017
  • When large-scale gene expression data are analyzed using lasso regression, the estimation of regression coefficients may be unstable due to the highly correlated expression values between associated genes. This irregularity, in which the coefficients are reduced by L1 regularization, causes difficulty in variable selection. To address this problem, we propose a regression model which exploits the repetitive bootstrapping of gene expression values prior to lasso regression. The genes selected with high frequency were used to build each regression model. Our experimental results show that several genes were consistently selected in all regression models and we verified that these genes were not false positives. We also identified that the sign distribution of the regression coefficients of the selected genes from each model was correlated to the real dependent variables.

Pattern Recognition using Robust Feedforward Neural Networks (로버스트 다층전방향 신경망을 이용한 패턴인식)

  • Hwang, Chang-Ha;Kim, Sang-Min
    • Journal of the Korean Data and Information Science Society
    • /
    • v.9 no.2
    • /
    • pp.345-355
    • /
    • 1998
  • The back propagation(BP) algorithm allows multilayer feedforward neural networks to learn input-output mappings from training samples. It iteratively adjusts the network parameters(weights) to minimize the sum of squared approximation errors using a gradient descent technique. However, the mapping acquired through the BP algorithm may be corrupt when errorneous training data are employed. In this paper two types of robust backpropagation algorithms are discussed both from a theoretical point of view and in the case studies of nonlinear regression function estimation and handwritten Korean character recognition. For future research we suggest Bayesian learning approach to neural networks and compare it with two robust backpropagation algorithms.

  • PDF