• Title/Summary/Keyword: least squares linear regression

Search Result 134, Processing Time 0.029 seconds

Robust Response Transformation Using Outlier Detection in Regression Model (회귀모형에서 이상치 검색을 이용한 로버스트 변수변환방법)

  • Seo, Han-Son;Lee, Ga-Yoen;Yoon, Min
    • The Korean Journal of Applied Statistics
    • /
    • v.25 no.1
    • /
    • pp.205-213
    • /
    • 2012
  • Transforming response variable is a general tool to adapt data to a linear regression model. However, it is well known that response transformations in linear regression are very sensitive to one or a few outliers. Many methods have been suggested to develop transformations that will not be influenced by potential outliers. Recently Cheng (2005) suggested to using a trimmed likelihood estimator based on the idea of the least trimmed squares estimator(LTS). However, the method requires presetting the number of outliers and needs many computations. A new method is proposed, that can solve the problems addressed and improve the robustness of the estimates. The method uses a stepwise procedure, suggested by Hadi and Simonoff (1993), to detect outliers that determine response transformations.

Support Vector Machine for Interval Regression

  • Hong Dug Hun;Hwang Changha
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2004.11a
    • /
    • pp.67-72
    • /
    • 2004
  • Support vector machine (SVM) has been very successful in pattern recognition and function estimation problems for crisp data. This paper proposes a new method to evaluate interval linear and nonlinear regression models combining the possibility and necessity estimation formulation with the principle of SVM. For data sets with crisp inputs and interval outputs, the possibility and necessity models have been recently utilized, which are based on quadratic programming approach giving more diverse spread coefficients than a linear programming one. SVM also uses quadratic programming approach whose another advantage in interval regression analysis is to be able to integrate both the property of central tendency in least squares and the possibilistic property In fuzzy regression. However this is not a computationally expensive way. SVM allows us to perform interval nonlinear regression analysis by constructing an interval linear regression function in a high dimensional feature space. In particular, SVM is a very attractive approach to model nonlinear interval data. The proposed algorithm here is model-free method in the sense that we do not have to assume the underlying model function for interval nonlinear regression model with crisp inputs and interval output. Experimental results are then presented which indicate the performance of this algorithm.

  • PDF

The Efficiency of the Cochrane-Orcutt Estimation Procedure in Autocorrelated Regression Models

  • Song, Seuck-Heun;Myoungshic Jhun;Jung, Byoung-Cheol
    • Journal of the Korean Statistical Society
    • /
    • v.27 no.3
    • /
    • pp.319-329
    • /
    • 1998
  • In the linear regression model with an autocorrelated disturbances, the Cochrane-Orcutt estimator (COE) is a well known alternative to the Generalized Least Squares estimator (GLSE). The efficiency of COE has been studied empirically in a Monte Carlo study when the unknown parameters are estimated by maximum likelihood method. In this paper, it is theoretically proved that the COE is shown to be inferior to the GLSE. The comparisons are based on the difference of corresponding information matrices or the ratio of their determinants.

  • PDF

A procedure for simultaneous variable selection, variable transformation and outlier identification in linear regression (선형회귀에서 변수선택, 변수변환과 이상치 탐지의 동시적 수행을 위한 절차)

  • Seo, Han Son;Yoon, Min
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.1
    • /
    • pp.1-10
    • /
    • 2020
  • We propose a unified approach to variable selection, transformation and outliers in the linear model. The procedure includes a sequential method for outlier detection and a least trimmed squares estimator for variable transformation. It uses all possible subsets regressions for model selection. Some real data analyses and the simulation results are provided to show the efficiency of the methods in the context of the correct variable selection and the fitness of the estimated model.

Short-term Reactive Power Load Forecasting Using Multiple Time-Series Model (다중 시계열 모델을 이용한 단기 부하 무효전력 예측)

  • Lee, Hyo-Sang;Cho, Jong-Man;Park, Woo-Hyun;Kim, Jin-O
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.18 no.5
    • /
    • pp.105-111
    • /
    • 2004
  • This paper shows that active and reactive power load have significant positive relationship and there exist two types of relationship between them using Test Statistics. In investigating the cross plots at every hour, we found out that from 0 to 8 hours, there relationships are linear, while from 9 to 23 hours, they are two piece-wise linear. Also, reactive power loads was estimated and forecasted using active power load as the explanary variable with OLS (Ordinary Least Squares) regression methods. MAPE (Mean Absolute Percentage Error) for each model is calculated for one-hour ahead forecasting.

ON THEIL'S METHOD IN FUZZY LINEAR REGRESSION MODELS

  • Choi, Seung Hoe;Jung, Hye-Young;Lee, Woo-Joo;Yoon, Jin Hee
    • Communications of the Korean Mathematical Society
    • /
    • v.31 no.1
    • /
    • pp.185-198
    • /
    • 2016
  • Regression analysis is an analyzing method of regression model to explain the statistical relationship between explanatory variable and response variables. This paper propose a fuzzy regression analysis applying Theils method which is not sensitive to outliers. This method use medians of rate of increment based on randomly chosen pairs of each components of ${\alpha}$-level sets of fuzzy data in order to estimate the coefficients of fuzzy regression model. An example and two simulation results are given to show fuzzy Theils estimator is more robust than the fuzzy least squares estimator.

Pulse pile-up correction by auto-regression on linear operations (ARLO) method: A comparison with integration-based algorithms

  • Mohammad-Reza Mohammadian-Behbahani
    • Nuclear Engineering and Technology
    • /
    • v.56 no.9
    • /
    • pp.3904-3913
    • /
    • 2024
  • Radiation detection at high count rate suffers from pulse pile-up, where the counting data and energy information of the system are affected by the overlapping of the system output pulses. There exist various pile-up correction strategies to recover the true information of the pulses, among which pulse-tail extrapolation is a well-known method focused on in this study. Present work aims to use a mono-exponential model for extrapolating the pileup-distorted trailing edge of a pulse, to provide a reference line for calculating the true amplitude of its subsequent overlapping pulse. To this goal, the auto-regression on linear operations (ARLO) method is examined and compared with two integration-based methods (the Foss and the Matheson methods), as well as the non-linear least squares (NLS) method. Despite a higher sensitivity to noise, the ARLO method was able to provide a simple, non-iterative solution with a performance over 400 times faster than the NLS algorithm, according to the analysis of a high count rate set of experimental pulses from a NaI(Tl) detection system. Foss and Matheson methods also provided solutions reasonably faster than NLS (but not surpassing ARLO), performing exactly the same as each other with results very close to NLS, benefiting from their non-iterative nature.

The Sequential Testing of Multiple Outliers in Linear Regression

  • Park, Jinpyo;Park, Heechang
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.2
    • /
    • pp.337-346
    • /
    • 2001
  • In this paper we consider the problem of identifying and testing the outliers in linear regression. first we consider the problem for testing the null hypothesis of no outliers. The test based on the ratio of two scale estimates is proposed. We show the asymptotic distribution of the test statistic by Monte Carlo simulation and investigate its properties. Next we consider the problem of identifying the outliers. A forward sequential procedure based on the suggested test is proposed and shown to perform fairly well. The forward sequential procedure is unaffected by masking and swamping effects because the test statistic is based on robust estimate.

  • PDF

Sustained Vowel Modeling using Nonlinear Autoregressive Method based on Least Squares-Support Vector Regression (최소 제곱 서포트 벡터 회귀 기반 비선형 자귀회귀 방법을 이용한 지속 모음 모델링)

  • Jang, Seung-Jin;Kim, Hyo-Min;Park, Young-Choel;Choi, Hong-Shik;Yoon, Young-Ro
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.17 no.7
    • /
    • pp.957-963
    • /
    • 2007
  • In this paper, Nonlinear Autoregressive (NAR) method based on Least Square-Support Vector Regression (LS-SVR) is introduced and tested for nonlinear sustained vowel modeling. In the database of total 43 sustained vowel of Benign Vocal Fold Lesions having aperiodic waveform, this nonlinear synthesizer near perfectly reproduced chaotic sustained vowels, and also conserved the naturalness of sound such as jitter, compared to Linear Predictive Coding does not keep these naturalness. However, the results of some phonation are quite different from the original sounds. These results are assumed that single-band model can not afford to control and decompose the high frequency components. Therefore multi-band model with wavelet filterbank is adopted for substituting single band model. As a results, multi-band model results in improved stability. Finally, nonlinear sustained vowel modeling using NAR based on LS-SVR can successfully reconstruct synthesized sounds nearly similar to original voiced sounds.

Prediction of Prestress Foce Losses by Nonlinear Regression (비선형 회귀분석에 의한 프리스트레스 하중의 사간에 따른 소실 예측)

  • 오병환;양인환;홍경옥;채성태
    • Proceedings of the Korea Concrete Institute Conference
    • /
    • 1998.04a
    • /
    • pp.347-352
    • /
    • 1998
  • The purpose of this paper is to present and establish a procedure to predict the prestress forces during the service life of the structure. The statistical approach of this procedure is using the in-situ measurement data of the post-tensioning system to develop a nonlinear regression analysis. The method of least squares is used to fit a certain function a set of data. Use of a nonlinear model is achieved by its logarithmic transformation and sunsequent use of linear-regression theory. The regression analysis result can be used to check the prestress force during the service life so that the remaining prestress force is equal to or exceeds the design requirement. Results from the measurement data of PSC box girder bridge structure were used to demonstrate the procedures.

  • PDF