• Title/Summary/Keyword: OLS(Ordinary Least Squares)

Search Result 75, Processing Time 0.025 seconds

Expressions for Shrinkage Factors of PLS Estimator

  • Kim, Jong-Duk
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.4
    • /
    • pp.1169-1180
    • /
    • 2006
  • Partial least squares regression (PLS) is a biased, non-least squares regression method and is an alternative to the ordinary least squares regression (OLS) when predictors are highly collinear or predictors outnumber observations. One way to understand the properties of biased regression methods is to know how the estimators shrink the OLS estimator. In this paper, we introduce an expression for the shrinkage factor of PLS and develop a new shrinkage expression, and then prove the equivalence of the two representations. We use two near-infrared (NIR) data sets to show general behavior of the shrinkage and in particular for what eigendirections PLS expands the OLS coefficients.

  • PDF

Comparison between Total Least Squares and Ordinary Least Squares for Linear Relationship of Stable Water Isotopes (완전최소자승법과 보통최소자승법을 이용한 물안정동위원소의 선형관계식 비교)

  • Lee, Jeonghoon;Choi, Hye-Bin;Lee, Won Sang;Lee, Seung-Gu
    • Economic and Environmental Geology
    • /
    • v.50 no.6
    • /
    • pp.517-523
    • /
    • 2017
  • A linear relationship between two stable water isotopes, oxygen and hydrogen, has been used to understand the water cycle as a basic tool. A slope and intercept from the linear relationship indicates what kind of physical processes occur during movement of water. Traditionally, ordinary least squares (OLS) method has been utilized for the linear relationship, but total least squares (TLS) method provides more accurate slope and intercept theoretically because isotopic compositions of both oxygen and hydrogen have uncertainties. In this work, OLS and TLS were compared with isotopic compositions of snow and snowmelt collected from the King Sejong Station, Antarctica and isotopic compositions of water vapor observed by Lee et al. (2013) in the western part of Korea. The slopes from the linear relationship of isotopic compositions of snow and snowmelt at the King Sejong Station were estimated to be 7.00 (OLS) and 7.16(TLS) and the slopes of stable water vapor isotopes were 7.75(OLS) and 7.87(TLS). There was a melting process in the snow near the King Sejong Station and the water vapor was directly transported from the ocean to the study area based on the slope calculations. There is no significant difference in two slopes to interpret the physical processes. However, it is necessary to evaluate the slope differences from the two methods for studies for example, groundwater recharge processes, using the absolute slope values.

Unified Non-iterative Algorithm for Principal Component Regression, Partial Least Squares and Ordinary Least Squares

  • Kim, Jong-Duk
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.2
    • /
    • pp.355-366
    • /
    • 2003
  • A unified procedure for principal component regression (PCR), partial least squares (PLS) and ordinary least squares (OLS) is proposed. The process gives solutions for PCR, PLS and OLS in a unified and non-iterative way. This enables us to see the interrelationships among the three regression coefficient vectors, and it is seen that the so-called E-matrix in the solution expression plays the key role in differentiating the methods. In addition to setting out the procedure, the paper also supplies a robust numerical algorithm for its implementation, which is used to show how the procedure performs on a real world data set.

  • PDF

Weighted Least Absolute Deviation Lasso Estimator

  • Jung, Kang-Mo
    • Communications for Statistical Applications and Methods
    • /
    • v.18 no.6
    • /
    • pp.733-739
    • /
    • 2011
  • The linear absolute shrinkage and selection operator(Lasso) method improves the low prediction accuracy and poor interpretation of the ordinary least squares(OLS) estimate through the use of $L_1$ regularization on the regression coefficients. However, the Lasso is not robust to outliers, because the Lasso method minimizes the sum of squared residual errors. Even though the least absolute deviation(LAD) estimator is an alternative to the OLS estimate, it is sensitive to leverage points. We propose a robust Lasso estimator that is not sensitive to outliers, heavy-tailed errors or leverage points.

A Study on Internet Traffic Forecasting by Combined Forecasts (결합예측 방법을 이용한 인터넷 트래픽 수요 예측 연구)

  • Kim, Sahm
    • The Korean Journal of Applied Statistics
    • /
    • v.28 no.6
    • /
    • pp.1235-1243
    • /
    • 2015
  • Increased data volume in the ICT area has increased the importance of forecasting accuracy for internet traffic. Forecasting results may have paper plans for traffic management and control. In this paper, we propose combined forecasts based on several time series models such as Seasonal ARIMA and Taylor's adjusted Holt-Winters and Fractional ARIMA(FARIMA). In combined forecasting methods, we use simple-combined method, MSE based method (Armstrong, 2001), Ordinary Least Squares (OLS) method and Equality Restricted Least Squares (ERLS) method. The results show that the Seasonal ARIMA model outperforms in 3 hours ahead forecasts and that combined forecasts outperform in longer periods.

Applicability of the Ordinary Least Squares Procedure When Both Variables are Subject to Error

  • Kim, Kil-Soo;Byun, Jai-Hyun;Yum, Bong-Jin
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.21 no.1
    • /
    • pp.163-170
    • /
    • 1996
  • An errors-in-variables model (EVM) differs from the classical regression model in that in the former the independent variable is also subject to error. This paper shows that to assess the applicability of the ordinary least squares (OLS) estimation procedure to the EVM, the relative dispersion of the independent variable to its error variance must be also considered in addition to Mandel's criterion. The effect of physically reducing the variance of errors in the independent variable on the performance of the OLS slope estimator is also discussed.

  • PDF

Predicting Site Quality by Partial Least Squares Regression Using Site and Soil Attributes in Quercus mongolica Stands (신갈나무 임분의 입지 및 토양 속성을 이용한 부분최소제곱 회귀의 지위추정 모형)

  • Choonsig Kim;Gyeongwon Baek;Sang Hoon Chung;Jaehong Hwang;Sang Tae Lee
    • Journal of Korean Society of Forest Science
    • /
    • v.112 no.1
    • /
    • pp.23-31
    • /
    • 2023
  • Predicting forest productivity is essential to evaluate sustainable forest management or to enhance forest ecosystem services. Ordinary least squares (OLS) and partial least squares (PLS) regression models were used to develop predictive models for forest productivity (site index) from the site characteristics and soil profile, along with soil physical and chemical properties, of 112 Quercus mongolica stands. The adjusted coefficients of determination (adjusted R2) in the regression models were higher for the site characteristics and soil profile of B horizon (R2=0.32) and of A horizon (R2=0.29) than for the soil physical and chemical properties of B horizon (R2=0.21) and A horizon (R2=0.09). The PLS models (R2=0.20-0.32) were better predictors of site index than the OLS models (R2=0.09-0.31). These results suggest that the regression models for Q. mongolica can be applied to predict the forest productivity, but new variables may need to be developed to enhance the explanatory power of regression models.

A Study on the Optimum Scheme for Determination of Operation Time of Line Feeders in Automatic Combination Weighers

  • Keraita James N.;Kim Kyo-Hyoung
    • Journal of Mechanical Science and Technology
    • /
    • v.20 no.10
    • /
    • pp.1567-1575
    • /
    • 2006
  • In an automatic combination weigher, the line feeders distribute the product to several weighing hoppers. The ability to supply appropriate amount of product to the weighing hoppers for each combination operation is crucial for the overall performance. Determining the right duration of operating a line feeder to supply a given amount of product becomes very challenging in case of products which are irregular in volume or specific gravity such as granular secondary processed foods. In this research, several schemes were investigated to determine the best way for a line feeder to approximate the next operating time in order to supply a set amount of irregular goods to the corresponding weighing hopper. Results obtained show that a weighted least squares method (WLS) employing 10 data points is the most effective in determining the operating times of line feeders.

Short-term Reactive Power Load Forecasting Using Multiple Time-Series Model (다중 시계열 모델을 이용한 단기 부하 무효전력 예측)

  • Lee, Hyo-Sang;Cho, Jong-Man;Park, Woo-Hyun;Kim, Jin-O
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.18 no.5
    • /
    • pp.105-111
    • /
    • 2004
  • This paper shows that active and reactive power load have significant positive relationship and there exist two types of relationship between them using Test Statistics. In investigating the cross plots at every hour, we found out that from 0 to 8 hours, there relationships are linear, while from 9 to 23 hours, they are two piece-wise linear. Also, reactive power loads was estimated and forecasted using active power load as the explanary variable with OLS (Ordinary Least Squares) regression methods. MAPE (Mean Absolute Percentage Error) for each model is calculated for one-hour ahead forecasting.

Reflections on the China-Malaysia Economic Partnership

  • AL SHAHER, Shaher;ZREIK, Mohamad
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.9 no.3
    • /
    • pp.229-234
    • /
    • 2022
  • The study aims to investigate whether Musharakah management has an impact on Chinese and Malaysian business partnerships. To estimate the relationship between Musharakah and the Sino-Malaysian partnership, this study uses a panel econometric technique namely pooled ordinary least squares. Ordinary Least Squares regression (OLS) is a common technique for estimating coefficients of linear regression equations which describe the relationship between one or more independent quantitative variables and a dependent variable. Data was retrieved from the annual reports (from 2009 to 2019) of non-financial firms listed on the stock exchange of China and Malaysia. Four partnership measures (i.e., Musharakah, Mudarabah, Tawuruq, and Kafalah) were used to estimate the impact of Musharakah on the Sino-Malaysian partnership. Empirical results reveal that Musharakah and Mudarabah are positively related to Kafalah but the relationship is statistically insignificant. Alternatively, Musharakah is positively and significantly related to Mudarabah. Musharakah and Mudarabah have a positive but insignificant relationship. The findings of this study suggest that management of partnership has a positive impact on firm partnership. Furthermore, it supports the hypothesis that improving partnership enhances Musharakah, which has a positive impact on the firm's partnership.