• Title/Summary/Keyword: OLS Regression

Search Result 248, Processing Time 0.022 seconds

Expressions for Shrinkage Factors of PLS Estimator

  • Kim, Jong-Duk
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.4
    • /
    • pp.1169-1180
    • /
    • 2006
  • Partial least squares regression (PLS) is a biased, non-least squares regression method and is an alternative to the ordinary least squares regression (OLS) when predictors are highly collinear or predictors outnumber observations. One way to understand the properties of biased regression methods is to know how the estimators shrink the OLS estimator. In this paper, we introduce an expression for the shrinkage factor of PLS and develop a new shrinkage expression, and then prove the equivalence of the two representations. We use two near-infrared (NIR) data sets to show general behavior of the shrinkage and in particular for what eigendirections PLS expands the OLS coefficients.

  • PDF

A Quantitative Model for the Projection of Health Expenditure (의료비 결정요인 분석을 위한 계량적 모형 고안)

  • Kim, Han-Joong;Lee, Young-Doo;Nam, Chung-Mo
    • Journal of Preventive Medicine and Public Health
    • /
    • v.24 no.1 s.33
    • /
    • pp.29-36
    • /
    • 1991
  • A multiple regression analysis using ordinary least square (OLS) is frequently used for the projection of health expenditure as well as for the identification of factors affecting health care costs. Data for the analysis often have mixed characteristics of time series and cross section. Parameters as a result of OLS estimation, in this case, are no longer the best linear unbiased estimators (BLUE) because the data do not satisfy basic assumptions of regression analysis. The study theoretically examined statistical problems induced when OLS estimation was applied with the time series cross section data. Then both the OLS regression and time series cross section regression (TSCS regression) were applied to the same empirical da. Finally, the difference in parameters between the two estimations were explained through residual analysis.

  • PDF

Estimating the Nature of Relationship of Entrepreneurship and Business Confidence on Youth Unemployment in the Philippines

  • CAMBA, Aileen L.
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.7 no.8
    • /
    • pp.533-542
    • /
    • 2020
  • This study estimates the nature of the relationship of entrepreneurship and business confidence on youth unemployment in the Philippines over the 2001-2017 period. The paper employed a range of cointegrating regression models, namely, autoregressive distributed lag (ARDL) bounds testing approach, Johansen-Juselius (JJ) and Engle-Granger (EG) cointegration models, dynamic OLS, fully modified OLS, and canonical cointegrating regression (CCR) estimation techniques. The Granger causality based on error correction model (ECM) was also performed to determine the causal link of entrepreneurship and business confidence on youth unemployment. The ARDL bounds testing approach, Johansen-Juselius (JJ) and Engle-Granger (EG) cointegration models confirmed the existence of long-run equilibrium relationship of entrepreneurship and business confidence on youth unemployment. The long-run coefficients from JJ and dynamic OLS show significant long-run and positive relationship of entrepreneurship and business confidence on youth unemployment. While results of the long-run coefficients from fully modified OLS and canonical cointegrating regression (CCR) found that only entrepreneurship has significant and positive relationship with youth unemployment in the long-run. The Granger causality based on error correction model (ECM) estimates show evidence of long-run causal relationship of entrepreneurship and business confidence on youth unemployment. In the short-run, increases in entrepreneurship and business confidence causes youth unemployment to decrease.

Geographically Weighted Regression on the Environmental-Ecological Factors of Human Longevity (장수의 환경생태학적 요인에 관한 지리가중회귀분석)

  • Choi, Don Jeong;Suh, Yong Cheol
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.20 no.3
    • /
    • pp.57-63
    • /
    • 2012
  • The ordinary least square (OLS) regression model is assumed that the relationship between distribution of longevity population and environmental factors to be identical. Therefore, the OLS regression analysis can't explain sufficiently the spatial characteristics of longevity phenomenon and related variables. The geographically weighted regression (GWR) model can be representing the spatial relationship of adjacent area using geographically weighted function. It also characterized which can locally explain the spatial variation of distribution of longevity population by environmental characteristics. From this point of view, this study was performed the comparative analysis between OLS and GWR model for ecological factors of longevity existing studies. In the results, GWR model has higher corresponded to model than OLS model and can be accounting for spatial variability about effect of specific environmental variables.

Forecast and Review of International Airline demand in Korea (한국의 국제선 항공수요 예측과 검토)

  • Kim, Young-Rok
    • Journal of the Korean Society for Aviation and Aeronautics
    • /
    • v.27 no.3
    • /
    • pp.98-105
    • /
    • 2019
  • In the past 30 years, our aviation demand has been growing continuously. As such, the importance of the demand forecasting field is increasing. In this study, the factors influencing Korea's international air demand were selected, and the international air demand was analyzed, forecasted and reviewed through OLS multiple regression analysis. As a result, passenger demand was affected by GDP per capita, oil price and exchange rate, while cargo demand was affected by GDP per capita and private consumption growth rate. In particular, passenger demand was analyzed to be sensitive to temporary external shocks, and cargo demand was more affected by economic variables than temporary external shocks. Demand forecasting, OLS multiple regression analysis, passenger demand, cargo demand, transient external shocks, economic variables.

Wage Determinants Analysis by Quantile Regression Tree

  • Chang, Young-Jae
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.2
    • /
    • pp.293-301
    • /
    • 2012
  • Quantile regression proposed by Koenker and Bassett (1978) is a statistical technique that estimates conditional quantiles. The advantage of using quantile regression is the robustness in response to large outliers compared to ordinary least squares(OLS) regression. A regression tree approach has been applied to OLS problems to fit flexible models. Loh (2002) proposed the GUIDE algorithm that has a negligible selection bias and relatively low computational cost. Quantile regression can be regarded as an analogue of OLS, therefore it can also be applied to GUIDE regression tree method. Chaudhuri and Loh (2002) proposed a nonparametric quantile regression method that blends key features of piecewise polynomial quantile regression and tree-structured regression based on adaptive recursive partitioning. Lee and Lee (2006) investigated wage determinants in the Korean labor market using the Korean Labor and Income Panel Study(KLIPS). Following Lee and Lee, we fit three kinds of quantile regression tree models to KLIPS data with respect to the quantiles, 0.05, 0.2, 0.5, 0.8, and 0.95. Among the three models, multiple linear piecewise quantile regression model forms the shortest tree structure, while the piecewise constant quantile regression model has a deeper tree structure with more terminal nodes in general. Age, gender, marriage status, and education seem to be the determinants of the wage level throughout the quantiles; in addition, education experience appears as the important determinant of the wage level in the highly paid group.

An Analysis of the Effects of Customer Characteristics on Sales of Alley Market Area Using Geographically Weighted Regression (지리가중회귀분석을 이용한 고객특성별 골목상권 매출액 영향 연구)

  • Kang, Hyun Mo;Lee, Sang-Kyeong
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.36 no.6
    • /
    • pp.611-620
    • /
    • 2018
  • With the revitalization of alley market area becoming a major goal of the urban regeneration project, an understanding on customer characteristics that affect the sales of alley market areas is needed. As spatial heterogeneity appears to exist in alley market areas, the use of GWR (Geographically Weighted Regression) is required as an alternative to OLS (Ordinary Least Squares) regression. This study analyzes effects of customer characteristics on sales of 1007 alley market areas in Seoul. Comparing R squared and AICc, results show that GWR is better than OLS regression. According to OLS regression, the ratio of female, the ratio of 40's and 50's, the number of employees, the opening rate of establishment, the density of building and the size of alley market area have positive effects on sales, while the ratio of 20's and 30's, the distance of bus stop and that of subway station have negative effects. As a result of comparing local regression coefficients of geographically weighted regression analysis, the ratio of female customers has the greatest effect on the northwestern region, followed by the southwestern region, the central region and the northeastern region. The ratio of 20's and 30's and that of 40's and 50's effect on the southeastern and northeastern regions, and then the southwestern region. It is expected that this study will help to identify marketing target for each alley market area.

Some model misspecification problems for time series: A Monte Carlo investigation

  • Dong-Bin Jeong
    • Communications for Statistical Applications and Methods
    • /
    • v.5 no.1
    • /
    • pp.55-67
    • /
    • 1998
  • Recent work by Shin and Sarkar (1996) examines model misspecification problems for nonstationary time series. Shin and Sarkar introduce a general regression model with integrated errors and one system of integrated regressors and discuss the limiting distributions of the OLS estimators and the usual OLS statistics such as $\hat{\sigma^2}$t, DW and $R^2$. We analyze three different model misspecification problems through a Monte Carlo study and investigate each model misspecification problem. Our Monte Carlo experiments show that DW and $R^2$ can be in general used as diagnostic tools to detect spurious regression, misspecification of nonstationary autoregressive and polynomial regression models.

  • PDF

Forecasting Exchange Rates using Support Vector Machine Regression

  • Chen, Shi-Yi;Jeong, Ki-Ho
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2005.04a
    • /
    • pp.155-163
    • /
    • 2005
  • This paper applies Support Vector Regression (SVR) to estimate and forecast nonlinear autoregressive integrated (ARI) model of the daily exchange rates of four currencies (Swiss Francs, Indian Rupees, South Korean Won and Philippines Pesos) against U.S. dollar. The forecasting abilities of SVR are compared with linear ARI model which is estimated by OLS. Sensitivity of SVR results are also examined to kernel type and other free parameters. Empirical findings are in favor of SVR. SVR method forecasts exchange rate level better than linear ARI model and also has superior ability in forecasting the exchange rates direction in short test phase but has similar performance with OLS when forecasting the turning points in long test phase.

  • PDF

Spatial Econometrics Analysis of Fire Occurrence According to Type of Facilities (시설물 유형에 따른 화재 발생의 공간 계량 분석)

  • Seo, Min Song;Yoo, Hwan Hee
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.37 no.3
    • /
    • pp.129-141
    • /
    • 2019
  • In recent years, fast growing cities in Korea are showing signs of being vulnerable to more disasters as their population and facilities increase and intensify. In particular, fire is one of the most common disasters in Korea's cities, along with traffic accidents. Therefore, in this study, we analyze what type of factors affect the fire that threatens urban people. Fire data were acquired for 10 years, from 2007 to 2017, in Jinju, Korea. Spatial distribution pattern of fire occurrence in Jinju was assessed through the spatial autocorrelation analysis. First, spatial autocorrelation analysis was carried out to grasp the spatial distribution pattern of fire occurrence in Jinju city. In addition, correlation and multiple regression analysis were used to confirm spatial dependency and abnormality among factors. Based on this, OLS (Ordinary Least Square) regression analysis was performed using space weighting considering fire location and spatial location of each facility. As a result, First, LISA (Local Indicator of Spatial Association) analysis of the occurrence of fire in Jinju shows that the most central commercial area are fire department, industrial area, and residential area. Second, the OLS regression model was analyzed by applying spatial weighting, focusing on the most derived factors of multiple regression analysis, by integrating population and social variables and physical variables. As a result, the second kind of neighborhood living facility showed the highest correlation with the fire occurrence, followed by the following in the order of single house, sales facility, first type of neighborhood living facility, and number of households. The results of this study are expected to be useful for analyzing the fire occurrence factors of each facility in urban areas and establishing fire safety measures.