• Title/Summary/Keyword: Linear Model

Search Result 9,779, Processing Time 0.034 seconds

Robustness of model averaging methods for the violation of standard linear regression assumptions

  • Lee, Yongsu;Song, Juwon
    • Communications for Statistical Applications and Methods
    • /
    • v.28 no.2
    • /
    • pp.189-204
    • /
    • 2021
  • In a regression analysis, a single best model is usually selected among several candidate models. However, it is often useful to combine several candidate models to achieve better performance, especially, in the prediction viewpoint. Model combining methods such as stacking and Bayesian model averaging (BMA) have been suggested from the perspective of averaging candidate models. When the candidate models include a true model, it is expected that BMA generally gives better performance than stacking. On the other hand, when candidate models do not include the true model, it is known that stacking outperforms BMA. Since stacking and BMA approaches have different properties, it is difficult to determine which method is more appropriate under other situations. In particular, it is not easy to find research papers that compare stacking and BMA when regression model assumptions are violated. Therefore, in the paper, we compare the performance among model averaging methods as well as a single best model in the linear regression analysis when standard linear regression assumptions are violated. Simulations were conducted to compare model averaging methods with the linear regression when data include outliers and data do not include them. We also compared them when data include errors from a non-normal distribution. The model averaging methods were applied to the water pollution data, which have a strong multicollinearity among variables. Simulation studies showed that the stacking method tends to give better performance than BMA or standard linear regression analysis (including the stepwise selection method) in the sense of risks (see (3.1)) or prediction error (see (3.2)) when typical linear regression assumptions are violated.

Variable Selection Theorems in General Linear Model

  • Park, Jeong-Soo;Yoon, Sang-Hoo
    • 한국데이터정보과학회:학술대회논문집
    • /
    • 2006.04a
    • /
    • pp.171-179
    • /
    • 2006
  • For the problem of variable selection in linear models, we consider the errors are correlated with V covariance matrix. Hocking's theorems on the effects of the overfitting and the underfitting in linear model are extended to the less than full rank and correlated error model, and to the ANCOVA model.

  • PDF

Outlying Cell Identification Method Using Interaction Estimates of Log-linear Models

  • Hong, Chong Sun;Jung, Min Jung
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.2
    • /
    • pp.291-303
    • /
    • 2003
  • This work is proposed an alternative identification method of outlying cell which is one of important issues in categorical data analysis. One finds that there is a strong relationship between the location of an outlying cell and the corresponding parameter estimates of the well-fitted log-linear model. Among parameters of log-linear model, an outlying cell is affected by interaction terms rather than main effect terms. Hence one could identify an outlying cell by investigating of parameter estimates in an appropriate log-linear model.

Fuzzy linear regression model and its application (퍼지 선형회귀모형과 응용)

  • 이성호;홍덕헌
    • The Korean Journal of Applied Statistics
    • /
    • v.10 no.2
    • /
    • pp.403-411
    • /
    • 1997
  • Fuzzy linear regression model introduced by Tanaka et al. 91982) has been proposed and developed as alternative to statistical linear regression when our understanding of a phenomenon is imprecise or vague. In this paper we review fuzzy linear regression model and its parameter estimation and examine its strengths and weaknesses through case study. In addition another fuzzy linear model is introduced and applied to an economic study.

  • PDF

Feedback Linearization for the Looper System of Hot Strip Mills

  • Hwang, I-Cheol;Kim, Seong-Bae
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2002.10a
    • /
    • pp.56.5-56
    • /
    • 2002
  • This paper studies on the feedback linearization of the looper system for hot strip mills, where the looper system plays an important role in regulating the strip tension. Firstly, nonlinear dynamic equations of the looper system are simply introduced. Secondly, using the static feedback linearization algorithm, a linear model of the looper system is obtained, of which usefulness is validated from comparison between the linear model and the nonlinear model, and design of LQI(Linear Ouadratic Integral optimal control) and ILQ (Inverse Linear Quadratic optimal control) looper control systems. In result, it is shown that the linear looper model by the feedback linearization well describes nonlin...

  • PDF

MapReduce-based Localized Linear Regression for Electricity Price Forecasting (전기 가격 예측을 위한 맵리듀스 기반의 로컬 단위 선형회귀 모델)

  • Han, Jinju;Lee, Ingyu;On, Byung-Won
    • The Transactions of the Korean Institute of Electrical Engineers P
    • /
    • v.67 no.4
    • /
    • pp.183-190
    • /
    • 2018
  • Predicting accurate electricity prices is an important task in the electricity trading market. To address the electricity price forecasting problem, various approaches have been proposed so far and it is known that linear regression-based approaches are the best. However, the use of such linear regression-based methods is limited due to low accuracy and performance. In traditional linear regression methods, it is not practical to find a nonlinear regression model that explains the training data well. If the training data is complex (i.e., small-sized individual data and large-sized features), it is difficult to find the polynomial function with n terms as the model that fits to the training data. On the other hand, as a linear regression model approximating a nonlinear regression model is used, the accuracy of the model drops considerably because it does not accurately reflect the characteristics of the training data. To cope with this problem, we propose a new electricity price forecasting method that divides the entire dataset to multiple split datasets and find the best linear regression models, each of which is the optimal model in each dataset. Meanwhile, to improve the performance of the proposed method, we modify the proposed localized linear regression method in the map and reduce way that is a framework for parallel processing data stored in a Hadoop distributed file system. Our experimental results show that the proposed model outperforms the existing linear regression model. Specifically, the accuracy of the proposed method is improved by 45% and the performance is faster 5 times than the existing linear regression-based model.

A Linear Programming Model to the Score Adjustment among the CSAT Optional Subjects (대입수능 선택과목 점수조정을 위한 선형계획모형 개발 및 활용)

  • Nam, Bo-Woo
    • Korean Management Science Review
    • /
    • v.28 no.1
    • /
    • pp.141-158
    • /
    • 2011
  • This study concerns with an applicability of the management science approach to the score adjustment among the College Scholastic Aptitude Test(CSAT) optional subjects. A linear programming model is developed to minimize the sum of score distortions between optional subjects. Based on the analysis of the 377,089 CSAT(2010) applicants' performances in social science test section, this study proposes a new approach for the score equating or linking method of the educational measurement theory. This study makes up for the weak points in the previous linear programming model. First, the model utilize the standard score which we can get. Second, the model includes a goal programming concept which minimizes the gap between the adjusting goal and the result of the adjustment. Third, the objective function of the linear programing is the weighted sum of the score distortion and the number of applicants. Fourth, the model is applied to the score adjustment problem for the whole 11 optional subjects of the social science test section. The suggested linear programming model is a generalization of the multi-tests linking problem. So, the approach is consistent with the measurement theory for the two tests and can be applied to the optional three or more tests which do not have a common anchor test or a common anchor group. The college admission decision with CSAT score can be improved by using the suggested linear programming model.

Analysis of Linear Regression Model with Two Way Correlated Errors

  • Ssong, Seuck-Heun
    • Journal of the Korean Statistical Society
    • /
    • v.29 no.2
    • /
    • pp.231-245
    • /
    • 2000
  • This paper considers a linear regression model with space and time data in where the disturbances follow spatially correlated error components. We provide the best linear unbiased predictor for the one way error components. We provide the best linear unbiased predictor for the one way error component model with spatial autocorrelation. Further, we derive two diagnostic test statistics for the assessment of model specification due to spatial dependence and random effects as an application of the Lagrange Multiplier principle.

  • PDF

FLEXIBLE OPTIMIZATION MODEL FOR LINEAR SCHEDULING PROBLEMS

  • Shu-Shun Liu;Chang-Jung Wang
    • International conference on construction engineering and project management
    • /
    • 2005.10a
    • /
    • pp.802-807
    • /
    • 2005
  • For linear projects, it has long been known that resource utilization is important in improving work efficiency. However, most existing scheduling techniques cannot satisfy the need for solving such issues. This paper presents an optimization model for solving linear scheduling problems involving resource assignment tasks. The proposed model adopts constraint programming (CP) as the searching algorithm for model formulation, and the proposed model is designed to optimize project total cost. Additionally, the concept of outsourcing resources is introduced here to improve project performance.

  • PDF

Seismic Fragility Analysis of Seismically Isolated Nuclear Power Plant Structures using Equivalent Linear- and Bilinear-Lead Rubber Bearing Model (등가선형 및 이선형 납-고무받침 모델을 적용한 면진된 원전구조물의 지진 취약도 해석)

  • Lee, Jin-Hi;Song, Jong-Keol
    • Journal of the Earthquake Engineering Society of Korea
    • /
    • v.19 no.5
    • /
    • pp.207-217
    • /
    • 2015
  • In order to increase seismic performance of nuclear power plant (NPP) in strong seismic zone, lead-rubber bearing (LRB) can be applied to seismic isolation system of NPP structures. Simple equivalent linear model as structural analysis model of LRB is more widely used in initial design process of LRB than a bilinear model. Seismic responses for seismically isolated NPP containment structures subjected to earthquakes categorized into 5 different soil-site classes are calculated by both of the equivalent linear- and bilinear- LRB models and compared each others. It can be observed that the maximum displacements of LRB and shear forces of containment in the case of the equivalent linear LRB model are larger than those in the case of bilinear LRB model. From the seismic fragility curves of NPP containment structures isolated by LRB, it can be observed that seismic fragility in the case of equivalent linear LRB model are about 5~30 % larger than those in the case of bilinear LRB model.