• Title/Summary/Keyword: Linear regression model equation

Search Result 167, Processing Time 0.026 seconds

A Study on Stochastic Estimation of Monthly Runoff by Multiple Regression Analysis (다중회귀분석에 의한 하천 월 유출량의 추계학적 추정에 관한 연구)

  • 김태철;정하우
    • Magazine of the Korean Society of Agricultural Engineers
    • /
    • v.22 no.3
    • /
    • pp.75-87
    • /
    • 1980
  • Most hydro]ogic phenomena are the complex and organic products of multiple causations like climatic and hydro-geological factors. A certain significant correlation on the run-off in river basin would be expected and foreseen in advance, and the effect of each these causual and associated factors (independant variables; present-month rainfall, previous-month run-off, evapotranspiration and relative humidity etc.) upon present-month run-off(dependent variable) may be determined by multiple regression analysis. Functions between independant and dependant variables should be treated repeatedly until satisfactory and optimal combination of independant variables can be obtained. Reliability of the estimated function should be tested according to the result of statistical criterion such as analysis of variance, coefficient of determination and significance-test of regression coefficients before first estimated multiple regression model in historical sequence is determined. But some error between observed and estimated run-off is still there. The error arises because the model used is an inadequate description of the system and because the data constituting the record represent only a sample from a population of monthly discharge observation, so that estimates of model parameter will be subject to sampling errors. Since this error which is a deviation from multiple regression plane cannot be explained by first estimated multiple regression equation, it can be considered as a random error governed by law of chance in nature. This unexplained variance by multiple regression equation can be solved by stochastic approach, that is, random error can be stochastically simulated by multiplying random normal variate to standard error of estimate. Finally hybrid model on estimation of monthly run-off in nonhistorical sequence can be determined by combining the determistic component of multiple regression equation and the stochastic component of random errors. Monthly run-off in Naju station in Yong-San river basin is estimated by multiple regression model and hybrid model. And some comparisons between observed and estimated run-off and between multiple regression model and already-existing estimation methods such as Gajiyama formula, tank model and Thomas-Fiering model are done. The results are as follows. (1) The optimal function to estimate monthly run-off in historical sequence is multiple linear regression equation in overall-month unit, that is; Qn=0.788Pn+0.130Qn-1-0.273En-0.1 About 85% of total variance of monthly runoff can be explained by multiple linear regression equation and its coefficient of determination (R2) is 0.843. This means we can estimate monthly runoff in historical sequence highly significantly with short data of observation by above mentioned equation. (2) The optimal function to estimate monthly runoff in nonhistorical sequence is hybrid model combined with multiple linear regression equation in overall-month unit and stochastic component, that is; Qn=0. 788Pn+0. l30Qn-1-0. 273En-0. 10+Sy.t The rest 15% of unexplained variance of monthly runoff can be explained by addition of stochastic process and a bit more reliable results of statistical characteristics of monthly runoff in non-historical sequence are derived. This estimated monthly runoff in non-historical sequence shows up the extraordinary value (maximum, minimum value) which is not appeared in the observed runoff as a random component. (3) "Frequency best fit coefficient" (R2f) of multiple linear regression equation is 0.847 which is the same value as Gaijyama's one. This implies that multiple linear regression equation and Gajiyama formula are theoretically rather reasonable functions.

  • PDF

Relation between the Building Exterior Conditions and Energy Costs in the Running period of the Apartment Housing (공동주택의 건물외부조건과 에너지비용과의 관계분석)

  • Lee, Kang-Hee;Ryu, Seung-Hoon;Lee, Yeun-Taek
    • KIEAE Journal
    • /
    • v.9 no.1
    • /
    • pp.107-113
    • /
    • 2009
  • The energy cost is resulted from the energy use. Its sources are divided into some types and depended on the building use or energy-use type. The energy cost should be affected by the amount of the energy use. The cost could be calculated to consider various factors such as the insulation, heating type, building shape and others. But it can not consider all of the affect factors to the energy cost and need to categorize the factors to the condition for estimating the cost. In this paper, it aimed at providing the estimation model in linear equation and multiple linear regression, utilizing the building exterior condition and management characteristics in apartment housing. Its survey are conducted in two parts of management characteristics and building exterior condition. The correlation analysis is conducted to get rid of the multicolinearity among the inputted factors. The number of linear equation model is 11 and includes the 1st, 2nd and 3rd equation function, power function and others. Among these, it suggested the 2nd and 3rd function and power function in terms of the statistics. In multiple linear regression model, the building volume and management area are inputted to the estimation.

Reliability Improvement of In-Place Concreter Strength Prediction by Ultrasonic Pulse Velocity Method (초음파 속도법에 의한 현장 콘크리트 강도추정의 신뢰성 향상)

  • 원종필;박성기
    • Magazine of the Korean Society of Agricultural Engineers
    • /
    • v.43 no.4
    • /
    • pp.97-105
    • /
    • 2001
  • The ultrasonic pulse velocity test has a strong potential to be developed into a very useful and relatively inexpensive in-place test for assuring the quality of concrete placed in structure. The main problem in realizing this potential is that the relationship between compressive strength ad ultrasonic pulse velocity is uncertain and concrete is an inherently variable material. The objective of this study is to improve the reliability of in-place concrete strength predictions by ultrasonic pulse velocity method. Experimental cement content, s/a rate, and curing condition of concrete. Accuracy of the prediction expressed in empirical formula are examined by multiple regression analysis and linear regression analysis and practical equation for estimation the concrete strength are proposed. Multiple regression model uses water-cement ratio cement content s/a rate, and pulse velocity as dependent variables and the compressive strength as an independent variable. Also linear regression model is used to only pulse velocity as dependent variables. Comparing the results of the analysis the proposed equation expressed highest reliability than other previous proposed equations.

  • PDF

Estimation model of coefficient of permeability of soil layer using linear regression analysis (단순회귀분석에 의한 토층지반의 투수계수 산정모델)

  • Lee, Moon-Se;Kim, Kyeong-Su
    • Proceedings of the Korean Geotechical Society Conference
    • /
    • 2009.03a
    • /
    • pp.1043-1052
    • /
    • 2009
  • To derive easily the coefficient of permeability from several other soil properties, the estimation model of coefficient of permeability was proposed using linear regression analysis. The coefficient of permeability is one of the major factors to evaluate the soil characteristics. The study area is located in Kangwon-do Pyeongchang-gun Jinbu-Myeon. Soil samples of 45 spots were taken from the study area and various soil tests were carried out in laboratory. After selecting the soil factor influenced by the coefficient of permeability through the correlation analysis, the estimation model of coefficient of permeability was developed using the linear regression analysis between the selected soil factor and the coefficient of permeability from permeability test. Also, the estimation model of coefficient of permeability was compared with the results from permeability test and empirical equation, and the suitability of proposed model was proved. As the result of correlation analysis between various soil factors and the coefficient of permeability using SPSS(statistical package for the social sciences), the largest influence factor of coefficient of permeability were the effective grain size, porosity and dry unit weight. The coefficient of permeability calculated from the proposed model was similar to that resulted from permeability test. Therefore, the proposed model can be used in case of estimating the coefficient of permeability at the same soil condition like study area.

  • PDF

A Causational Study for Urban 4-legged Signalized Intersections using Structural Equation Method (구조방정식을 이용한 도시부 4지 신호교차로의 사고원인 분석)

  • Oh, Jutaek;Lee, Sangkyu;Heo, Taeyoung;Hwang, Jeongwon
    • International Journal of Highway Engineering
    • /
    • v.14 no.6
    • /
    • pp.121-129
    • /
    • 2012
  • PURPOSES : Traffic accidents at intersections have been increased annually so that it is required to examine the causations to reduce the accidents. However, the current existing accident models were developed mainly with non-linear regression models such as Poisson methods. These non-linear regression methods lack to reveal complicated causations for traffic accidents, though they are right choices to study randomness and non-linearity of accidents. Therefore, to reveal the complicated causations of traffic accidents, this study used structural equation methods(SEM). METHODS : SEM used in this study is a statistical technique for estimating causal relations using a combination of statistical data and qualitative causal assumptions. SEM allow exploratory modeling, meaning they are suited to theory development. The method is tested against the obtained measurement data to determine how well the model fits the data. Among the strengths of SEM is the ability to construct latent variables: variables which are not measured directly, but are estimated in the model from several measured variables. This allows the modeler to explicitly capture the unreliability of measurement in the model, which allows the structural relations between latent variables to be accurately estimated. RESULTS : The study results showed that causal factors could be grouped into 3. Factor 1 includes traffic variables, and Factor 2 contains turning traffic variables. Factor 3 consists of other road element variables such as speed limits or signal cycles. CONCLUSIONS : Non-linear regression models can be used to develop accident predictions models. However, they lack to estimate causal factors, because they select only few significant variables to raise the accuracy of the model performance. Compared to the regressions, SEM has merits to estimate causal factors affecting accidents, because it allows the structural relations between latent variables. Therefore, this study used SEM to estimate causal factors affecting accident at urban signalized intersections.

Hybrid Fuzzy Least Squares Support Vector Machine Regression for Crisp Input and Fuzzy Output

  • Shim, Joo-Yong;Seok, Kyung-Ha;Hwang, Chang-Ha
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.2
    • /
    • pp.141-151
    • /
    • 2010
  • Hybrid fuzzy regression analysis is used for integrating randomness and fuzziness into a regression model. Least squares support vector machine(LS-SVM) has been very successful in pattern recognition and function estimation problems for crisp data. This paper proposes a new method to evaluate hybrid fuzzy linear and nonlinear regression models with crisp inputs and fuzzy output using weighted fuzzy arithmetic(WFA) and LS-SVM. LS-SVM allows us to perform fuzzy nonlinear regression analysis by constructing a fuzzy linear regression function in a high dimensional feature space. The proposed method is not computationally expensive since its solution is obtained from a simple linear equation system. In particular, this method is a very attractive approach to modeling nonlinear data, and is nonparametric method in the sense that we do not have to assume the underlying model function for fuzzy nonlinear regression model with crisp inputs and fuzzy output. Experimental results are then presented which indicate the performance of this method.

Joint parameter identification of a cantilever beam using sub-structure synthesis and multi-linear regression

  • Ingole, Sanjay B.;Chatterjee, Animesh
    • Structural Engineering and Mechanics
    • /
    • v.45 no.4
    • /
    • pp.423-437
    • /
    • 2013
  • Complex structures are usually assembled from several substructures with joints connecting them together. These joints have significant effects on the dynamic behavior of the assembled structure and must be accurately modeled. In structural analysis, these joints are often simplified by assuming ideal boundary conditions. However, the dynamic behavior predicted on the basis of the simplified model may have significant errors. This has prompted the researchers to include the effect of joint stiffness in the structural model and to estimate the stiffness parameters using inverse dynamics. In the present work, structural joints have been modeled as a pair of translational and rotational springs and frequency equation of the overall system has been developed using sub-structure synthesis. It is shown that using first few natural frequencies of the system, one can obtain a set of over-determined system of equations involving the unknown stiffness parameters. Method of multi-linear regression is then applied to obtain the best estimate of the unknown stiffness parameters. The estimation procedure has been developed for a two parameter joint stiffness matrix.

A Study of Weighing System to Apply into Hydraulic Excavator with CNN (CNN기반 굴삭기용 부하 측정 시스템 구현을 위한 연구)

  • Hwang Hun Jeong;Young Il Shin;Jin Ho Lee;Ki Yong Cho
    • Journal of Drive and Control
    • /
    • v.20 no.4
    • /
    • pp.133-139
    • /
    • 2023
  • A weighing system calculates the bucket's excavation amount of an excavator. Usually, the excavation amount is computed by the excavator's motion equations with sensing data. But these motion equations have computing errors that are induced by assumptions to the linear systems and identification of the equation's parameters. To reduce computing errors, some commercial weighing system incorporates particular motion into the excavation process. This study introduces a linear regression model on an artificial neural network that has fewer predicted errors and doesn't need a particular pose during an excavation. Time serial data were gathered from a 30tons excavator's loading test. Then these data were preprocessed to be adjusted by MPL (Multi Layer Perceptron) or CNN (Convolutional Neural Network) based linear regression models. Each model was trained by changing hyperparameter such as layer or node numbers, drop-out rate, and kernel size. Finally ID-CNN-based linear regression model was selected.

Price Monitoring Automation with Marketing Forecasting Methods

  • Oksana Penkova;Oleksandr Zakharchuk;Ivan Blahun;Alina Berher;Veronika Nechytailo;Andrii Kharenko
    • International Journal of Computer Science & Network Security
    • /
    • v.23 no.9
    • /
    • pp.37-46
    • /
    • 2023
  • The main aim of the article is to solve the problem of automating price monitoring using marketing forecasting methods and Excel functionality under martial law. The study used the method of algorithms, trend analysis, correlation and regression analysis, ANOVA, extrapolation, index method, etc. The importance of monitoring consumer price developments in market pricing at the macro and micro levels is proved. The introduction of a Dummy variable to account for the influence of martial law in market pricing is proposed, both in linear multiple regression modelling and in forecasting the components of the Consumer Price Index. Experimentally, the high reliability of forecasting based on a five-factor linear regression model with a Dummy variable was proved in comparison with a linear trend equation and a four-factor linear regression model. Pessimistic, realistic and optimistic scenarios were developed for forecasting the Consumer Price Index for the situation of the end of the Russian-Ukrainian war until the end of 2023 and separately until the end of 2024.

Unified methods for variable selection and outlier detection in a linear regression

  • Seo, Han Son
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.6
    • /
    • pp.575-582
    • /
    • 2019
  • The problem of selecting variables in the presence of outliers is considered. Variable selection and outlier detection are not separable problems because each observation affects the fitted regression equation differently and has a different influence on each variable. We suggest a simultaneous method for variable selection and outlier detection in a linear regression model. The suggested procedure uses a sequential method to detect outliers and uses all possible subset regressions for model selections. A simplified version of the procedure is also proposed to reduce the computational burden. The procedures are compared to other variable selection methods using real data sets known to contain outliers. Examples show that the proposed procedures are effective and superior to robust algorithms in selecting the best model.