• 제목/요약/키워드: Multiple regression model

검색결과 2,531건 처리시간 0.028초

COST PERFORMANCE PREDICTION FOR INTERNATIONAL CONSTRUCTION PROJECTS USING MULTIPLE REGRESSION ANALYSIS AND STRUCTURAL EQUATION MODEL: A COMPARATIVE STUDY

  • D.Y. Kim;S.H. Han;H. Kim;H. Park
    • 국제학술발표논문집
    • /
    • The 2th International Conference on Construction Engineering and Project Management
    • /
    • pp.653-661
    • /
    • 2007
  • Overseas construction projects tend to be more complex than domestic projects, being exposed to more external risks, such as politics, economy, society, and culture, as well as more internal risks from the project itself. It is crucial to have an early understanding of the project condition, in order to be well prepared in various phases of the project. This study compares a structural equation model and multiple regression analysis, in their capacity to predict cost performance of international construction projects. The structural equation model shows a more accurate prediction of cost performance than does regression analysis, due to its intrinsic capability of considering various cost factors in a systematic way.

  • PDF

딥러닝 모형을 이용한 신호교차로 대기행렬길이 예측 (Predicting a Queue Length Using a Deep Learning Model at Signalized Intersections)

  • 나다혁;이상수;조근민;김호연
    • 한국ITS학회 논문지
    • /
    • 제20권6호
    • /
    • pp.26-36
    • /
    • 2021
  • 본 연구는 영상검지기에서 수집되는 정보를 활용하여 딥러닝 기반으로 대기행렬길이를 예측하는 모형을 개발하였다. 그리고 통계적 기법인 다중회귀 모형을 추정하여 평균절대오차와 평균제곱근오차의 두 지표를 이용하여 비교·평가하였다. 다중회귀분석 결과, 시간, 요일, 점유율, 버스 교통량이 유효한 변수로 도출되었고, 이 중에서 독립변수들의 종속변수에 대한 영향력은 점유율이 가장 큰 것으로 나타났다. 딥러닝 최적 모형은 은닉층이 4겹, Look Back이 6으로 결정되었고, 평균절대오차와 평균제곱근오차가 6.34와 8.99로 나타났다. 그리고 두 모형을 평가한 결과, 다중회귀 모형과 딥러닝 모형의 평균절대오차는 각각 13.65와 6.44, 평균제곱근오차는 각각 19.10과 9.11로 계산되었다. 이는 딥러닝 모형이 다중회귀 모형과 비교하여 평균절대오차가 52.8%, 평균제곱근오차는 52.3% 감소된 결과이다.

A Study on the Influence of a Sewage Treatment Plant's Operational Parameters using the Multiple Regression Analysis Model

  • Lee, Seung-Pil;Min, Sang-Yun;Kim, Jin-Sik;Park, Jong-Un;Kim, Man-Soo
    • Environmental Engineering Research
    • /
    • 제19권1호
    • /
    • pp.31-36
    • /
    • 2014
  • In this study, the influence of the control and operational parameters within a sewage treatment plant were reviewed by performing multiple regression analysis on the effluent quality of the sewage treatment. The data used for this review are based on the actual data from a sewage treatment plant using the media process within the year 2012. The prediction models of chemical oxygen demand ($COD_{Mn}$) and total nitrogen (T-N) within the effluent of the 2nd settling tank based on the multiple regression analysis yielded the prediction accuracy measurements of 0.93 and 0.84, respectively; and it was concluded that the model was accurately predicting the variances of the actual observed values. If the data on the energy spent on each operating condition can be collected, then the operating parameter that conserves energy without violating the effluent quality standards of COD and T-N can be determined using the regression model and the standardized regression coefficients. These results can provide appropriate operation guidelines to conserve energy to the operators at sewage treatment plants that consume a lot of energy.

TIME SERIES PREDICTION USING INCREMENTAL REGRESSION

  • Kim, Sung-Hyun;Lee, Yong-Mi;Jin, Long;Chai, Duck-Jin;Ryu, Keun-Ho
    • 대한원격탐사학회:학술대회논문집
    • /
    • 대한원격탐사학회 2006년도 Proceedings of ISRS 2006 PORSEC Volume II
    • /
    • pp.635-638
    • /
    • 2006
  • Regression of conventional prediction techniques in data mining uses the model which is generated from the training step. This model is applied to new input data without any change. If this model is applied directly to time series, the rate of prediction accuracy will be decreased. This paper proposes an incremental regression for time series prediction like typhoon track prediction. This technique considers the characteristic of time series which may be changed over time. It is composed of two steps. The first step executes a fractional process for applying input data to the regression model. The second step updates the model by using its information as new data. Additionally, the model is maintained by only recent data in a queue. This approach has the following two advantages. It maintains the minimum information of the model by using a matrix, so space complexity is reduced. Moreover, it prevents the increment of error rate by updating the model over time. Accuracy rate of the proposed method is measured by RME(Relative Mean Error) and RMSE(Root Mean Square Error). The results of typhoon track prediction experiment are performed by the proposed technique IMLR(Incremental Multiple Linear Regression) is more efficient than those of MLR(Multiple Linear Regression) and SVR(Support Vector Regression).

  • PDF

MOISTURE CONTENT MEASUREMENT OF POWDERED FOOD USING RF IMPEDANCE SPECTROSCOPIC METHOD

  • Kim, K. B.;Lee, J. W.;S. H. Noh;Lee, S. S.
    • 한국농업기계학회:학술대회논문집
    • /
    • 한국농업기계학회 2000년도 THE THIRD INTERNATIONAL CONFERENCE ON AGRICULTURAL MACHINERY ENGINEERING. V.II
    • /
    • pp.188-195
    • /
    • 2000
  • This study was conducted to measure the moisture content of powdered food using RF impedance spectroscopic method. In frequency range of 1.0 to 30㎒, the impedance such as reactance and resistance of parallel plate type sample holder filled with wheat flour and red-pepper powder of which moisture content range were 5.93∼-17.07%w.b. and 10.87 ∼ 27.36%w.b., respectively, was characterized using by Q-meter (HP4342). The reactance was a better parameter than the resistance in estimating the moisture density defined as product of moisture content and bulk density which was used to eliminate the effect of bulk density on RF spectral data in this study. Multivariate data analyses such as principal component regression, partial least square regression and multiple linear regression were performed to develop one calibration model having moisture density and reactance spectral data as parameters for determination of moisture content of both wheat flour and red-pepper powder. The best regression model was one by the multiple linear regression model. Its performance for unknown data of powdered food was showed that the bias, standard error of prediction and determination coefficient are 0.179% moisture content, 1.679% moisture content and 0.8849, respectively.

  • PDF

회귀 모델을 활용한 철강 기업의 에너지 소비 예측 (Forecasting Energy Consumption of Steel Industry Using Regression Model)

  • Sung-Ho KANG;Hyun-Ki KIM
    • Journal of Korea Artificial Intelligence Association
    • /
    • 제1권2호
    • /
    • pp.21-25
    • /
    • 2023
  • The purpose of this study was to compare the performance using multiple regression models to predict the energy consumption of steel industry. Specific independent variables were selected in consideration of correlation among various attributes such as CO2 concentration, NSM, Week Status, Day of week, and Load Type, and preprocessing was performed to solve the multicollinearity problem. In data preprocessing, we evaluated linear and nonlinear relationships between each attribute through correlation analysis. In particular, we decided to select variables with high correlation and include appropriate variables in the final model to prevent multicollinearity problems. Among the many regression models learned, Boosted Decision Tree Regression showed the best predictive performance. Ensemble learning in this model was able to effectively learn complex patterns while preventing overfitting by combining multiple decision trees. Consequently, these predictive models are expected to provide important information for improving energy efficiency and management decision-making at steel industry. In the future, we plan to improve the performance of the model by collecting more data and extending variables, and the application of the model considering interactions with external factors will also be considered.

Exact Confidence Intervals on the Regression Coeffcients in Multiple Regression Model with Nested Error Structure

  • Park, Dong-Joon
    • Communications for Statistical Applications and Methods
    • /
    • 제4권2호
    • /
    • pp.541-548
    • /
    • 1997
  • In regression model with nested error structure interval estimations on regression coefficients in different stages are proposed. Ordinary least square estimators and generalized least square estimators of the regression coefficients in this model are derived for between and within group model. The confidence intervals are dervied by using independent idstributional properties between regression coefficient estimators and quadratic froms obtained from the model.

  • PDF

다중회귀분석을 활용한 하수처리시설 에너지 소비량 예측모델 개발 (Development of Energy Consumption Estimation Model Using Multiple Regression Analysis)

  • 신원재;정용준;김예진
    • 한국환경과학회지
    • /
    • 제24권11호
    • /
    • pp.1443-1450
    • /
    • 2015
  • Wastewater treatment plant(WWTP) has been recognized as a high energy consuming plant. Usually many WWTPs has been operated in the excessive operation conditions in order to maintain stable wastewater treatment. The energy required at WWTPs consists of various subparts such as pumping, aeration, and office maintenance. For management of energy comes from process operation, it can be useful to operators to provide some information about energy variations according to the adjustment of operational variables. In this study, multiple regression analysis was used to establish an energy estimation model. The independent variables for estimation energy were selected among operational variables. The $R^2$ value in the regression analysis appeared 0.68, and performance of the electric power prediction model had less than ${\pm}5%$ error.

Deletion diagnostics in fitting a given regression model to a new observation

  • Kim, Myung Geun
    • Communications for Statistical Applications and Methods
    • /
    • 제23권3호
    • /
    • pp.231-239
    • /
    • 2016
  • A graphical diagnostic method based on multiple case deletions in a regression context is introduced by using the sampling distribution of the difference between two least squares estimators with and without multiple cases. Principal components analysis plays a key role in deriving this diagnostic method. Multiple case deletions of test statistic are also considered when a new observation is fitted to a given regression model. The result is useful for detecting influential observations in econometric data analysis, for example in checking whether the consumption pattern at a later time is the same as the one found before or not, as well as for investigating the influence of cases in the usual regression model. An illustrative example is given.

Machine learning-based regression analysis for estimating Cerchar abrasivity index

  • Kwak, No-Sang;Ko, Tae Young
    • Geomechanics and Engineering
    • /
    • 제29권3호
    • /
    • pp.219-228
    • /
    • 2022
  • The most widely used parameter to represent rock abrasiveness is the Cerchar abrasivity index (CAI). The CAI value can be applied to predict wear in TBM cutters. It has been extensively demonstrated that the CAI is affected significantly by cementation degree, strength, and amount of abrasive minerals, i.e., the quartz content or equivalent quartz content in rocks. The relationship between the properties of rocks and the CAI is investigated in this study. A database comprising 223 observations that includes rock types, uniaxial compressive strengths, Brazilian tensile strengths, equivalent quartz contents, quartz contents, brittleness indices, and CAIs is constructed. A linear model is developed by selecting independent variables while considering multicollinearity after performing multiple regression analyses. Machine learning-based regression methods including support vector regression, regression tree regression, k-nearest neighbors regression, random forest regression, and artificial neural network regression are used in addition to multiple linear regression. The results of the random forest regression model show that it yields the best prediction performance.