• 제목/요약/키워드: multiple regression technique

검색결과 278건 처리시간 0.028초

ARTIFICIAL NEURAL NETWORK FOR PREDICTION OF WATER QUALITY IN PIPELINE SYSTEMS

  • Kim, Ju-Hwan;Yoon, Jae-Heung
    • Water Engineering Research
    • /
    • 제4권2호
    • /
    • pp.59-68
    • /
    • 2003
  • The applicabilities and validities of two methodologies fur the prediction of THM (trihalomethane) formation in a water pipeline system were proposed and discussed. One is the multiple regression technique and the other is an artificial neural network technique. There are many factors which influence water quality, especially THMs formations in water pipeline systems. In this study, the prediction models of THM formation in water pipeline systems are developed based on the independent variables proposed by American Water Works Association(AWWA). Multiple linear/nonlinear regression models are estimated and three layer feed-forward artificial neural networks have been used to predict the THM formation in a water pipeline system. Input parameters of the models consist of organic compounds measured in water pipeline systems such as TOC, DOC and UV254. Also, the reaction time to each measuring site along pipeline is used as input parameter calculated by a hydraulic analysis. Using these variables as model parameters, four models are developed. And the predicted results from the four developed models are compared statistically to the measured THMs data set. It is shown that the artificial neural network approaches are much superior to the conventional regression approaches and that the developed models by neural network can be used more efficiently and reproduce more accurately the THMs formation in water pipeline systems, than the conventional regression methods proposed by AWWA.

  • PDF

A Cost Estimation Model for Highway Projects in Korea

  • Kim, Soo-Yong;Kim, Young-Mok;Luu, Truong-Van
    • 한국건설관리학회:학술대회논문집
    • /
    • 한국건설관리학회 2008년도 정기학술발표대회 논문집
    • /
    • pp.922-925
    • /
    • 2008
  • Many highway projects are under way in Korea. However, owners frequently find that the project cost exceeds the budget and they are unable to identify the underlining reasons. The main purpose of this research is to develop cost models for transportation projects in Korea using the multiple linear regression (MLR). The data consist of 27 completed transportation projects, built from 1991 to 2001, The technique of multiple regression analysis is used to develop the parametric cost estimating model for total budget cost per highway square meter (TBC/$m^2$). Findings of the study indicated that MLR car be applied to highway projects in Korea. There are twf) major contributions of this research. (1) the identification of transportation parameters as a significant cost driver for transportation costs and (2) the successful development of the parametric cost estimating models for transportation projects in Korea.

  • PDF

Optimized Neural Network Weights and Biases Using Particle Swarm Optimization Algorithm for Prediction Applications

  • Ahmadzadeh, Ezat;Lee, Jieun;Moon, Inkyu
    • 한국멀티미디어학회논문지
    • /
    • 제20권8호
    • /
    • pp.1406-1420
    • /
    • 2017
  • Artificial neural networks (ANNs) play an important role in the fields of function approximation, prediction, and classification. ANN performance is critically dependent on the input parameters, including the number of neurons in each layer, and the optimal values of weights and biases assigned to each neuron. In this study, we apply the particle swarm optimization method, a popular optimization algorithm for determining the optimal values of weights and biases for every neuron in different layers of the ANN. Several regression models, including general linear regression, Fourier regression, smoothing spline, and polynomial regression, are conducted to evaluate the proposed method's prediction power compared to multiple linear regression (MLR) methods. In addition, residual analysis is conducted to evaluate the optimized ANN accuracy for both training and test datasets. The experimental results demonstrate that the proposed method can effectively determine optimal values for neuron weights and biases, and high accuracy results are obtained for prediction applications. Evaluations of the proposed method reveal that it can be used for prediction and estimation purposes, with a high accuracy ratio, and the designed model provides a reliable technique for optimization. The simulation results show that the optimized ANN exhibits superior performance to MLR for prediction purposes.

딥러닝 모형을 이용한 신호교차로 대기행렬길이 예측 (Predicting a Queue Length Using a Deep Learning Model at Signalized Intersections)

  • 나다혁;이상수;조근민;김호연
    • 한국ITS학회 논문지
    • /
    • 제20권6호
    • /
    • pp.26-36
    • /
    • 2021
  • 본 연구는 영상검지기에서 수집되는 정보를 활용하여 딥러닝 기반으로 대기행렬길이를 예측하는 모형을 개발하였다. 그리고 통계적 기법인 다중회귀 모형을 추정하여 평균절대오차와 평균제곱근오차의 두 지표를 이용하여 비교·평가하였다. 다중회귀분석 결과, 시간, 요일, 점유율, 버스 교통량이 유효한 변수로 도출되었고, 이 중에서 독립변수들의 종속변수에 대한 영향력은 점유율이 가장 큰 것으로 나타났다. 딥러닝 최적 모형은 은닉층이 4겹, Look Back이 6으로 결정되었고, 평균절대오차와 평균제곱근오차가 6.34와 8.99로 나타났다. 그리고 두 모형을 평가한 결과, 다중회귀 모형과 딥러닝 모형의 평균절대오차는 각각 13.65와 6.44, 평균제곱근오차는 각각 19.10과 9.11로 계산되었다. 이는 딥러닝 모형이 다중회귀 모형과 비교하여 평균절대오차가 52.8%, 평균제곱근오차는 52.3% 감소된 결과이다.

작품 가격 추정을 위한 기계 학습 기법의 응용 및 가격 결정 요인 분석 (Price Determinant Factors of Artworks and Prediction Model Based on Machine Learning)

  • 장동률;박민재
    • 품질경영학회지
    • /
    • 제47권4호
    • /
    • pp.687-700
    • /
    • 2019
  • Purpose: The purpose of this study is to investigate the interaction effects between price determinants of artworks. We expand the methodology in art market by applying machine learning techniques to estimate the price of artworks and compare linear regression and machine learning in terms of prediction accuracy. Methods: Moderated regression analysis was performed to verify the interaction effects of artistic characteristics on price. The moderating effects were studied by confirming the significance level of the interaction terms of the derived regression equation. In order to derive price estimation model, we use multiple linear regression analysis, which is a parametric statistical technique, and k-nearest neighbor (kNN) regression, which is a nonparametric statistical technique in machine learning methods. Results: Mostly, the influences of the price determinants of art are different according to the auction types and the artist 's reputation. However, the auction type did not control the influence of the genre of the work on the price. As a result of the analysis, the kNN regression was superior to the linear regression analysis based on the prediction accuracy. Conclusion: It provides a theoretical basis for the complexity that exists between pricing determinant factors of artworks. In addition, the nonparametric models and machine learning techniques as well as existing parameter models are implemented to estimate the artworks' price.

Restricted support vector quantile regression without crossing

  • Shim, Joo-Yong;Lee, Jang-Taek
    • Journal of the Korean Data and Information Science Society
    • /
    • 제21권6호
    • /
    • pp.1319-1325
    • /
    • 2010
  • Quantile regression provides a more complete statistical analysis of the stochastic relationships among random variables. Sometimes quantile functions estimated at different orders can cross each other. We propose a new non-crossing quantile regression method applying support vector median regression to restricted regression quantile, restricted support vector quantile regression. The proposed method provides a satisfying solution to estimating non-crossing quantile functions when multiple quantiles for high dimensional data are needed. We also present the model selection method that employs cross validation techniques for choosing the parameters which aect the performance of the proposed method. One real example and a simulated example are provided to show the usefulness of the proposed method.

Is it Possible to Predict the ADI of Pesticides using the QSAR Approach?

  • Kim, Jae Hyoun
    • 한국환경보건학회지
    • /
    • 제38권6호
    • /
    • pp.550-560
    • /
    • 2012
  • Objectives: QSAR methodology was applied to explain two different sets of acceptable daily intake (ADI) data of 74 pesticides proposed by both the USEPA and WHO in terms of setting guidelines for food and drinking water. Methods: A subset of calculated descriptors was selected from Dragon$^{(R)}$ software. QSARs were then developed utilizing a statistical technique, genetic algorithm-multiple linear regression (GA-MLR). The differences in each specific model in the prediction of the ADI of the pesticides were discussed. Results: The stepwise multiple linear regression analysis resulted in a statistically significant QSAR model with five descriptors. Resultant QSAR models were robust, showing good utility across multiple classes of pesticide compounds. The applicability domain was also defined. The proposed models were robust and satisfactory. Conclusions: The QSAR model could be a feasible and effective tool for predicting ADI and for the comparison of logADIEPA to logADIWHO. The statistical results agree with the fact that USEPA focuses on more subtle endpoints than does WHO.

비선형 회귀분석에 의한 엔드밀 가공조건에 따른 Al7075의 표면정도 예측 (Prediction of Surface Roughness of Al7075 on End-Milling Working Conditions by Non-linear Regression Analysis)

  • 조연상;박흥식
    • Tribology and Lubricants
    • /
    • 제26권6호
    • /
    • pp.329-335
    • /
    • 2010
  • Recently, the End-milling processing is needed the high-precise technique to get a good surface roughness and rapid time in manufacturing of precision machine parts and electronic parts. The optimum surface roughness has an effect on end-milling working condition such as, cutting direction, spindle speed, feed rate and depth of cut, and so on. It needs to form the correlation of working conditions and surface roughness. Therefore this study was carried out to presume of surface roughness on end-milling working condition of Al7075 by regression analysis. The results was shown that the coefficient of determination($R^2$) of regression equation had a fine reliability of 87.5% and nonlinear regression equation of surface rough was made by multiple regression analysis.

다중회귀분석법에 의한 소나무, 곰솔 및 리기다소나무의 상대성장 비교 (Comparisons on Relative Growth of Red Pine, Black Pine and Pitch Pine by Means of Multiple Regression)

  • 박만춘;이윤근;최기룡
    • 한국환경과학회지
    • /
    • 제19권3호
    • /
    • pp.305-312
    • /
    • 2010
  • The purpose of this study is to compare the relative growth of annual ring width of red pine(Pinus densiflora), black pine(Pinus thunbergii) and pitch pine(Pinus rigida) by means of multiple regression method according to Graybill hypothesis. The obtained results are as follows. 1. The changes of rainfall have affected to tree growth during the periods of 1975 through 1978. 2. Among these pine trees, red pine was mostly influenced by environmental factors. 3. The growth of annual ring width was sensitively responded to the changes of rainfall and air temperature. 4. Among the heavy metals analyzed, the concentrations(ppm) of Lead(Pb) and Copper(Cu) were negatively effected on the growth of annual ring width of pine trees. 5. The analytical technique of annual ring width may be useful for estimation of the pollution in forest areas near industrial complexes.

Wage Determinants Analysis by Quantile Regression Tree

  • Chang, Young-Jae
    • Communications for Statistical Applications and Methods
    • /
    • 제19권2호
    • /
    • pp.293-301
    • /
    • 2012
  • Quantile regression proposed by Koenker and Bassett (1978) is a statistical technique that estimates conditional quantiles. The advantage of using quantile regression is the robustness in response to large outliers compared to ordinary least squares(OLS) regression. A regression tree approach has been applied to OLS problems to fit flexible models. Loh (2002) proposed the GUIDE algorithm that has a negligible selection bias and relatively low computational cost. Quantile regression can be regarded as an analogue of OLS, therefore it can also be applied to GUIDE regression tree method. Chaudhuri and Loh (2002) proposed a nonparametric quantile regression method that blends key features of piecewise polynomial quantile regression and tree-structured regression based on adaptive recursive partitioning. Lee and Lee (2006) investigated wage determinants in the Korean labor market using the Korean Labor and Income Panel Study(KLIPS). Following Lee and Lee, we fit three kinds of quantile regression tree models to KLIPS data with respect to the quantiles, 0.05, 0.2, 0.5, 0.8, and 0.95. Among the three models, multiple linear piecewise quantile regression model forms the shortest tree structure, while the piecewise constant quantile regression model has a deeper tree structure with more terminal nodes in general. Age, gender, marriage status, and education seem to be the determinants of the wage level throughout the quantiles; in addition, education experience appears as the important determinant of the wage level in the highly paid group.