• Title/Summary/Keyword: Multiple Linear Regression(MLR)

Search Result 124, Processing Time 0.021 seconds

Is it Possible to Predict the ADI of Pesticides using the QSAR Approach?

  • Kim, Jae Hyoun
    • Journal of Environmental Health Sciences
    • /
    • v.38 no.6
    • /
    • pp.550-560
    • /
    • 2012
  • Objectives: QSAR methodology was applied to explain two different sets of acceptable daily intake (ADI) data of 74 pesticides proposed by both the USEPA and WHO in terms of setting guidelines for food and drinking water. Methods: A subset of calculated descriptors was selected from Dragon$^{(R)}$ software. QSARs were then developed utilizing a statistical technique, genetic algorithm-multiple linear regression (GA-MLR). The differences in each specific model in the prediction of the ADI of the pesticides were discussed. Results: The stepwise multiple linear regression analysis resulted in a statistically significant QSAR model with five descriptors. Resultant QSAR models were robust, showing good utility across multiple classes of pesticide compounds. The applicability domain was also defined. The proposed models were robust and satisfactory. Conclusions: The QSAR model could be a feasible and effective tool for predicting ADI and for the comparison of logADIEPA to logADIWHO. The statistical results agree with the fact that USEPA focuses on more subtle endpoints than does WHO.

Applicability evaluation of aerodynamic approaches for evaporation estimation using pan evaporation data (증발접시 증발량자료를 이용한 공기동력학적 증발량 산정 방법의 적용성 평가)

  • Rim, Chang-Soo
    • Journal of Korea Water Resources Association
    • /
    • v.50 no.11
    • /
    • pp.781-793
    • /
    • 2017
  • In this study, applicabilities of aerodynamic approaches for the estimation of pan evaporation were evaluated on 56 study stations in South Korea. To accomplish this study purpose, previous researchers' evaporation estimation equations based on aerodynamic approaches were grouped into seven generalized evaporation models. Furthermore, four multiple linear regression (MLR) models were developed and tested. The independent variables of MLR models are meteorological variables such as wind speed, vapor pressure deficit, air temperature, and atmospheric pressure. These meteorological variables are required for the application of aerodynamic approaches. In order to consider the effect of autocorrelation, MLR models were developed after differencing variables. The applicability of MLR models with differenced variables was compared with that of MLR models with undifferenced variables and the comparison results showed no significant difference between the two methods. The study results have indicated that there is strong correlation between estimated pan evaporation (using aerodynamic models and MLR models) and measured pan evaporation. However, pan evaporation are overestimated during August, September, October, November, and December. Most of meteorological variables that are used for MLR models show statistical significance in the estimation of pan evaporation. Vapor pressure deficit was turned out to be the most significant meteorological variable. The second most significant variable was air temperature; wind speed was the third most significant variable, followed by atmospheric pressure.

Prediction of retention of uncharged solutes in nanofiltration by means of molecular descriptors

  • Nowaczyk, Alicja;Nowaczyk, Jacek;Koter, Stanislaw
    • Membrane and Water Treatment
    • /
    • v.1 no.3
    • /
    • pp.181-192
    • /
    • 2010
  • A linear quantitative structure-property relationship (QSPR) model is presented for the prediction of rejection in permeation through membrane. The model was produced by using the multiple linear regression (MLR) technique on the database consisting of retention data of 25 pesticides in 4 different membrane separation experiments. Among the 3224 different physicochemical, topological and structural descriptors that were considered as inputs to the model only 50 were selected using several criteria of elimination. The physical meaning of chosen descriptor is discussed in detail. The accuracy of the proposed MLR models is illustrated using the following evaluation techniques: leave-one-out cross validation procedure, leave-many-out cross validation procedure and Y-randomization.

Yield Prediction of Chinese Cabbage (Brassicaceae) Using Broadband Multispectral Imagery Mounted Unmanned Aerial System in the Air and Narrowband Hyperspectral Imagery on the Ground

  • Kang, Ye Seong;Ryu, Chan Seok;Kim, Seong Heon;Jun, Sae Rom;Jang, Si Hyeong;Park, Jun Woo;Sarkar, Tapash Kumar;Song, Hye young
    • Journal of Biosystems Engineering
    • /
    • v.43 no.2
    • /
    • pp.138-147
    • /
    • 2018
  • Purpose: A narrowband hyperspectral imaging sensor of high-dimensional spectral bands is advantageous for identifying the reflectance by selecting the significant spectral bands for predicting crop yield over the broadband multispectral imaging sensor for each wavelength range of the crop canopy. The images acquired by each imaging sensor were used to develop the models for predicting the Chinese cabbage yield. Methods: The models for predicting the Chinese cabbage (Brassica campestris L.) yield, with multispectral images based on unmanned aerial vehicle (UAV), were developed by simple linear regression (SLR) using vegetation indices, and forward stepwise multiple linear regression (MLR) using four spectral bands. The model with hyperspectral images based on the ground were developed using forward stepwise MLR from the significant spectral bands selected by dimension reduction methods based on a partial least squares regression (PLSR) model of high precision and accuracy. Results: The SLR model by the multispectral image cannot predict the yield well because of its low sensitivity in high fresh weight. Despite improved sensitivity in high fresh weight of the MLR model, its precision and accuracy was unsuitable for predicting the yield as its $R^2$ is 0.697, root-mean-square error (RMSE) is 1170 g/plant, relative error (RE) is 67.1%. When selecting the significant spectral bands for predicting the yield using hyperspectral images, the MLR model using four spectral bands show high precision and accuracy, with 0.891 for $R^2$, 616 g/plant for the RMSE, and 35.3% for the RE. Conclusions: Little difference was observed in the precision and accuracy of the PLSR model of 0.896 for $R^2$, 576.7 g/plant for the RMSE, and 33.1% for the RE, compared with the MLR model. If the multispectral imaging sensor composed of the significant spectral bands is produced, the crop yield of a wide area can be predicted using a UAV.

Prediction of UCS and STS of Kaolin clay stabilized with supplementary cementitious material using ANN and MLR

  • Kumar, Arvind;Rupali, S.
    • Advances in Computational Design
    • /
    • v.5 no.2
    • /
    • pp.195-207
    • /
    • 2020
  • The present study focuses on the application of artificial neural network (ANN) and Multiple linear Regression (MLR) analysis for developing a model to predict the unconfined compressive strength (UCS) and split tensile strength (STS) of the fiber reinforced clay stabilized with grass ash, fly ash and lime. Unconfined compressive strength and Split tensile strength are the nonlinear functions and becomes difficult for developing a predicting model. Artificial neural networks are the efficient tools for predicting models possessing non linearity and are used in the present study along with regression analysis for predicting both UCS and STS. The data required for the model was obtained by systematic experiments performed on only Kaolin clay, clay mixed with varying percentages of fly ash, grass ash, polypropylene fibers and lime as between 10-20%, 1-4%, 0-1.5% and 0-8% respectively. Further, the optimum values of the various stabilizing materials were determined from the experiments. The effect of stabilization is observed by performing compaction tests, split tensile tests and unconfined compression tests. ANN models are trained using the inputs and targets obtained from the experiments. Performance of ANN and Regression analysis is checked with statistical error of correlation coefficient (R) and both the methods predict the UCS and STS values quite well; but it is observed that ANN can predict both the values of UCS as well as STS simultaneously whereas MLR predicts the values separately. It is also observed that only STS values can be predicted efficiently by MLR.

TIME SERIES PREDICTION USING INCREMENTAL REGRESSION

  • Kim, Sung-Hyun;Lee, Yong-Mi;Jin, Long;Chai, Duck-Jin;Ryu, Keun-Ho
    • Proceedings of the KSRS Conference
    • /
    • v.2
    • /
    • pp.635-638
    • /
    • 2006
  • Regression of conventional prediction techniques in data mining uses the model which is generated from the training step. This model is applied to new input data without any change. If this model is applied directly to time series, the rate of prediction accuracy will be decreased. This paper proposes an incremental regression for time series prediction like typhoon track prediction. This technique considers the characteristic of time series which may be changed over time. It is composed of two steps. The first step executes a fractional process for applying input data to the regression model. The second step updates the model by using its information as new data. Additionally, the model is maintained by only recent data in a queue. This approach has the following two advantages. It maintains the minimum information of the model by using a matrix, so space complexity is reduced. Moreover, it prevents the increment of error rate by updating the model over time. Accuracy rate of the proposed method is measured by RME(Relative Mean Error) and RMSE(Root Mean Square Error). The results of typhoon track prediction experiment are performed by the proposed technique IMLR(Incremental Multiple Linear Regression) is more efficient than those of MLR(Multiple Linear Regression) and SVR(Support Vector Regression).

  • PDF

QSPR Study of the Absorption Maxima of Azobenzene Dyes

  • Xu, Jie;Wang, Lei;Liu, Li;Bai, Zikui;Wang, Luoxin
    • Bulletin of the Korean Chemical Society
    • /
    • v.32 no.11
    • /
    • pp.3865-3872
    • /
    • 2011
  • A quantitative structure-property relationship (QSPR) study was performed for the prediction of the absorption maxima of azobenzene dyes. The entire set of 191 azobenzenes was divided into a training set of 150 azobenzenes and a test set of 41 azobenzenes according to Kennard and Stones algorithm. A seven-descriptor model, with squared correlation coefficient ($R^2$) of 0.8755 and standard error of estimation (s) of 14.476, was developed by applying stepwise multiple linear regression (MLR) analysis on the training set. The reliability of the proposed model was further illustrated using various evaluation techniques: leave-many-out crossvalidation procedure, randomization tests, and validation through the test set.

An Innovative Application Method of Monthly Load Forecasting for Smart IEDs

  • Choi, Myeon-Song;Xiang, Ling;Lee, Seung-Jae;Kim, Tae-Wan
    • Journal of Electrical Engineering and Technology
    • /
    • v.8 no.5
    • /
    • pp.984-990
    • /
    • 2013
  • This paper develops a new Intelligent Electronic Device (IED), and then presents an application method of a monthly load forecasting algorithm on the smart IEDs. A Multiple Linear Regression (MLR) model implemented with Recursive Least Square (RLS) estimation is established in the algorithm. Case Study proves the accuracy and reliability of this algorithm and demonstrates the practical meanings through designed screens. The application method shows the general way to make use of IED's smart characteristics and thereby reveals a broad prospect of smart function realization in application.

Effect of Entrepreneurial Ecosystem Quality on Entrepreneurship Performance (창업 생태계 품질이 창업 성과에 미치는 영향)

  • Lee, Eun-Ji;Cho, Young-Ju
    • Journal of Korean Society for Quality Management
    • /
    • v.50 no.3
    • /
    • pp.305-332
    • /
    • 2022
  • Purpose: As the public interest in entrepreneurship has been highlighted and entrepreneurship policies have been generated, this study is to construct Entrepreneurship Ecosystem (EE) models which have a significant relationship to national entrepreneurship with quantitative analysis. It aims to provide implications to EE policymakers that which national components are effective in cultivating innovative entrepreneurship and validate its EE quality based on quantitative performance goals. Methods: This study utilizes secondary data, categorized under the PESTLE factor from credible international organizations (WB, UNDP, GEM, GEDI, and OECD) to determine significant factors in the quality of the entrepreneurial ecosystem. This paper uses the Multiple Linear Regression (MLR) analysis to select the significant variables contributing to entrepreneurship performance. Using the AUC-ROC performance evaluation method for machine learning MLR results, this paper evaluates the performance of EE models so that it can allow approving EE quality by predicting potential performance. Results: Among nine hypothesis models, MLR analysis examines that the number of the Unicorn company, Unicorn companies' economic value, and entrepreneurship measured as GEI can be reasonable dependent variables to indicate the performance derived from EE quality. Rather than government policies and regulations, the social, finance, technology, and economic variables are significant factors of EE quality determining its performance. By having high Area Under Curve values under AUC-ROC analysis, accepted MLR models are regarded as having high prediction accuracy. Conclusion: Superior EE contributes to the outstanding Unicorn companies, and improvement in macro-environmental components can enhance EE quality.

Determination of Rice Milling Ratio by Visible / Near-Infrared Spectroscopy (가시광선 / 근적외선 분광 분석법을 이용한 쌀의 정백수율 측정)

  • 김재민;민봉기;최창현
    • Journal of Biosystems Engineering
    • /
    • v.22 no.3
    • /
    • pp.333-342
    • /
    • 1997
  • The objective of this research was to develop model equations for measuring rice milling ratio by using visible / HIR spectroscopy. Twelve kinds of brown rice(n = 149) were milled to obtain various milling ratio ranged from 86% to 94%. Visible/NIR spectra were collected with a spectrophotometer with sample transport module. The reflectance and transmission spectra were measured in the range of 400~2, 500nm and 600~1, 400nm, respectively, with 2 nm intervals. Multiple linear regression(MLR), Partial least square (PLS), and Artificial neural network(ANN) were used to develop models. Model developed with reflectance spectra showed better prediction results then those with transmission spectra. The MLR model with six-wavelength obtained from first derivative spectra gave to the best results for measuring the rice milling ratio(SEP = 0.535, , $r^2$ = 0.980). The PLS model(SEP = 0.604, $r^2$= 0.976) and ANN model(SEP = 0.566, $r^2$= 0.978) also can be used to determine the rice milling ratio effectively.

  • PDF