• Title/Summary/Keyword: Multiple linear Regression


A New Deletion Criterion of Principal Components Regression with Orientations of the Parameters

  • Lee, Won-Woo
    • Journal of the Korean Statistical Society / v.16 no.2 / pp.55-70 / 1987
  • Principal components regression is one of the substitutes for the least squares method when multicollinearity exists in the multiple linear regression model. It is observed graphically that the performance of principal components regression depends strongly on the values of the parameters. Accordingly, a new deletion criterion that determines which principal components should be deleted from the analysis is developed, and its usefulness is checked by simulations.

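As a rough illustration of principal components regression under multicollinearity, the following is a minimal sketch, not the deletion criterion proposed in the paper (the abstract does not specify it). The function name `pcr_fit`, the synthetic data, and the choice to drop the smallest-eigenvalue component are all assumptions made for illustration.

```python
import numpy as np

def pcr_fit(X, y, keep):
    """Principal components regression using only the components in `keep`.

    `keep` lists component indices, ordered by decreasing eigenvalue.
    Returns (intercept, beta) on the scale of the original predictors.
    """
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    # Eigen-decomposition of X'X: columns of V are the principal directions.
    eigvals, V = np.linalg.eigh(Xc.T @ Xc)
    order = np.argsort(eigvals)[::-1]
    eigvals, V = eigvals[order], V[:, order]
    Vk = V[:, keep]
    Z = Xc @ Vk                          # component scores
    gamma = (Z.T @ yc) / eigvals[keep]   # Z'Z is diagonal with these eigenvalues
    beta = Vk @ gamma                    # map back to the original predictors
    intercept = y_mean - x_mean @ beta
    return intercept, beta

# Synthetic, nearly collinear predictors (illustrative data only).
rng = np.random.default_rng(0)
x1, x2 = rng.normal(size=200), rng.normal(size=200)
x3 = x1 + x2 + 0.05 * rng.normal(size=200)   # almost a linear combination
X = np.column_stack([x1, x2, x3])
y = 1.0 + x1 + x2 + x3 + 0.1 * rng.normal(size=200)

# Keeping every component reproduces ordinary least squares.
b0_full, beta_full = pcr_fit(X, y, keep=[0, 1, 2])
# Dropping the smallest-eigenvalue component is one conventional rule.
b0_pcr, beta_pcr = pcr_fit(X, y, keep=[0, 1])
print("all components:", beta_full)
print("two components:", beta_pcr)
```

Which components to delete is exactly the question the paper addresses; the eigenvalue-based rule above is only the conventional baseline.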

Development of the Algorithm for Optimizing Wavelength Selection in Multiple Linear Regression

  • Hoeil Chung
    • Near Infrared Analysis / v.1 no.1 / pp.1-7 / 2000
  • A convenient algorithm for optimizing wavelength selection in multiple linear regression (MLR) has been developed. MOP (MLR Optimization Program) tests all possible MLR calibration models in a given spectral range and finds an optimal MLR model with external validation capability. MOP generates calibration models from all possible combinations of wavelengths and simultaneously calculates SEC (Standard Error of Calibration) and SEV (Standard Error of Validation) by predicting samples in a validation data set. With SEC and SEV determined, it then calculates another parameter called SAD (the sum of SEC, SEV, and the absolute difference between SEC and SEV: SAD = SEC + SEV + |SEC - SEV|). SAD is a useful parameter for finding an optimal calibration model without over-fitting, because it simultaneously evaluates SEC, SEV, and the difference in error between calibration and validation. The calibration model with the smallest SAD value is chosen as the optimum because its errors in calibration and validation are both minimal and similar in scale. To evaluate the capability of MOP, the determination of benzene content in unleaded gasoline was examined. MOP successfully found the optimal calibration model and showed better calibration and independent prediction performance than conventional MLR calibration.
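The exhaustive search described in this abstract can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the MOP program itself: the function names `rmse` and `select_by_sad` are hypothetical, SEC and SEV are taken as plain root-mean-square errors (the paper may use a degrees-of-freedom correction), and the spectra are synthetic.

```python
from itertools import combinations
import numpy as np

def rmse(residuals):
    """Root-mean-square error, used here as a stand-in for SEC and SEV."""
    return float(np.sqrt(np.mean(residuals ** 2)))

def select_by_sad(X_cal, y_cal, X_val, y_val, n_wavelengths):
    """Try every combination of columns and return the one with minimal SAD.

    Returns (sad, columns, sec, sev) for the best combination.
    """
    best = None
    for cols in combinations(range(X_cal.shape[1]), n_wavelengths):
        idx = list(cols)
        # MLR with intercept, fitted on the calibration set only.
        A_cal = np.column_stack([np.ones(len(y_cal)), X_cal[:, idx]])
        coef, *_ = np.linalg.lstsq(A_cal, y_cal, rcond=None)
        A_val = np.column_stack([np.ones(len(y_val)), X_val[:, idx]])
        sec = rmse(y_cal - A_cal @ coef)
        sev = rmse(y_val - A_val @ coef)
        sad = sec + sev + abs(sec - sev)
        if best is None or sad < best[0]:
            best = (sad, cols, sec, sev)
    return best

# Synthetic "spectra": 8 wavelengths, of which columns 1 and 4 carry the signal.
rng = np.random.default_rng(0)
X = rng.normal(size=(60, 8))
y = 2.0 * X[:, 1] - 1.5 * X[:, 4] + 0.01 * rng.normal(size=60)
sad, cols, sec, sev = select_by_sad(X[:40], y[:40], X[40:], y[40:], n_wavelengths=2)
print("selected columns:", cols, "SEC:", sec, "SEV:", sev, "SAD:", sad)
```

Note that SEC + SEV + |SEC - SEV| equals twice the larger of SEC and SEV, so minimizing SAD amounts to minimizing the worse of the two errors. The number of combinations grows quickly with the number of wavelengths, so a search like this is only practical for small subsets or narrow spectral ranges.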

Motion estimation method using multiple linear regression model (다중선형회귀모델을 이용한 움직임 추정방법)

  • 김학수;임원택;이재철;이규원;박규택
    • Journal of the Korean Institute of Telematics and Electronics S / v.34S no.10 / pp.98-103 / 1997
  • Given the small bit allocation for motion information in very low bit-rate coding, motion estimation using the block matching algorithm (BMA) fails to maintain an acceptable level of prediction error. The reason is that the motion model, or spatial transformation, assumed in block matching cannot approximate real-world motion precisely with a small number of parameters. To overcome this drawback of the conventional block matching algorithm, several triangle-based methods that use triangular patches instead of blocks have been proposed. To estimate the motion of image sequences, these methods have usually been based on a combination of the optical flow equation, the affine transform, and iteration, but their computational cost is high. This paper presents a fast motion estimation algorithm that uses a multiple linear regression model to address the defects of the BMA and the triangle-based methods. After describing the basic 2-D triangle-based method, the details of the proposed multiple linear regression model are presented along with the motion estimation results from one standard video sequence, representative of MPEG-4 class A data. The simulation results show that, with the proposed method, the average PSNR improves by about 1.24 dB compared with the BMA method, and the computational cost is reduced by about 25% compared with the 2-D triangle-based method.


Subset selection in multiple linear regression: An improved Tabu search

  • Bae, Jaegug;Kim, Jung-Tae;Kim, Jae-Hwan
    • Journal of Advanced Marine Engineering and Technology / v.40 no.2 / pp.138-145 / 2016
  • This paper proposes an improved tabu search method for subset selection in multiple linear regression models. Variable selection is a vital combinatorial optimization problem in multivariate statistics. Selecting the optimal subset of variables is necessary to construct a reliable multiple linear regression model. Its applications range widely, from machine learning, time-series prediction, and multi-class classification to noise detection. Since this problem is NP-complete, finding the optimal solution becomes more difficult as the number of variables increases. Two typical metaheuristic methods have been developed to tackle the problem: the tabu search algorithm and a hybrid genetic and simulated annealing algorithm. However, both methods have shortcomings. The tabu search method requires a large amount of computing time, and the hybrid algorithm produces a less accurate solution. To overcome these shortcomings, we propose an improved tabu search algorithm that reduces the moves of the neighborhood and adopts an effective move search strategy. To evaluate the performance of the proposed method, comparative studies are performed on small data sets from the literature and on large simulated data sets. Computational results show that the proposed method outperforms the two metaheuristic methods in terms of computing time and solution quality.
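For readers unfamiliar with tabu search in this setting, a plain textbook version for subset selection might look like the sketch below. It is not the improved algorithm proposed in the paper, whose neighborhood reduction and move strategy the abstract does not detail. The scoring criterion (BIC), the single-variable flip move, the tenure value, and the names `bic` and `tabu_search` are all assumptions chosen for illustration.

```python
import numpy as np

def bic(X, y, subset):
    """BIC of an intercept-plus-subset least squares fit (lower is better)."""
    n = len(y)
    A = np.column_stack([np.ones(n)] + [X[:, j] for j in subset])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    rss = float(np.sum((y - A @ coef) ** 2))
    return n * np.log(rss / n) + A.shape[1] * np.log(n)

def tabu_search(X, y, n_iter=50, tenure=3):
    """Basic tabu search: each move adds or removes a single variable."""
    p = X.shape[1]
    current = frozenset()
    best, best_score = current, bic(X, y, sorted(current))
    tabu = {}  # variable index -> last iteration at which flipping it is forbidden
    for it in range(n_iter):
        candidates = []
        for j in range(p):
            neighbor = current ^ {j}          # flip variable j in or out
            score = bic(X, y, sorted(neighbor))
            # Skip tabu moves unless they beat the best score (aspiration).
            if tabu.get(j, -1) >= it and score >= best_score:
                continue
            candidates.append((score, j, neighbor))
        if not candidates:
            continue
        # Take the best admissible move, even if it worsens the score.
        score, j, current = min(candidates, key=lambda c: (c[0], c[1]))
        tabu[j] = it + tenure
        if score < best_score:
            best, best_score = current, score
    return sorted(best), best_score

# Synthetic data: only columns 0 and 3 influence the response.
rng = np.random.default_rng(1)
X = rng.normal(size=(100, 6))
y = 3.0 * X[:, 0] - 2.0 * X[:, 3] + 0.5 * rng.normal(size=100)
selected, score = tabu_search(X, y)
print("selected variables:", selected, "BIC:", score)
```

Accepting the best admissible move even when it worsens the score is what lets the search leave local optima, and the tabu list keeps it from immediately undoing that move.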

Predicting the Soluble Solids of Apples by Near Infrared Spectroscopy (I) - Multiple Linear Regression Models - (근적외선을 이용한 사과의 당도예측 (I) - 다중회귀모델 -)

  • ;W. R. Hruschka;J. A. Abbott;;B. S. Park
    • Journal of Biosystems Engineering / v.23 no.6 / pp.561-570 / 1998
  • MLR (Multiple Linear Regression) models for estimating soluble solids content non-destructively are presented, with the aim of selecting an optimal photosensor for measuring the soluble solids content of apples. Visible and NIR absorbance in the 400 to 2498 nanometer (nm) wavelength region, soluble solids content (sugar content), hardness, and weight were measured for 400 apples (Gala). A spectrophotometer with a fiber optic probe was used for spectrum measurement and a digital refractometer for soluble solids content. The correlation between the absorbance spectrum and soluble solids content was analyzed to pick out the optimal wavelengths and to develop the corresponding prediction models by means of MLR. For the coefficient of determination ($R^2$) to exceed 0.92, the MLR models of the original absorbance were built on 7 wavelengths (992, 904, 1096, 1032, 880, 824, and 1048 nm), and those of the second derivative absorbance on 5 wavelengths (784, 1056, 992, 808, and 872 nm). The best model of the second derivative absorbance spectrum had $R^2$ = 0.91, bias = -0.02 °Bx, and SEP = 0.28 °Bx for unknown samples.


Estimation of Soil Moisture Using Multiple Linear Regression Model and COMS Land Surface Temperature Data (다중선형 회귀모형과 천리안 지면온도를 활용한 토양수분 산정 연구)

  • Lee, Yong Gwan;Jung, Chung Gil;Cho, Young Hyun;Kim, Seong Joon
    • Journal of The Korean Society of Agricultural Engineers / v.59 no.1 / pp.11-20 / 2017
  • This study estimates spatial soil moisture using a multiple linear regression model (MLRM) and 15-minute-interval Land Surface Temperature (LST) data from the Communication, Ocean and Meteorological Satellite (COMS). For the modeling, input data consisting of COMS LST, Terra MODIS Normalized Difference Vegetation Index (NDVI), daily rainfall, and sunshine hours were considered and prepared. Using soil moisture data observed at 9 stations of the Automated Agriculture Observing System (AAOS) from January 2013 to May 2015, MLRMs were developed for twelve scenarios of input component combinations. The model results showed that the correlation between observed and modelled soil moisture increased when antecedent rainfall before the soil moisture simulation day was used. In addition, the correlation increased further when the model coefficients were evaluated on a seasonal basis. This was due to the inverse correlation between MODIS NDVI and soil moisture in spring and autumn.

Tilling Load Characteristics and Power Requirement for Rotary Tillers (로우터리 경운(耕耘)의 부하특성(負荷特性) 및 소요동력(所要動力)에 관(関)한 연구(硏究))

  • Choi, Kyu Hong;Ryu, Kwan Hee
    • Journal of Biosystems Engineering / v.9 no.2 / pp.27-36 / 1984
  • This study was carried out to investigate the effects of tilling depth, travel speed, and soil shear stress on the tilling load characteristics and power requirement of rotary tillers. The results obtained from the study are summarized as follows. 1. The average and maximum PTO torque increased as the tilling depth, tilling pitch, and soil shear stress increased. A multiple linear regression equation to estimate the average PTO torque in terms of the above parameters was developed. 2. The ratios of maximum PTO torque to average torque were in the range of 1.17 to 1.65 for the various tilling conditions tested. The variation in PTO torque increased greatly as the tilling pitch and soil shear stress increased, but decreased as the tilling depth increased. 3. The power requirement for the PTO shaft increased with the tilling depth, travel speed, and soil shear stress, but decreased slightly as the tilling pitch increased. A multiple linear regression equation to estimate the power requirement for the PTO shaft in terms of the above parameters was developed. 4. The specific power requirement for the rotary tiller was in the range of 0.008 to 0.015 $\mathrm{ps/cm^2}$ for the various tilling conditions tested. The specific tilling capacity decreased as the tilling depth and soil shear stress increased, but increased with the tilling pitch. A multiple linear regression equation to estimate the specific tilling capacity in terms of the above parameters was developed.


Development of a Multiple Linear Regression Model to Analyze Traffic Volume Error Factors in Radar Detectors

  • Kim, Do Hoon;Kim, Eung Cheol
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.39 no.5 / pp.253-263 / 2021
  • Traffic data collected using advanced equipment are highly valuable for traffic planning and efficient road operation. However, there is a problem regarding the reliability of the analysis results due to equipment defects, errors in the data aggregation process, and missing data. Unlike other detectors installed for each vehicle lane, radar detectors can yield different error types because they detect all traffic volume in multilane two-way roads via a single installation external to the roadway. For the traffic data of a radar detector to be representative of reliable data, the error factors of the radar detector must be analyzed. This study presents a field survey of variables that may cause errors in traffic volume collection by targeting the points where radar detectors are installed. Video traffic data are used to determine the errors in traffic measured by a radar detector. This study establishes three types of radar detector traffic errors, i.e., artificial, mechanical, and complex errors. Among these types, it is difficult to determine the cause of the errors due to several complex factors. To solve this problem, this study developed a radar detector traffic volume error analysis model using a multiple linear regression model. The results indicate that the characteristics of the detector, road facilities, geometry, and other traffic environment factors affect errors in traffic volume detection.

Development of prediction methodology from CO2 emissions of construction equipment based multiple linear regression (다중선형회귀분석 기반 건설장비 이산화탄소 배출량 예측모델 개발)

  • Gwon, Jae-Min;Lee, Jae-Hak;Jo, Min-Do;Choi, Young-Jun;Han, Seung-Woo
    • Proceedings of the Korean Institute of Building Construction Conference / 2019.11a / pp.38-39 / 2019
  • Environmental problems caused by greenhouse gases (GHG) emitted by various industries are emerging around the world, and countries are applying regulations accordingly. Korea operates a carbon credit system under which industrial GHG emissions are traded for money, and this system is expected to be applied to the construction industry. In addition, construction equipment using fossil fuels accounts for the largest portion of $CO_2$ emissions in the construction industry, and the importance of $CO_2$ reduction and prediction is increasing. However, directly measured data on the $CO_2$ emissions of construction equipment are lacking, and there is no accurate methodology for measuring them. Therefore, in this study, independent variables were derived from $CO_2$ emission data. In addition, multiple linear regression was performed on the independent variables to derive a predictive model of carbon dioxide emissions by work type of construction equipment. It is expected that construction process plans that account for environmental factors can be established in the construction industry in the future.


Correlation Analysis of Reservoir Water Quality with respect to Land Use Types of Watersheds (유역 토지이용과 저수지 수질의 상관관계 분석)

  • Youn, Dong-Koun;Chung, Sang-Ok
    • Current Research on Agriculture and Life Sciences / v.24 / pp.49-53 / 2006
  • The objective of this study was to present regression equations relating reservoir water quality to the land use types of the watersheds. To derive the regression equations, a multiple linear regression analysis was performed using observed data from 88 reservoirs in Kyungpook Province. The measured values of BOD, COD, T-N, and T-P were correlated with the areas of the land use types. A total of 23 regression equations were obtained for all the water quality items and watershed sizes. The results showed that 2 regression equations have a multiple correlation coefficient (MCC) above 0.90, 10 have MCC values from 0.70 to 0.90, 9 have MCC values from 0.40 to 0.70, and 2 have MCC values from 0.20 to 0.40. The results of this study can be used to estimate reservoir water quality simply and quickly in the planning phase.
