• Title/Abstract/Keywords: multiple linear regression models

318 search results, processing time 0.027 s

A Technique to Improve the Fit of Linear Regression Models for Successive Sets of Data

  • Park, Sung H.
    • Journal of the Korean Statistical Society
    • /
    • Vol. 5, No. 1
    • /
    • pp.19-28
    • /
    • 1976
  • In empirical studies that fit a multiple linear regression model to successive cross-sectional data observed on the same set of independent variables over several time periods, one often faces the problem of a poor $R^2$, the multiple coefficient of determination, which provides a standard measure of how well a specified regression line fits the sample data.

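For reference, the coefficient of determination mentioned in this abstract is conventionally defined as below. The symbols $y_i$ (observed value), $\hat{y}_i$ (fitted value), $\bar{y}$ (sample mean), and $n$ (sample size) are notation assumed here for illustration and are not taken from the paper.

$$
R^2 = 1 - \frac{\sum_{i=1}^{n} (y_i - \hat{y}_i)^2}{\sum_{i=1}^{n} (y_i - \bar{y})^2}
$$

A value near 1 means the fitted model accounts for most of the variation in the sample, and a value near 0 corresponds to the poor fit the abstract describes.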

Comparison of Different Multiple Linear Regression Models for Real-time Flood Stage Forecasting

  • 최승용;한건연;김병현
    • 대한토목학회논문집
    • /
    • Vol. 32, No. 1B
    • /
    • pp.9-20
    • /
    • 2012
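  • Recently, to overcome the shortcomings of conceptual, hydrological, and physically based models for stage forecasting, the multiple linear regression model, one of the data-driven models, has been widely adopted for flood forecasting. The purpose of this study is to compare the flood forecasting performance of multiple linear regression models under different methods of estimating the regression coefficients, and thereby to build an appropriate multiple regression flood forecasting model. To this end, the time scale of the independent variables was determined through autocorrelation analysis of the input data. Flood forecasting models were then built using three different regression coefficient estimation methods (ordinary least squares, weighted least squares, and stepwise selection) and applied to various flood events in the Jungnangcheon basin. Four statistical indices were used to evaluate the models: root mean square error, Nash-Sutcliffe efficiency coefficient, mean absolute error, and adjusted coefficient of determination. The results showed that the multiple linear regression flood forecasting model using stepwise selection gave the most accurate forecasts, and that the model using ordinary least squares gave somewhat better forecasts than the model using weighted least squares.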
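The four evaluation indices named in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the function name `forecast_metrics`, the sample stage values, and the choice to build the adjusted coefficient of determination on $R^2 = 1 - SSE/SST$ are all assumptions made here.

```python
import numpy as np

def forecast_metrics(observed, predicted, n_predictors):
    """Return RMSE, Nash-Sutcliffe efficiency, MAE, and adjusted R^2."""
    obs = np.asarray(observed, dtype=float)
    pred = np.asarray(predicted, dtype=float)
    n = obs.size
    err = obs - pred
    sse = float(np.sum(err ** 2))                   # sum of squared errors
    sst = float(np.sum((obs - obs.mean()) ** 2))    # total sum of squares
    rmse = float(np.sqrt(sse / n))
    mae = float(np.mean(np.abs(err)))
    nse = 1.0 - sse / sst                           # Nash-Sutcliffe efficiency
    # Assumption: adjusted R^2 is built on R^2 = 1 - SSE/SST, which is
    # numerically the same quantity as the NSE computed above.
    adj_r2 = 1.0 - (1.0 - nse) * (n - 1) / (n - n_predictors - 1)
    return {"rmse": rmse, "nse": nse, "mae": mae, "adj_r2": adj_r2}

# Hypothetical stage values in metres, not data from the paper.
observed = [1.0, 2.0, 3.0, 4.0, 5.0]
predicted = [1.1, 1.9, 3.2, 3.8, 5.0]
metrics = forecast_metrics(observed, predicted, n_predictors=2)
print(metrics)
```

Lower RMSE and MAE and higher efficiency and adjusted $R^2$ indicate a better forecast, so the four indices can be compared side by side across models fitted with different coefficient estimation methods.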

Evaluation of applicability of pan coefficient estimation method by multiple linear regression analysis

  • 임창수
    • 한국수자원학회논문집
    • /
    • Vol. 55, No. 3
    • /
    • pp.229-243
    • /
    • 2022
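  • The effect of monthly meteorological data from 11 weather stations in Korea on the pan coefficient was analyzed, and the applicability of four types of multiple linear regression models for estimating the pan coefficient was examined. To assess the applicability of the developed pan coefficient models, they were compared with six models previously proposed by other researchers. At the 11 stations, the pan coefficient was most strongly affected by air temperature in January, February, March, July, November, and December, and by solar radiation in the other months. Overall, in all months, wind speed and relative humidity did not greatly affect the pan coefficient compared with air temperature or solar radiation. Across all stations and months, the model derived for each station using five independent variables (wind speed, relative humidity, air temperature, the ratio of sunshine duration to possible sunshine duration, and solar radiation) gave the best evaporation estimates. According to the model verification results, estimating the pan coefficient by multiple linear regression analysis is judged to be applicable only to a limited extent, in some regions and months.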

Optimize OTDOA-based Positioning Accuracy by Utilizing Multiple Linear Regression Model under NB-IoT Technology

  • 판이첸;김재수
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • Proceedings of the 62nd Summer Conference of the Korean Society of Computer Information, 2020, Vol. 28, No. 2
    • /
    • pp.139-142
    • /
    • 2020
  • NB-IoT (Narrow Band Internet of Things) is an emerging LPWAN (Low Power Wide Area Network) radio technology. NB-IoT has many advantages, such as low power, low cost, and high coverage. However, its low bandwidth and low sampling rate also lead to poor positioning accuracy. This paper proposes a solution that optimizes positioning accuracy under the OTDOA (Observed Time Difference of Arrival) approach by utilizing MLR (Multiple Linear Regression) models. The MLR model predicts the degree to which weather (temperature, humidity, light intensity, and air pressure) influences the arrival time of signal transmission, in order to improve measurement accuracy. Improved measurement accuracy can greatly benefit IoT applications based on NB-IoT.


Drawbead Model for 3-Dimensional Finite Element Analysis of Sheet Metal Forming Processes

  • 금영탁;김준환;차지혜
    • 소성∙가공
    • /
    • Vol. 11, No. 5
    • /
    • pp.394-404
    • /
    • 2002
  • A drawbead model for three-dimensional finite element analysis of sheet metal forming processes is developed. Mathematical models of basic drawbeads, such as the circular drawbead, stepped drawbead, and squared drawbead, are first derived using bending theory, the belt-pulley equation, and the Coulomb friction law. Next, experiments are performed to find the drawing characteristics of the drawbeads. Based on the mathematical models and drawing test results, expert models of the basic drawbeads are then developed using a linear multiple regression method. For the expert models of combined drawbeads, such as the double circular drawbead, double stepped drawbead, and circular-and-stepped drawbead, those of the basic drawbeads are summed. Finally, to verify the expert models, the drawing characteristics calculated by the expert models of the double circular drawbead and circular-and-stepped drawbead are compared with those obtained from the experiments. The predictions of the expert models agree well with the experimental measurements.

Study on the Critical Storm Duration Decision of the Rivers Basin

  • 안승섭;이효정;정도준
    • 한국환경과학회지
    • /
    • Vol. 16, No. 11
    • /
    • pp.1301-1312
    • /
    • 2007
  • The objective of this study is to propose a model for forecasting the critical storm duration of storm runoff in small river basins. Critical storm duration data for 582 sub-basins, drawn from disaster impact assessment reports of the National Emergency Management Agency for the period 2004 to 2007, were collected and analyzed. The stepwise multiple regression method was used to establish critical storm duration forecasting models of linear and exponential type. The multiple regression results favored the linear type over the exponential type. Multiple linear regression between the critical storm duration and five basin characteristic parameters (basin area, main stream length, average slope of the main stream, shape factor, and CN) gave a multiple correlation coefficient above 0.75.
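A comparison of a linear-type and an exponential-type regression model, as described in this abstract, might look like the sketch below. Several things here are assumptions rather than details from the paper: the exponential type is taken to be $y = \exp(b_0 + b_1 x_1 + \dots)$ fitted by least squares on $\ln y$, the multiple correlation coefficient is computed as the correlation between observed and fitted values, no stepwise selection is performed, and the two predictors and all data are synthetic.

```python
import numpy as np

def fit_linear(X, y):
    """Ordinary least squares with an intercept; returns (coefficients, fitted values)."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef, A @ coef

def fit_exponential(X, y):
    """Assumed form y = exp(b0 + b1*x1 + ...), fitted on ln(y); requires y > 0."""
    coef, log_fitted = fit_linear(X, np.log(y))
    return coef, np.exp(log_fitted)

def multiple_correlation(y, fitted):
    """Correlation between observed and fitted values."""
    return float(np.corrcoef(y, fitted)[0, 1])

# Synthetic data with two hypothetical basin characteristics.
rng = np.random.default_rng(0)
X = rng.uniform(1.0, 10.0, size=(50, 2))
y = 2.0 + 0.5 * X[:, 0] + 1.5 * X[:, 1]   # exactly linear by construction

_, fitted_lin = fit_linear(X, y)
_, fitted_exp = fit_exponential(X, y)
r_lin = multiple_correlation(y, fitted_lin)
r_exp = multiple_correlation(y, fitted_exp)
print(r_lin, r_exp)
```

Because the synthetic response is linear by construction, the linear type attains the higher correlation here; on real basin data either form could win, which is why both are fitted and compared.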

Predicting the Soluble Solids of Apples by Near Infrared Spectroscopy (I) - Multiple Linear Regression Models -

  • 이강진;;;노상하
    • Journal of Biosystems Engineering
    • /
    • Vol. 23, No. 6
    • /
    • pp.561-570
    • /
    • 1998
  • MLR (Multiple Linear Regression) models for estimating soluble solids content non-destructively are presented, with the aim of selecting the optimal photosensor for measuring the soluble solids content of apples. Visible and NIR absorbance in the 400 to 2498 nanometer (nm) wavelength region, soluble solids content (sugar content), hardness, and weight were measured for 400 apples (Gala). A spectrophotometer with a fiber optic probe was used for spectrum measurement, and a digital refractometer was used for soluble solids content. The correlation between the absorbance spectrum and soluble solids content was analyzed to pick out the optimal wavelengths and to develop the corresponding prediction models by MLR. For the coefficient of determination ($R^2$) to exceed 0.92, the MLR models built on the original absorbance used 7 wavelengths (992, 904, 1096, 1032, 880, 824, and 1048 nm), and those built on the second derivative absorbance used 5 wavelengths (784, 1056, 992, 808, and 872 nm). The best model based on the second derivative absorbance spectrum had $R^2$ = 0.91, bias = -0.02 bx, and SEP = 0.28 bx for unknown samples.


A Study on Predictive Models based on the Machine Learning for Evaluating the Extent of Hazardous Zone of Explosive Gases

  • 정용재;이창준
    • Korean Chemical Engineering Research
    • /
    • Vol. 58, No. 2
    • /
    • pp.248-256
    • /
    • 2020
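  • In this study, a predictive model was developed for the extent of the gas explosion hazardous zone, which is needed for installing explosion-proof equipment in hazardous locations. To this end, 1,200 hazardous-zone data points were generated for 12 flammable gases. The extent of the gas explosion hazardous zone was set as the output variable, and the 12 variables required in the data generation process were set as input variables. Predictive models were developed using multiple regression, principal component regression, and artificial neural networks. Comparing the predictive performance of the models, the mean absolute percentage error (MAPE) was 44.2%, 49.3%, and 5.7%, respectively, and the root mean square error (RMSE) was 1.389 m, 1.602 m, and 0.203 m. The results showed that the artificial neural network performed best and is the most suitable model for predicting the extent of the gas explosion hazardous zone.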
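Two of the three model families in this abstract, multiple regression and principal component regression, can be compared with MAPE and RMSE roughly as follows. This is a sketch under assumptions: the data are synthetic (four inputs rather than the paper's twelve), the principal components come from an SVD of the centered training inputs, and the helper names are hypothetical. The neural network is omitted.

```python
import numpy as np

def mape(y, pred):
    """Mean absolute percentage error in percent; assumes no zero targets."""
    return float(np.mean(np.abs((y - pred) / y)) * 100.0)

def rmse(y, pred):
    return float(np.sqrt(np.mean((y - pred) ** 2)))

def ols_predict(X_train, y_train, X_test):
    """Multiple regression with an intercept, fitted by least squares."""
    A = np.column_stack([np.ones(len(y_train)), X_train])
    coef, *_ = np.linalg.lstsq(A, y_train, rcond=None)
    return np.column_stack([np.ones(len(X_test)), X_test]) @ coef

def pcr_predict(X_train, y_train, X_test, n_components):
    """Principal component regression: project onto leading components, then OLS."""
    mean = X_train.mean(axis=0)
    # Rows of vt are the principal directions of the centered training inputs.
    _, _, vt = np.linalg.svd(X_train - mean, full_matrices=False)
    V = vt[:n_components].T
    return ols_predict((X_train - mean) @ V, y_train, (X_test - mean) @ V)

# Synthetic data; the coefficients and noise level are arbitrary.
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 4))
y = 30.0 + X @ np.array([1.0, -2.0, 0.5, 3.0]) + 0.1 * rng.normal(size=200)
X_train, X_test, y_train, y_test = X[:150], X[150:], y[:150], y[150:]

pred_ols = ols_predict(X_train, y_train, X_test)
pred_pcr_2 = pcr_predict(X_train, y_train, X_test, n_components=2)
pred_pcr_full = pcr_predict(X_train, y_train, X_test, n_components=4)
print(mape(y_test, pred_ols), rmse(y_test, pred_ols))
print(mape(y_test, pred_pcr_2), rmse(y_test, pred_pcr_2))
```

When all components are kept, principal component regression reproduces the ordinary regression predictions, so any difference between the two comes from the components that are dropped.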

LACTATION CURVE OF HOLSTEIN FRIESIAN COWS IN THE KINGDOM OF SAUDI ARABIA

  • Ali, A.K.A.;Al-Jumaah, R.S.;Hayes, E.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • Vol. 9, No. 4
    • /
    • pp.439-447
    • /
    • 1996
  • Monthly test-day production data, comprising 12,020 records, were collected from six of the largest specialized dairy farms in the central region of the Kingdom of Saudi Arabia. The records described lactating cows in four parities and two seasons of calving. Monthly test-day records were fitted using Wood's model $At^{b}e^{-ct}$ with multiplicative and additive error terms. Linear and non-linear regression models were used to estimate the parameters needed to draw the lactation curves. The lactation curves of the different parities showed that the third lactation had the highest peak (43.08 kg for the linear regression model and 42.08 kg for the non-linear regression model). The fourth lactation had the lowest peak (24.00 kg for the linear regression model and 25.64 kg for the non-linear regression model). Cows in their second and third lactations reached the peak at day 58 under both the linear and non-linear regression models. Cows in their first lactation were more persistent and peaked late, at days 68 and 67 for the two models, respectively. Third-lactation cows, by contrast, were less persistent and peaked early, at day 58 under both models. Cows that calved in winter months had higher starting values (A), a higher ascending slope (b), and a higher descending slope (c). Least squares means of milk yield for the first four parities and for the overall data were 6,653, 7,659, 7,482, 6,988, and 7,614 kg, respectively. The corresponding lactation periods were 358, 367, 350, 363, and 364 days, respectively.
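The linear-regression route to Wood's model can be sketched by taking logarithms, which gives $\ln y = \ln A + b \ln t - ct$, a model that is linear in $\ln A$, $b$, and $c$. Setting the derivative of $At^{b}e^{-ct}$ to zero puts the peak at $t = b/c$. The code below is a minimal illustration with hypothetical parameter values and noiseless synthetic yields; it is not the authors' procedure or data.

```python
import numpy as np

def fit_wood_loglinear(t, y):
    """Fit y = A * t**b * exp(-c*t) by least squares on ln y = ln A + b ln t - c t."""
    t = np.asarray(t, dtype=float)
    y = np.asarray(y, dtype=float)
    design = np.column_stack([np.ones_like(t), np.log(t), -t])
    (ln_a, b, c), *_ = np.linalg.lstsq(design, np.log(y), rcond=None)
    return float(np.exp(ln_a)), float(b), float(c)

# Hypothetical parameters and test days, chosen only for illustration.
true_a, true_b, true_c = 20.0, 0.2, 0.003
days = np.arange(10.0, 310.0, 30.0)
yields = true_a * days ** true_b * np.exp(-true_c * days)

a_hat, b_hat, c_hat = fit_wood_loglinear(days, yields)
peak_day = b_hat / c_hat                       # day of peak yield
peak_yield = a_hat * peak_day ** b_hat * np.exp(-c_hat * peak_day)
print(a_hat, b_hat, c_hat, peak_day, peak_yield)
```

Fitting on the log scale corresponds to a multiplicative error term, whereas a non-linear fit on the original scale corresponds to an additive one, which is one reason the two approaches can give slightly different peaks.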

Forecasting Energy Consumption of Steel Industry Using Regression Model

  • Sung-Ho KANG;Hyun-Ki KIM
    • Journal of Korea Artificial Intelligence Association
    • /
    • Vol. 1, No. 2
    • /
    • pp.21-25
    • /
    • 2023
  • The purpose of this study was to compare the performance of multiple regression models in predicting the energy consumption of the steel industry. Specific independent variables were selected in consideration of the correlation among various attributes, such as CO2 concentration, NSM, Week Status, Day of week, and Load Type, and preprocessing was performed to address the multicollinearity problem. In data preprocessing, linear and nonlinear relationships between the attributes were evaluated through correlation analysis. In particular, variables with high correlation were selected, and appropriate variables were included in the final model to prevent multicollinearity problems. Among the regression models trained, Boosted Decision Tree Regression showed the best predictive performance. Ensemble learning in this model effectively learned complex patterns while preventing overfitting by combining multiple decision trees. Consequently, these predictive models are expected to provide important information for improving energy efficiency and management decision-making in the steel industry. In the future, we plan to improve model performance by collecting more data and extending the variables, and to consider applying the model with interactions with external factors taken into account.
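One common way to screen predictors for multicollinearity is the variance inflation factor (VIF), sketched below. The abstract mentions correlation analysis but does not say that VIF was used, so this is an illustrative substitute rather than the authors' method; the function name and the synthetic predictors are assumptions.

```python
import numpy as np

def variance_inflation_factors(X):
    """VIF of each column: 1 / (1 - R^2) from regressing it on the other columns."""
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    vifs = []
    for j in range(p):
        target = X[:, j]
        others = np.column_stack([np.ones(n), np.delete(X, j, axis=1)])
        coef, *_ = np.linalg.lstsq(others, target, rcond=None)
        resid = target - others @ coef
        r2 = 1.0 - (resid @ resid) / np.sum((target - target.mean()) ** 2)
        vifs.append(1.0 / (1.0 - r2))
    return np.array(vifs)

# Synthetic predictors: x2 is nearly a copy of x1, x3 is independent.
rng = np.random.default_rng(2)
x1 = rng.normal(size=200)
x2 = x1 + 0.1 * rng.normal(size=200)
x3 = rng.normal(size=200)
vifs = variance_inflation_factors(np.column_stack([x1, x2, x3]))
print(vifs)
```

A frequently quoted rule of thumb treats a VIF above about 10 as a sign of problematic collinearity, in which case one of the offending predictors is usually dropped or combined with another.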