• 제목/요약/키워드: Multiple regression model

검색결과 2,523건 처리시간 0.031초

로터리 사고발생 위치별 사고모형 개발 (Developing Accident Models of Rotary by Accident Occurrence Location)

  • 나희;박병호
    • 한국도로학회논문집
    • /
    • 제14권4호
    • /
    • pp.83-91
    • /
    • 2012
  • PURPOSES : This study deals with Rotary by Accident Occurrence Location. The purpose of this study is to develop the accident models of rotary by location. METHODS : In pursuing the above, this study gives particular attentions to developing the appropriate models using multiple linear, Poisson and negative binomial regression models and statistical analysis tools. RESULTS : First, four multiple linear regression models which are statistically significant(their $R^2$ values are 0.781, 0.300, 0.784 and 0.644 respectively) are developed, and four Poisson regression models which are statistically significant(their ${\rho}^2$ values are 0.407, 0.306, 0.378 and 0.366 respectively) are developed. Second, the test results of fitness using RMSE, %RMSE, MPB and MAD show that Poisson regression model in the case of circulatory roadway, pedestrian crossing and others and multiple linear regression model in the case of entry/exit sections are appropriate to the given data. Finally, the common variable that affects to the accident is adopted to be traffic volume. CONCLUSIONS : 8 models which are all statistically significant are developed, and the common and specific variables that are related to the models are derived.

다중회귀모형을 이용한 104주 주 최대 전력수요예측 (Weekly Maximum Electric Load Forecasting Method for 104 Weeks Using Multiple Regression Models)

  • 정현우;김시연;송경빈
    • 전기학회논문지
    • /
    • 제63권9호
    • /
    • pp.1186-1191
    • /
    • 2014
  • Weekly and monthly electric load forecasting are essential for the generator maintenance plan and the systematic operation of the electric power reserve. This paper proposes the weekly maximum electric load forecasting model for 104 weeks with the multiple regression model. Input variables of the multiple regression model are temperatures and GDP that are highly correlated with electric loads. The weekly variable is added as input variable to improve the accuracy of electric load forecasting. Test results show that the proposed algorithm improves the accuracy of electric load forecasting over the seasonal autoregressive integrated moving average model. We expect that the proposed algorithm can contribute to the systematic operation of the power system by improving the accuracy of the electric load forecasting.

NB-IoT 기술에서 Multiple Linear Regression Model을 활용하여 OTDOA 기반 포지셔닝 정확도 최적화 (Optimize OTDOA-based Positioning Accuracy by Utilizing Multiple Linear Regression Model under NB-IoT Technology)

  • 판이첸;김재수
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회 2020년도 제62차 하계학술대회논문집 28권2호
    • /
    • pp.139-142
    • /
    • 2020
  • NB-IoT(Narrow Band Internet of Things) is an emerging LPWAN(Low Power Wide Area Network) radio technology. NB-IoT has many advantages like low power, low cost, and high coverage. However low bandwidth and low sampling rates also lead to poor positioning accuracy. This paper proposed a solution to optimize positioning accuracy under the OTDOA(Observed Time Difference of Arrival) approach by utilizing MLR(Multiple Linear Regression) models. Through the MLR model to predict the influence degree of weather(temperature, humidity, light intensity and air pressure) on the arrival time of signal transmission to improve the measurement accuracy. The improvement of measurement accuracy can greatly improve IoT applications based on NB-IoT.

  • PDF

A Comparison of Construction Cost Estimation Using Multiple Regression Analysis and Neural Network in Elementary School Project

  • Cho, Hong-Gyu;Kim, Kyong-Gon;Kim, Jang-Young;Kim, Gwang-Hee
    • 한국건축시공학회지
    • /
    • 제13권1호
    • /
    • pp.66-74
    • /
    • 2013
  • In the early stages of a construction project, the most important thing is to predict construction costs in a rational way. For this reason, many studies have been performed on the estimation of construction costs for apartment housing and office buildings at early stage using artificial intelligence, statistics, and the like. In this study, cost data held by a provincial Office of Education on elementary schools constructed from 2004 to 2007 were used to compare the multiple regression model with an artificial neural network model. A total of 96 historical data were classified into 76 historical data for constructing models and 20 historical data for comparing the constructed regression model with the artificial neural network model. The results of an analysis of predicted construction costs were that the error rate of the artificial neural network model is lower than that of the multiple regression model.

MULTIPLE OUTLIER DETECTION IN LOGISTIC REGRESSION BY USING INFLUENCE MATRIX

  • Lee, Gwi-Hyun;Park, Sung-Hyun
    • Journal of the Korean Statistical Society
    • /
    • 제36권4호
    • /
    • pp.457-469
    • /
    • 2007
  • Many procedures are available to identify a single outlier or an isolated influential point in linear regression and logistic regression. But the detection of influential points or multiple outliers is more difficult, owing to masking and swamping problems. The multiple outlier detection methods for logistic regression have not been studied from the points of direct procedure yet. In this paper we consider the direct methods for logistic regression by extending the $Pe\tilde{n}a$ and Yohai (1995) influence matrix algorithm. We define the influence matrix in logistic regression by using Cook's distance in logistic regression, and test multiple outliers by using the mean shift model. To show accuracy of the proposed multiple outlier detection algorithm, we simulate artificial data including multiple outliers with masking and swamping.

시계열모형을 이용한 굴 생산량 예측 가능성에 관한 연구 (A Study on Forecast of Oyster Production using Time Series Models)

  • 남종오;노승국
    • Ocean and Polar Research
    • /
    • 제34권2호
    • /
    • pp.185-195
    • /
    • 2012
  • This paper focused on forecasting a short-term production of oysters, which have been farmed in Korea, with distinct periodicity of production by year, and different production level by month. To forecast a short-term oyster production, this paper uses monthly data (260 observations) from January 1990 to August 2011, and also adopts several econometrics methods, such as Multiple Regression Analysis Model (MRAM), Seasonal Autoregressive Integrated Moving Average (SARIMA) Model, and Vector Error Correction Model (VECM). As a result, first, the amount of short-term oyster production forecasted by the multiple regression analysis model was 1,337 ton with prediction error of 246 ton. Secondly, the amount of oyster production of the SARIMA I and II models was forecasted as 12,423 ton and 12,442 ton with prediction error of 11,404 ton and 11,423 ton, respectively. Thirdly, the amount of oyster production based on the VECM was estimated as 10,425 ton with prediction errors of 9,406 ton. In conclusion, based on Theil inequality coefficient criterion, short-term prediction of oyster by the VECM exhibited a better fit than ones by the SARIMA I and II models and Multiple Regression Analysis Model.

알루미늄 합금의 레이저 가공에서 인장 강도 예측을 위한 회귀 모델 및 신경망 모델의 개발 (Development of Statistical Model and Neural Network Model for Tensile Strength Estimation in Laser Material Processing of Aluminum Alloy)

  • 박영환;이세헌
    • 한국정밀공학회지
    • /
    • 제24권4호
    • /
    • pp.93-101
    • /
    • 2007
  • Aluminum alloy which is one of the light materials has been tried to apply to light weight vehicle body. In order to do that, welding technology is very important. In case of the aluminum laser welding, the strength of welded part is reduced due to porosity, underfill, and magnesium loss. To overcome these problems, laser welding of aluminum with filler wire was suggested. In this study, experiment about laser welding of AA5182 aluminum alloy with AA5356 filler wire was performed according to process parameters such as laser power, welding speed and wire feed rate. The tensile strength was measured to find the weldability of laser welding with filler wire. The models to estimate tensile strength were suggested using three regression models and one neural network model. For regression models, one was the multiple linear regression model, another was the second order polynomial regression model, and the other was the multiple nonlinear regression model. Neural network model with 2 hidden layers which had 5 and 3 nodes respectively was investigated to find the most suitable model for the system. Estimation performance was evaluated for each model using the average error rate. Among the three regression models, the second order polynomial regression model had the best estimation performance. For all models, neural network model has the best estimation performance.

의사방문수 결정요인 분석 (A Study on Factors Affecting the Use of Ambulatory Physician Services)

  • 박현애;송건용
    • 보건행정학회지
    • /
    • 제4권2호
    • /
    • pp.58-76
    • /
    • 1994
  • In order to study factors affecting the use of the ambulatory physician services. Andersen's model for health utilization was modified by adding the health behavior component and examined with three different approaches. Three different approaches were the multiople regression model, logistic regression model, and LISREL model. For multiple regression, dependent variable was reported illness-related visits to a physician during past one year and independent variables are variaous variables measuring predisposing factor, enabling factor, need factor and health behavior. For the logistic regression, dependent variable was visit or no-visit to a physician during past one year and independent variables were same as the multiple regression analysis. For the LISREL, five endogenous variables of health utiliztion, predisposing factor, enabling factor, need factor, and health behavior and 20 exogeneous variables which measures five endogenous variables were used. According to the multiple regression analysis, chronic illness, health status, perceived health status of the need factor; residence, sex, age, marital status, education of the predisposing factor ; health insurance, usual source for medical care of enabling factor were the siginificant exploratory variables for the health utilization. Out of the logistic regression analysis, health status, chronic illness, residence, marital status, education, drinking, use of health aid were found to be significant exploratory variables. From LISREL, need factor affect utilization most following by predisposing factor, enabling factor and health behavior. For LISREL model, age, education, and residence for predisposing factor; health status, chronic illess, and perceived health status for need factor; medical insurance for enabling factor; and doing any kind of health behavior for the health behavior were found as the significant observed variables for each theoretical variables.

  • PDF

다중 지역기후모델로부터 모의된 월 기온자료를 이용한 다중선형회귀모형들의 예측성능 비교 (Inter-comparison of Prediction Skills of Multiple Linear Regression Methods Using Monthly Temperature Simulated by Multi-Regional Climate Models)

  • 성민규;김찬수;서명석
    • 대기
    • /
    • 제25권4호
    • /
    • pp.669-683
    • /
    • 2015
  • In this study, we investigated the prediction skills of four multiple linear regression methods for monthly air temperature over South Korea. We used simulation results from four regional climate models (RegCM4, SNURCM, WRF, and YSURSM) driven by two boundary conditions (NCEP/DOE Reanalysis 2 and ERA-Interim). We selected 15 years (1989~2003) as the training period and the last 5 years (2004~2008) as validation period. The four regression methods used in this study are as follows: 1) Homogeneous Multiple linear Regression (HMR), 2) Homogeneous Multiple linear Regression constraining the regression coefficients to be nonnegative (HMR+), 3) non-homogeneous multiple linear regression (EMOS; Ensemble Model Output Statistics), 4) EMOS with positive coefficients (EMOS+). It is same method as the third method except for constraining the coefficients to be nonnegative. The four regression methods showed similar prediction skills for the monthly air temperature over South Korea. However, the prediction skills of regression methods which don't constrain regression coefficients to be nonnegative are clearly impacted by the existence of outliers. Among the four multiple linear regression methods, HMR+ and EMOS+ methods showed the best skill during the validation period. HMR+ and EMOS+ methods showed a very similar performance in terms of the MAE and RMSE. Therefore, we recommend the HMR+ as the best method because of ease of development and applications.

통계모형을 이용한 NO2 농도 예측에 관한 연구 (A study on Estimation of NO2 concentration by Statistical model)

  • 장난심
    • 한국환경과학회지
    • /
    • 제14권11호
    • /
    • pp.1049-1056
    • /
    • 2005
  • [ $NO_2$ ] concentration characteristics of Busan metropolitan city was analysed by statistical method using hourly $NO_2$ concentration data$(1998\~2000)$ collected from air quality monitoring sites of the metropolitan city. 4 representative regions were selected among air quality monitoring sites of Ministry of environment. Concentration data of $NO_2$, 5 air pollutants, and data collected at AWS was used. Both Stepwise Multiple Regression model and ARIMA model for prediction of $NO_2$ concentrations were adopted, and then their results were compared with observed concentration. While ARIMA model was useful for the prediction of daily variation of the concentration, it was not satisfactory for the prediction of both rapid variation and seasonal variation of the concentration. Multiple Regression model was better estimated than ARIMA model for prediction of $NO_2$ concentration.