• 제목/요약/키워드: multiple linear regression analysis

검색결과 1,120건 처리시간 0.025초

지역빈도해석 및 다중회귀분석을 이용한 산악형 강수해석 (Orographic Precipitation Analysis with Regional Frequency Analysis and Multiple Linear Regression)

  • 윤혜선;엄명진;조원철;허준행
    • 한국수자원학회논문집
    • /
    • 제42권6호
    • /
    • pp.465-480
    • /
    • 2009
  • 본 연구에서는 다중회귀분석을 이용하여 산악효과를 야기하는 지형인자와 강수와의 관계를 파악하였다. 섬 전체가 산악지형인 제주도의 연평균강수량과 지수홍수법으로 산출한 확률강우량을 강수자료로 사용하여 산악효과를 야기하는 지형인자로 선정한 고도, 위 경도와 회귀모형을 구성하였다. 회귀분석 결과 연평균강수량과 고도와의 선형관계가 확률강우량에서도 동일하게 나타났으며, 고도이외에 위도, 경도를 각각 추가인자로 고려할 경우 강우량과 더욱 강한 상관성을 보였다. 또한, 고도와 위도, 경도를 모두 고려한 회귀모형을 이용한 지형공간분석 결과 제주도의 실제 강수특성과 마찬가지로 남동부로 편중된 강수형태를 보여 모형의 적합성을 증명하였다. 그러나 지속시간 및 재현기간과 무관하게 높은 고도에서 회귀식의 유효성이 감소하므로, 높은 고도에서의 추가적인 산악효과인자의 강수량에 대한 영향이 존재될 것으로 판단되므로 추후 연구가 필요하다.

로터리 사고발생 위치별 사고모형 개발 (Developing Accident Models of Rotary by Accident Occurrence Location)

  • 나희;박병호
    • 한국도로학회논문집
    • /
    • 제14권4호
    • /
    • pp.83-91
    • /
    • 2012
  • PURPOSES : This study deals with Rotary by Accident Occurrence Location. The purpose of this study is to develop the accident models of rotary by location. METHODS : In pursuing the above, this study gives particular attentions to developing the appropriate models using multiple linear, Poisson and negative binomial regression models and statistical analysis tools. RESULTS : First, four multiple linear regression models which are statistically significant(their $R^2$ values are 0.781, 0.300, 0.784 and 0.644 respectively) are developed, and four Poisson regression models which are statistically significant(their ${\rho}^2$ values are 0.407, 0.306, 0.378 and 0.366 respectively) are developed. Second, the test results of fitness using RMSE, %RMSE, MPB and MAD show that Poisson regression model in the case of circulatory roadway, pedestrian crossing and others and multiple linear regression model in the case of entry/exit sections are appropriate to the given data. Finally, the common variable that affects to the accident is adopted to be traffic volume. CONCLUSIONS : 8 models which are all statistically significant are developed, and the common and specific variables that are related to the models are derived.

순수 성분의 물성 자료를 이용한 2성분계 혼합물의 인화점에 대한 다변량 통계 분석 및 예측 (Multivariate Statistical Analysis and Prediction for the Flash Points of Binary Systems Using Physical Properties of Pure Substances)

  • 이범석;김성영
    • 한국가스학회지
    • /
    • 제11권3호
    • /
    • pp.13-18
    • /
    • 2007
  • 다변량 통계 분석법(Multivariate statistical analysis method)의 대표적 방법인 다중 선형 회귀법(Multiple linear regression. MLR)을 이용하여 2성분계 혼합물의 인화점을 회귀 분석하고 예측하였다. 가연성 물질의 인화점에 대한 예측은 실제 화학 공정 설계에서 화재 및 폭발 위험성을 판단하는 중요한 부분 중의 하나이다. 본 연구에서는 순수 성분의 물성 자료만을 이용하여 2성분계 혼합물의 인화점 실험 자료에 대해 다중 선형 회귀법(MLR)을 수행하였고, 이를 이용하여 새로운 혼합물에 대한 인화점을 예측하였다. 2성분계 혼합물의 인화점에 대한 MLR의 회귀 성능과 새로운 혼합물에 대한 예측 성능을 알아보기 위해, 기존의 인화점 추정 방법인 Raoult의 법칙과 Van Laar식에 의한 추정값과 비교해 보았다.

  • PDF

임상의를 위한 다변량 분석의 실제 (Multivariate Analysis for Clinicians)

  • 오주한;정석원
    • Clinics in Shoulder and Elbow
    • /
    • 제16권1호
    • /
    • pp.63-72
    • /
    • 2013
  • 임상 의학의 연구에 사용되는 대표적 다변량 분석 방법은 다중 회귀 분석 방법인데, 이는 인과 관계를 토대로 여러 개의 변수에 의한 한꺼번에의 영향력을 분석하기 위한 방법이다. 다중 회귀 분석은 기본적으로 회귀 분석의 기본 가정을 만족해야 함은 물론, 여러 개의 독립 변수들이 포함되기 때문에 변수들을 모형에 포함시키는 방법 및 다중 공선성 문제에 대한 고려가 필요하다. 다중 회귀 분석 모형의 설명력은 결정 계수 $R^2$으로 표현되어 1에 가까울수록 설명력이 크며, 각 독립 변수들의 결과에의 영향력은 회귀 계수인 ${\beta}$값으로 표현된다. 다중 회귀 분석은 종속 변수의 형태에 따라 다중 선형 회귀 분석, 다중 로지스틱 회귀 분석, 콕스 회귀 분석으로 나눌 수 있다. 종속 변수가 연속 변수인 경우 다중 선형 회귀 분석, 범주형 변수인 경우 다중 로지스틱 회귀 분석, 시간의 영향을 고려한 상태 변수인 경우는 콕스 회귀 분석을 시행해야 하며, 각각 결과에의 영향력은 회귀 계수 ${\beta}$, 교차비, 위험비로 평가한다. 이러한 다변량 분석에 대한 이해는 연구를 계획하고 결과를 분석하고자 하는 임상 의사에게 있어 보다 효율적인 연구를 위해 필수적인 소양이라고 할 수 있다.

GMA용접의 단락이행영역에 있어서 아크 상태 평가를 위한 모델 개발 (Development of the Index for Estimating the Arc Status in the Short-circuiting Transfer Region of GMA Welding)

  • 강문진;이세헌;엄기원
    • Journal of Welding and Joining
    • /
    • 제17권4호
    • /
    • pp.85-92
    • /
    • 1999
  • In GMAW, the spatter is generated because of the variation of the arc state. If the arc state is quantitatively assessed, the control method to make the spatter be reduced is able to develop. This study was attempted to develop the optimal model that could estimate the arc state quantitatively. To do this, the generated spatters was captured under the limited welding conditions, and the waveforms of the arc voltage and of the welding current were collected. From the collected waveforms, the waveform factors and their standard deviations were produced, and the linear and non-linear regression models constituted using the factors and their standard deviations are proposed to estimate the arc state. the performance test to the proposed models was practiced. Obtained results are as follow. From the results of correlation analysis between the factors and the amount of the generated spatters, the standard deviations of the waveform factors have more the multiple regression coefficients than the waveform factors. Because the correlation coefficient between T and {TEX}$T_{a}${/TEX}, and s[T] and s[{TEX}$T_{a}${/TEX}] was nearly one, it was found that these factors have the same effect to the spatter generation. In the regression models to estimate the arc state, it was fond that the linear and the non linear models were also consisted of similar factors. In addition, the linear regression model was assessed the optimal model for estimating the arc state because the variance of data was narrow and multiple regression coefficient was highest among the models. But in the welding conditions which the amount of the generated spatters were small, it was found that the non linear regression model had better the estimation performance for the spatter generation than the linear.

  • PDF

디스플레이 FAB 생산능력 예측 개선 사례 연구 (A Case Study on the Improvement of Display FAB Production Capacity Prediction)

  • 길준필;최진영
    • 산업경영시스템학회지
    • /
    • 제43권2호
    • /
    • pp.137-145
    • /
    • 2020
  • Various elements of Fabrication (FAB), mass production of existing products, new product development and process improvement evaluation might increase the complexity of production process when products are produced at the same time. As a result, complex production operation makes it difficult to predict production capacity of facilities. In this environment, production forecasting is the basic information used for production plan, preventive maintenance, yield management, and new product development. In this paper, we tried to develop a multiple linear regression analysis model in order to improve the existing production capacity forecasting method, which is to estimate production capacity by using a simple trend analysis during short time periods. Specifically, we defined overall equipment effectiveness of facility as a performance measure to represent production capacity. Then, we considered the production capacities of interrelated facilities in the FAB production process during past several weeks as independent regression variables in order to reflect the impact of facility maintenance cycles and production sequences. By applying variable selection methods and selecting only some significant variables, we developed a multiple linear regression forecasting model. Through a numerical experiment, we showed the superiority of the proposed method by obtaining the mean residual error of 3.98%, and improving the previous one by 7.9%.

비선형 회귀 분석을 이용한 부유식 해양 구조물의 중량 추정 모델 연구 (A Study on the Weight Estimation Model of Floating Offshore Structures using the Non-linear Regression Analysis)

  • 서성호;노명일;신현경
    • 대한조선학회논문집
    • /
    • 제51권6호
    • /
    • pp.530-538
    • /
    • 2014
  • The weight estimation of floating offshore structures such as FPSO, TLP, semi-Submersibles, Floating Offshore Wind Turbines etc. in the preliminary design, is one of important measures of both construction cost and basic performance. Through both literature investigation and internet search, the weight data of floating offshore structures such as FPSO and TLP was collected. In this study, the weight estimation model was suggested for FPSO. The weight estimation model using non-linear regression analysis was established by fixing independent variables based on this data and the multiple regression analysis was introduced into the weight estimation model. Its reliability was within 4% of error rate.

Machine learning-based regression analysis for estimating Cerchar abrasivity index

  • Kwak, No-Sang;Ko, Tae Young
    • Geomechanics and Engineering
    • /
    • 제29권3호
    • /
    • pp.219-228
    • /
    • 2022
  • The most widely used parameter to represent rock abrasiveness is the Cerchar abrasivity index (CAI). The CAI value can be applied to predict wear in TBM cutters. It has been extensively demonstrated that the CAI is affected significantly by cementation degree, strength, and amount of abrasive minerals, i.e., the quartz content or equivalent quartz content in rocks. The relationship between the properties of rocks and the CAI is investigated in this study. A database comprising 223 observations that includes rock types, uniaxial compressive strengths, Brazilian tensile strengths, equivalent quartz contents, quartz contents, brittleness indices, and CAIs is constructed. A linear model is developed by selecting independent variables while considering multicollinearity after performing multiple regression analyses. Machine learning-based regression methods including support vector regression, regression tree regression, k-nearest neighbors regression, random forest regression, and artificial neural network regression are used in addition to multiple linear regression. The results of the random forest regression model show that it yields the best prediction performance.

Optimized Neural Network Weights and Biases Using Particle Swarm Optimization Algorithm for Prediction Applications

  • Ahmadzadeh, Ezat;Lee, Jieun;Moon, Inkyu
    • 한국멀티미디어학회논문지
    • /
    • 제20권8호
    • /
    • pp.1406-1420
    • /
    • 2017
  • Artificial neural networks (ANNs) play an important role in the fields of function approximation, prediction, and classification. ANN performance is critically dependent on the input parameters, including the number of neurons in each layer, and the optimal values of weights and biases assigned to each neuron. In this study, we apply the particle swarm optimization method, a popular optimization algorithm for determining the optimal values of weights and biases for every neuron in different layers of the ANN. Several regression models, including general linear regression, Fourier regression, smoothing spline, and polynomial regression, are conducted to evaluate the proposed method's prediction power compared to multiple linear regression (MLR) methods. In addition, residual analysis is conducted to evaluate the optimized ANN accuracy for both training and test datasets. The experimental results demonstrate that the proposed method can effectively determine optimal values for neuron weights and biases, and high accuracy results are obtained for prediction applications. Evaluations of the proposed method reveal that it can be used for prediction and estimation purposes, with a high accuracy ratio, and the designed model provides a reliable technique for optimization. The simulation results show that the optimized ANN exhibits superior performance to MLR for prediction purposes.