• 제목/요약/키워드: linear regression analysis

검색결과 2,850건 처리시간 0.03초

비선형 회귀 분석을 이용한 부유식 해양 구조물의 중량 추정 모델 연구 (A Study on the Weight Estimation Model of Floating Offshore Structures using the Non-linear Regression Analysis)

  • 서성호;노명일;신현경
    • 대한조선학회논문집
    • /
    • 제51권6호
    • /
    • pp.530-538
    • /
    • 2014
  • The weight estimation of floating offshore structures such as FPSO, TLP, semi-Submersibles, Floating Offshore Wind Turbines etc. in the preliminary design, is one of important measures of both construction cost and basic performance. Through both literature investigation and internet search, the weight data of floating offshore structures such as FPSO and TLP was collected. In this study, the weight estimation model was suggested for FPSO. The weight estimation model using non-linear regression analysis was established by fixing independent variables based on this data and the multiple regression analysis was introduced into the weight estimation model. Its reliability was within 4% of error rate.

순수 성분의 물성 자료를 이용한 2성분계 혼합물의 인화점에 대한 다변량 통계 분석 및 예측 (Multivariate Statistical Analysis and Prediction for the Flash Points of Binary Systems Using Physical Properties of Pure Substances)

  • 이범석;김성영
    • 한국가스학회지
    • /
    • 제11권3호
    • /
    • pp.13-18
    • /
    • 2007
  • 다변량 통계 분석법(Multivariate statistical analysis method)의 대표적 방법인 다중 선형 회귀법(Multiple linear regression. MLR)을 이용하여 2성분계 혼합물의 인화점을 회귀 분석하고 예측하였다. 가연성 물질의 인화점에 대한 예측은 실제 화학 공정 설계에서 화재 및 폭발 위험성을 판단하는 중요한 부분 중의 하나이다. 본 연구에서는 순수 성분의 물성 자료만을 이용하여 2성분계 혼합물의 인화점 실험 자료에 대해 다중 선형 회귀법(MLR)을 수행하였고, 이를 이용하여 새로운 혼합물에 대한 인화점을 예측하였다. 2성분계 혼합물의 인화점에 대한 MLR의 회귀 성능과 새로운 혼합물에 대한 예측 성능을 알아보기 위해, 기존의 인화점 추정 방법인 Raoult의 법칙과 Van Laar식에 의한 추정값과 비교해 보았다.

  • PDF

Hybrid Fuzzy Least Squares Support Vector Machine Regression for Crisp Input and Fuzzy Output

  • Shim, Joo-Yong;Seok, Kyung-Ha;Hwang, Chang-Ha
    • Communications for Statistical Applications and Methods
    • /
    • 제17권2호
    • /
    • pp.141-151
    • /
    • 2010
  • Hybrid fuzzy regression analysis is used for integrating randomness and fuzziness into a regression model. Least squares support vector machine(LS-SVM) has been very successful in pattern recognition and function estimation problems for crisp data. This paper proposes a new method to evaluate hybrid fuzzy linear and nonlinear regression models with crisp inputs and fuzzy output using weighted fuzzy arithmetic(WFA) and LS-SVM. LS-SVM allows us to perform fuzzy nonlinear regression analysis by constructing a fuzzy linear regression function in a high dimensional feature space. The proposed method is not computationally expensive since its solution is obtained from a simple linear equation system. In particular, this method is a very attractive approach to modeling nonlinear data, and is nonparametric method in the sense that we do not have to assume the underlying model function for fuzzy nonlinear regression model with crisp inputs and fuzzy output. Experimental results are then presented which indicate the performance of this method.

An Investigation on Application of Experimental Design and Linear Regression Technique to Predict Pitting Potential of Stainless Steel

  • Jung, Kwang-Hu;Kim, Seong-Jong
    • Corrosion Science and Technology
    • /
    • 제20권2호
    • /
    • pp.52-61
    • /
    • 2021
  • This study using experimental design and linear regression technique was implemented in order to predict the pitting potential of stainless steel in marine environments, with the target materials being AL-6XN and STS 316L. The various variables (inputs) which affect stainless steel's pitting potential included the pitting resistance equivalent number (PRNE), temperature, pH, Cl- concentration, sulfate levels, and nitrate levels. Among them, significant factors affecting pitting potential were chosen through an experimental design method (screening design, full factor design, analysis of variance). The potentiodynamic polarization test was performed based on the experimental design, including significant factor levels. From these testing methods, a total 32 polarization curves were obtained, which were used as training data for the linear regression model. As a result of the model's validation, it showed an acceptable prediction performance, which was statistically significant within the 95% confidence level. The linear regression model based on the full factorial design and ANOVA also showed a high confidence level in the prediction of pitting potential. This study confirmed the possibility to predict the pitting potential of stainless steel according to various variables used with experimental linear regression design.

ILL-CONDITIONING IN LINEAR REGRESSION MODELS AND ITS DIAGNOSTICS

  • Ghorbani, Hamid
    • 한국수학교육학회지시리즈B:순수및응용수학
    • /
    • 제27권2호
    • /
    • pp.71-81
    • /
    • 2020
  • Multicollinearity is a common problem in linear regression models when two or more regressors are highly correlated, which yields some serious problems for the ordinary least square estimates of the parameters as well as model validation and interpretation. In this paper, first the problem of multicollinearity and its subsequent effects on the linear regression along with some important measures for detecting multicollinearity is reviewed, then the role of eigenvalues and eigenvectors in detecting multicollinearity are bolded. At the end a real data set is evaluated for which the fitted linear regression models is investigated for multicollinearity diagnostics.

Optimized Neural Network Weights and Biases Using Particle Swarm Optimization Algorithm for Prediction Applications

  • Ahmadzadeh, Ezat;Lee, Jieun;Moon, Inkyu
    • 한국멀티미디어학회논문지
    • /
    • 제20권8호
    • /
    • pp.1406-1420
    • /
    • 2017
  • Artificial neural networks (ANNs) play an important role in the fields of function approximation, prediction, and classification. ANN performance is critically dependent on the input parameters, including the number of neurons in each layer, and the optimal values of weights and biases assigned to each neuron. In this study, we apply the particle swarm optimization method, a popular optimization algorithm for determining the optimal values of weights and biases for every neuron in different layers of the ANN. Several regression models, including general linear regression, Fourier regression, smoothing spline, and polynomial regression, are conducted to evaluate the proposed method's prediction power compared to multiple linear regression (MLR) methods. In addition, residual analysis is conducted to evaluate the optimized ANN accuracy for both training and test datasets. The experimental results demonstrate that the proposed method can effectively determine optimal values for neuron weights and biases, and high accuracy results are obtained for prediction applications. Evaluations of the proposed method reveal that it can be used for prediction and estimation purposes, with a high accuracy ratio, and the designed model provides a reliable technique for optimization. The simulation results show that the optimized ANN exhibits superior performance to MLR for prediction purposes.

원격탐사자료와 GIS를 활용한 도시 표면온도의 공간적 분포특성에 관한 연구 (A Study on the Spatial Distribution Characteristic of Urban Surface Temperature using Remotely Sensed Data and GIS)

  • 조명희;이광재;김운수
    • 한국지리정보학회지
    • /
    • 제4권1호
    • /
    • pp.57-66
    • /
    • 2001
  • 본 연구에서는 도시표면온도를 추출하기 위하여 다시기 Landsat TM band 6 영상을 이용하여 과학기술부의 4가지 모델 즉 two-point linear model, linear regression model, quadratic regression model, cubic regression model에 대하여 각각 공간분석을 실시하였으며 그 결과를 AWS(automatic weather station) 관측자료와 상관 및 회귀분석 함과 동시에 GIS 공간분석 기법을 이용하여 도시 표면온도의 공간적 분포특성을 규명하였다. Landsat TM band 6으로부터 추출된 표면온도를 기초로 하여 토지피복별 표면온도 분포를 분석한 결과 도시 및 나지 지역이 가장 높은 온도분포대를 형성하고 있었으며, 표면온도와 NDVI간의 상관분석결과 평균 -0.85 정도의 음의 상관성을 확인할 수 있었다. 이와 같은 결과는 향후 기상환경 특성을 고려한 도시계획수립에 있어 중요한 인자로 작용할 것으로 사료된다.

  • PDF

Local linear regression analysis for interval-valued data

  • Jang, Jungteak;Kang, Kee-Hoon
    • Communications for Statistical Applications and Methods
    • /
    • 제27권3호
    • /
    • pp.365-376
    • /
    • 2020
  • Interval-valued data, a type of symbolic data, is given as an interval in which the observation object is not a single value. It can also occur frequently in the process of aggregating large databases into a form that is easy to manage. Various regression methods for interval-valued data have been proposed relatively recently. In this paper, we introduce a nonparametric regression model using the kernel function and a nonlinear regression model for the interval-valued data. We also propose applying the local linear regression model, one of the nonparametric methods, to the interval-valued data. Simulations based on several distributions of the center point and the range are conducted using each of the methods presented in this paper. Various conditions confirm that the performance of the proposed local linear estimator is better than the others.

국내 화강암의 점하중강도와 일축압축강도간의 상관분석 (Correlation Between the Point-Load Strength and the Uniaxial Compressive Strength of Korean Granites)

  • 우익
    • 지질공학
    • /
    • 제24권1호
    • /
    • pp.101-110
    • /
    • 2014
  • 이 연구에서는 국내에 분포하는 화강암에 대한 점하중강도($I_{s(50)}$) 와 일축압축강도(UCS) 사이의 상관관계를 선형회귀분석을 통하여 구하였다. 이를 위하여 암석시료의 물리적 물성에 근거하여 화강암 시료를 재분류하여 세 경우에 대한 회귀분석을 수행하였다. 첫째로, 풍화등급에 따른 원점회귀분석을 수행하여 풍화에 따른 상관관계를 구하였으나 풍화등급별 시료의 개수가 부족하여 만족할만한 신뢰수준의 회귀분석결과를 얻지 못하였지만, 풍화가 진행될수록 회귀직선의 기울기가 급해지는 경향을 파악할 수 있었다. 두 번째, 전체 화강암에 대한 회귀분석을 수행하기 위하여 단순선형회귀분석과 신뢰도 및 정확도를 향상시키기 위하여 부트스트랩 리샘플링법을 이용한 단순선형회귀분석을 통하여 두 강도간의 상관관계를 구하였다. 세번째로는 유사한 물리적 물성을 지닌 화강암 시료들의 평균강도에 대한 선형회귀분석을 수행하여 상관관계를 구하였다. 이 방법들을 사용하여 구한 회귀직선방정식의 기울기는 14 내외의 값을 보이고 작은 편차를 지니고 있으며 국내 화강암에 대하여 수행한 기존 연구와 유사한 값을 보이고 있었다. 그러나 16에서 43의 범위를 지닌 y-축 절편은 큰 편차를 보이기 있었기 때문에, 점하중강도로 일축압축강도를 추정할 때에는 이러한 회귀방정식의 오차범위를 고려하여야 할 것으로 판단된다.

중소하천유역의 임계지속시간 결정에 관한 연구 (Study on the Critical Storm Duration Decision of the Rivers Basin)

  • 안승섭;이효정;정도준
    • 한국환경과학회지
    • /
    • 제16권11호
    • /
    • pp.1301-1312
    • /
    • 2007
  • The objective of this study is to propose a critical storm duration forecasting model on storm runoff in small river basin. The critical storm duration data of 582 sub-basin which introduced disaster impact assessment report on the National Emergency Management Agency during the period from 2004 to 2007 were collected, analyzed and studied. The stepwise multiple regression method are used to establish critical storm duration forecasting models(Linear and exponential type). The results of multiple regression analysis discriminated the linear type more than exponential type. The results of multiple linear regression analysis between the critical storm duration and 5 basin characteristics parameters such as basin area, main stream length, average slope of main stream, shape factor and CN showed more than 0.75 of correlation in terms of the multi correlation coefficient.