• Title/Summary/Keyword: 다중선형 및 비선형회귀분석

Search Result 84, Processing Time 0.014 seconds

Comparison of Linear and Nonlinear Regressions and Elements Analysis for Wind Speed Prediction (풍속 예측을 위한 선형회귀분석과 비선형회귀분석 기법의 비교 및 인자분석)

  • Kim, Dongyeon;Seo, Kisung
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.25 no.5
    • /
    • pp.477-482
    • /
    • 2015
  • Linear regressions and evolutionary nonlinear regression based compensation techniques for the short-range prediction of wind speed are investigated. Development of an efficient MOS(Model Output Statistics) is necessary to correct systematic errors of the model, but a linear regression based MOS is hard to manage an irregular nature of weather prediction. In order to solve the problem, a nonlinear and symbolic regression method using GP(Genetic Programming) is suggested for a development of MOS for wind speed prediction. The proposed method is compared to various linear regression methods for prediction of wind speed. Also, statistical analysis of distribution for UM elements for each method is executed. experiments are performed for KLAPS(Korea Local Analysis and Prediction System) re-analysis data from 2007 to 2013 year for Jeju Island and Busan area in South Korea.

Performance Evaluation of Multilinear Regression Empirical Formula and Machine Learning Model for Prediction of Two-dimensional Transverse Dispersion Coefficient (다중선형회귀경험식과 머신러닝모델의 2차원 횡 분산계수 예측성능 평가)

  • Lee, Sun Mi;Park, Inhwan
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2022.05a
    • /
    • pp.172-172
    • /
    • 2022
  • 분산계수는 하천에서 오염물질의 혼합능을 파악할 수 있는 대표적인 인자이다. 특히 하수처리장 방류수 혼합예측과 같이 횡 방향 혼합에 대한 예측이 중요한 경우, 하천의 지형적, 수리학적 특성을 고려한 2차원 횡 분산계수의 결정이 필요하다. 2차원 횡 분산계수의 결정을 위해 기존 연구에서는 추적자실험결과로부터 경험식을 만들어 횡 분산계수 산정에 사용해왔다. 회귀분석을 통한 경험식 산정을 위해서는 충분한 데이터가 필요하지만, 2차원 추적자 실험 건수가 충분치 않아 신뢰성 높은 경험식 산정이 어려운 상황이다. 따라서 본 연구에서는 SMOTE기법을 이용하여 횡분산계수 실험데이터를 증폭시켜 이로부터 횡 분산계수 경험식을 산정하고자 한다. 또한 다중선형회귀분석을 통해 도출된 경험식의 한계를 보완하기 위해 다양한 머신러닝 기법을 적용하고, 횡 분산계수 산정에 적합한 머신러닝 기법을 제안하고자 한다. 기존 추적자실험 데이터로부터 하폭 대 수심비, 유속 대 마찰유속비, 횡 분산계수 데이터 셋을 수집하였으며, SMOTE 알고리즘의 적용을 통해 회귀분석과 머신러닝 기법 적용에 필요한 데이터그룹을 생성했다. 새롭게 생성된 데이터 셋을 포함하여 다중선형회귀분석을 통해 횡 분산계수 경험식을 결정하였으며, 새로 제안한 경험식과 기존 경험식에 대한 정확도를 비교했다. 또한 다중선형회귀분석을 통해 결정된 경험식은 횡 분산계수 예측범위에 한계를 보였기 때문에 머신러닝기법을 적용하여 다중선형회귀분석에 대한 예측성능을 평가했다. 이를 위해 머신러닝 기법으로서 서포트 벡터 머신 회귀(SVR), K근접이웃 회귀(KNN-R), 랜덤 포레스트 회귀(RFR)를 활용했다. 세 가지 머신러닝 기법을 통해 도출된 횡 분산계수와 경험식으로부터 결정된 횡 분산계수를 비교하여 예측 성능을 비교했다. 이를 통해 제한된 실험데이터 셋으로부터 2차원 횡 분산계수 산정을 위한 데이터 전처리 기법 및 횡 분산계수 산정에 적합한 머신러닝 절차와 최적 학습기법을 도출했다.

  • PDF

Characteristics and Models of the Side-swipe Accident in the Case of Cheongju 4-legged Signalized Intersections (4지 신호교차로의 측면접촉사고 특성 및 사고모형 - 청주시를 사례로 -)

  • Park, Sang-Hyuk;Kim, Tae-Young;Park, Byung-Ho
    • International Journal of Highway Engineering
    • /
    • v.11 no.4
    • /
    • pp.41-47
    • /
    • 2009
  • This study deals with the side-swipe accidents of 4-legged signalized intersections in Cheongju. The objectives are to analyze the characteristics of the accidents and to develop the related models. In pursuing the above, this study gives particular emphasis to finding the appropriate methodology to modelling. The main results are as follows. First, injuries were analyzed to be twice than property-only accidents in the side-swipe accidents. The accidents were evaluated to occur more in inside-intersection. Also, the accidents were analyzed to be almost the auto-related accidents and to be occurred by the unsafely-driving activity. Second, multiple linear regression models were evaluated to be more statistically significant than multiple non-linear. The most fitted models were analyzed to be the models with the number of accidents as the dependent variable. The factors of side-swipe accidents analyzed in this study were ADT, area of intersection, right-turn-only-lane, number of pedestrian crossings, limited speed of main road, maximum grade and number of signal phase.

  • PDF

Multivariate Analysis for Clinicians (임상의를 위한 다변량 분석의 실제)

  • Oh, Joo Han;Chung, Seok Won
    • Clinics in Shoulder and Elbow
    • /
    • v.16 no.1
    • /
    • pp.63-72
    • /
    • 2013
  • In medical research, multivariate analysis, especially multiple regression analysis, is used to analyze the influence of multiple variables on the result. Multiple regression analysis should include variables in the model and the problem of multi-collinearity as there are many variables as well as the basic assumption of regression analysis. The multiple regression model is expressed as the coefficient of determination, $R^2$ and the influence of independent variables on result as a regression coefficient, ${\beta}$. Multiple regression analysis can be divided into multiple linear regression analysis, multiple logistic regression analysis, and Cox regression analysis according to the type of dependent variables (continuous variable, categorical variable (binary logit), and state variable, respectively), and the influence of variables on the result is evaluated by regression coefficient${\beta}$, odds ratio, and hazard ratio, respectively. The knowledge of multivariate analysis enables clinicians to analyze the result accurately and to design the further research efficiently.

Hydrologic Variable Prediction Using Nonlinear Ensemble Model (비선형 앙상블 모형을 이용한 수문량 예측)

  • Kwon, Hyun-Han;Kim, Min-Ji;Kim, Jang-Kyung;Na, Bong-Gil
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2011.05a
    • /
    • pp.359-359
    • /
    • 2011
  • 기존 수자원계획에 있어서 수문량 예측은 매우 제한적으로 활용되고 있는 실정으로서 최근 기후변화 및 이상기후로 기인하는 기상학적 불확실성 증가에 대해서 효과적으로 대응 하기가 어렵다. 본 연구에서는 기상인자를 활용한 수문변량 예측기법을 개발하고자 하며 국내에 수문자료가 충분한 지역에 대해서 모형의 적합성과 타당성을 평가하고자 한다. 대부분의 수문변량은 해수면온도, 해수면기압, 바람장 등 Large Scale의 기상학적 특성과 연관성을 가지고 있으며 선행시간을 가지고 수문순환에 영향을 주고 있다. 수문변량과 기상학적 변량사이에는 일반적으로 비선형 관계를 가지고 있는 것으로 알려지고 있으며 이러한 비선형 관계를 효과적으로 예측하기 위해서 본 연구에서는 비선형 예측모형을 개발 하고자 한다. 최근 비선형 예측모형에서 불확실성을 고려한 모형에 대한 연구가 활발히 진행되고 있으며 특히, 다중 모형을 사용한 Ensemble 개념의 예측모형 도입이 이루어지고 있다. 본 연구에서는 국내 다목적댐 유입량 및 강수량에 대해서 최적 기상변량을 도출하고 이를 활용한 비선형 Ensemble 예측모형을 개발하였다. 일반적인 선형 회귀분석 모형에 비해 기상현상과 수문현상에 비선형성을 효과적으로 재현할 수 있는 장점을 확인할 수 있었으며 이와 더불어 예측결과에 대한 불확실성을 제공함으로서 신뢰성 있는 수자원 계획을 위한 기초자료로서 활용이 가능할 것으로 판단된다.

  • PDF

An Analysis Study for Optimal Uptake of Nutrient Solution Based on Multiple Linear Regression Model in Strawberry Hydroponic Environments (딸기 수경 재배 환경에서의 다중 선형 회귀 모델 기반의 양액 적정 흡수량 분석 연구)

  • Lim, Jong-Hyun;Lee, Myeong-Bae;Cho, Hyun-Wook;Shin, Chang-Sun;Park, Chang-Woo;Cho, Yong-Yun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2019.10a
    • /
    • pp.578-580
    • /
    • 2019
  • 우리 나라의 딸기 수경재배 면적은 2002년 5ha로 시작해서, 2007년에는 84ha, 2012년에는 317ha, 2017년에 1,575ha로 매년 30% 이상 급속하게 성장하고 있다. 이런 경향은 수경재배가 토양재배보다 작업이 용이하여 노동시간이 절약되며, 수량을 더 많이 생산할 수 있기 때문이다. 하지만, 공급양액을 배액으로 흘려버리는 비순환식 수경재배 방식이 증가 하면서 환경오염을 유발시킬 뿐만 아니라 수경재배 운영비용의 증가를 가져오고 있다. 본 논문은 작물 생장에 최적화된 양액공급을 위해 상관관계 분석 및 다중 선형 회귀 모델 기반의 딸기 수경재배 환경에서의 최적 양액 흡수량을 분석하고 추정해 보았다. 분석 결과, 수경재배 환경정보(일사량, 온도, 습도, CO2 등)를 대상으로 일사량 및 온도가 습도 및 CO2에 비해 딸기재배를 위한 양액 흡수량에 더 큰 영향을 주는 것으로 분석되었고, 다중 선형 회귀 모델을 통한 회귀식의 R-Square값은 0.358으로 나타났다.

Relationship Between Physical Properties and Compression Index for Marine Clay (해성점토의 물리적 특성과 압축지수의 상관성)

  • 김동후;김기웅;백영식
    • Journal of the Korean Geotechnical Society
    • /
    • v.19 no.6
    • /
    • pp.371-378
    • /
    • 2003
  • The compression index of clay distributed in the west and south coast of the Korean Peninsula had been studied. Compression index was obtained from the conventional consolidation test, and was conducted accordingly to obtain the field virgin compression curve by means of Schmertmann's graphical correction. To examine a correlation closely between physical properties of soils($e_o$, LL, w) and compression index(Cc), linen. and non-linear regression analysis were employed based on the data collected from tests. The conclusions are as follows. The compression index obtained by means of Schmereann's graphical correction is about 1.16 times for the value of original oedometer test curve for U/D samples. Non-liner regression curve was preferable to establish a correlation equation rather than linear regression curve. All derived equations so far achieved have been summarized and given. However, linear equation is better for practical use so that part by part simplified linear equations were also suggested alternatively together with their own non-linear regression curve.

A Propose on Seismic Performance Evaluation Model of Slope using Artificial Neural Network Technique (인공신경망 기법을 이용한 사면의 내진성능평가 모델 제안)

  • Kwag, Shinyoung;Hahm, Daegi
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.32 no.2
    • /
    • pp.93-101
    • /
    • 2019
  • The objective of this study is to develop a model which can predict the seismic performance of the slope relatively accurately and efficiently by using artificial neural network(ANN) technique. The quantification of such the seismic performance of the slope is not easy task due to the randomness and the uncertainty of the earthquake input and slope model. Under these circumstances, probabilistic seismic fragility analyses of slope have been carried out by several researchers, and a closed-form equation for slope seismic performance was proposed through a multiple linear regression analysis. However, a traditional statistical linear regression analysis has shown a limit that cannot accurately represent the nonlinearistic relationship between the slope of various conditions and seismic performance. In order to overcome these problems, in this study, we attempted to apply the ANN to generate prediction models of the seismic performance of the slope. The validity of the derived model was verified by comparing this with the conventional multi-linear and multi-nonlinear regression models. As a result, the models obtained through the ANN basically showed excellent performance in predicting the seismic performance of the slope, compared to the models obtained by the statistical regression analyses of the previous study.

Comparative Analysis on the Characteristics and Models of Traffic Accidents by Day and Nighttime in the Case of Cheongju 4-legged ignalized Intersections (주·야간 교통사고의 특성 및 사고모형 비교분석 -청주시 4지 신호교차로를 중심으로 -)

  • Yoo, Doo Seon;Oh, Sang Jin;Kim, Tae Young;Park, Byung Ho
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.2D
    • /
    • pp.181-189
    • /
    • 2008
  • The purpose of this study is to comparatively analyze the characteristics and models of traffic accidents by day and nighttime. In pursuing the above, this study gives particular attentions to testing the differences and developing the models (multiple linear and non-linear and Poisson and negative binomial regression) using the data of Cheongju 4-legged signalized intersections. The main results analyzed are as follows. First, the differences between day and nighttime accidents were defined. Second, 12 accident models which are all statistically significant were developed. Finally, the differences between day and nighttime models were comparatively analyzed using the common and specific variables.

N-supplying Capability Evaluation of Corn Field Soils in Pennsylvania (Pennsylvania주 옥수수 재배 토양의 질소공급능력 평가)

  • Hong, Soon-Dal
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.31 no.4
    • /
    • pp.359-367
    • /
    • 1998
  • In order to determine the nitrogen supplying capabilities (NSC) of corn fields, 47 field experiments were performed in Pennsylvania over 3 year from 1986 and NSCs were estimated by the regression analysis with chemical properties and soil attributes. Although the content of $NO_3-N$ in soil showed the best correlation with NSC ($R^2=0.518$), the standardized partial regression coefficient of $NO_3-N$ for NSC was 0.52, with some variations over the years. This value was slightly higher than those of the other properties which ranged from 0.001 to 0.351. Multiple linear regression with soil attributes for the evaluation of NSC was better than simple regression with $NO_3-N$. The coefficient of determination ($R^2$) for the evaluation of NSC was gradually increased; 0.599 with selected chemical properties, 0.698 with quantitative attributes(chemical properties and depth of Ap horizon), and 0.839 with quantitative and selected qualitative soil attributes. Consequently, in order to evaluate NSC, analysis by multiple linear regression with soil attributes was more reliable and better model than by the simple regression model.

  • PDF