• Title/Summary/Keyword: 다중선형회귀모델

Search Result 110, Processing Time 0.031 seconds

Prediction of the Water Level of the Tidal River using Artificial Neural Networks and Stationary Wavelets Transform (인공신경망과 정상 웨이블렛 변환을 활용한 감조하천 수위 예측)

  • Lee, Jeongha;Hwang, SeokHwan
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.357-357
    • /
    • 2021
  • 홍수로 인한 침수피해 발생을 최소화하기 위해 정확한 하천의 수위 예측과 리드타임 확보가 매우 중요하다. 특히 조석현상의 영향을 받는 감조하천의 경우 기존의 물리적 수문모형의 적용이 제한되어 하천수위 예측의 정확도가 떨어지기도 한다. 따라서 본 연구에서는 이러한 감조하천 수위 예측의 정확도를 높이기 위해 조석현상을 분리하고 인공신경망을 활용하는 하이브리드 모델을 제안 하였으며 다중 선형회귀분석과 비교 분석하였다. 감조하천에 위치한 교량의 수위데이터에서 Stationary Wavelet Transform으로 조석현상을 분리하였으며, 이외의 수위에 영향을 주는 time series data와 인공신경망(ANN)을 활용하여 1시간, 2시간, 3시간 후의 수위를 예측하였다. 하이브리드 모델은 96% 이상의 정확도를 보였으며 다중 선형회귀 분석과 비교하여도 높은 정확성을 보여주었다.

  • PDF

Estimation of Cerchar abrasivity index based on rock strength and petrological characteristics using linear regression and machine learning (선형회귀분석과 머신러닝을 이용한 암석의 강도 및 암석학적 특징 기반 세르샤 마모지수 추정)

  • Ju-Pyo Hong;Yun Seong Kang;Tae Young Ko
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.26 no.1
    • /
    • pp.39-58
    • /
    • 2024
  • Tunnel Boring Machines (TBM) use multiple disc cutters to excavate tunnels through rock. These cutters wear out due to continuous contact and friction with the rock, leading to decreased cutting efficiency and reduced excavation performance. The rock's abrasivity significantly affects cutter wear, with highly abrasive rocks causing more wear and reducing the cutter's lifespan. The Cerchar Abrasivity Index (CAI) is a key indicator for assessing rock abrasivity, essential for predicting disc cutter life and performance. This study aims to develop a new method for effectively estimating CAI using rock strength, petrological characteristics, linear regression, and machine learning. A database including CAI, uniaxial compressive strength, Brazilian tensile strength, and equivalent quartz content was created, with additional derived variables. Variables for multiple linear regression were selected considering statistical significance and multicollinearity, while machine learning model inputs were chosen based on variable importance. Among the machine learning prediction models, the Gradient Boosting model showed the highest predictive performance. Finally, the predictive performance of the multiple linear regression analysis and the Gradient Boosting model derived in this study were compared with the CAI prediction models of previous studies to validate the results of this research.

Multiple linear regression model-based voltage imbalance estimation for high-power series battery pack (다중선형회귀모델 기반 고출력 직렬 배터리 팩의 전압 불균형 추정)

  • Kim, Seung-Woo;Lee, Pyeong-Yeon;Han, Dong-Ho;Kim, Jong-hoon
    • Journal of IKEEE
    • /
    • v.23 no.1
    • /
    • pp.1-8
    • /
    • 2019
  • In this paper, the electrical characteristics with various C-rates are tested with a high power series battery pack comprised of 18650 cylindrical nickel cobalt aluminum(NCA) lithium-ion battery. The electrical characteristics of discharge capacity test with 14S1P battery pack and electric vehicle (EV) cycle test with 4S1P battery pack are compared and analyzed by the various of C-rates. Multiple linear regression is used to estimate voltage imbalance of 14S1P and 4S1P battery packs with various C-rates based on experimental data. The estimation accuracy is evaluated by root mean square error(RMSE) to validate multiple linear regression. The result of this paper is contributed that to use for estimating the voltage imbalance of discharge capacity test with 14S1P battery pack using multiple linear regression better than to use the voltage imbalance of EV cycle with 4S1P battery pack.

A Study on Estimating the Crossing Speed of Mobility Handicapped for the Activation of the Smart Crossing System (스마트횡단시스템 활성화를 위한 교통약자의 횡단속도 추정)

  • Hyung Kyu Kim;Sang Cheal Byun;Yeo Hwan Yoon;Jae Seok Kim
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.21 no.6
    • /
    • pp.87-96
    • /
    • 2022
  • The traffic vulnerable, including elderly pedestrians, have a relatively low walking speed and slow cognitive response time due to reduced physical ability. Although a smart crossing system has been developed and operated to improve problem, it is difficult to operate a signal that reflects the appropriate walking speed for each pedestrian. In this study, a neural network model and a multiple regression model-based traversing speed estimation model were developed using image information collected in an area with a high percentage of traffic vulnerability. to support the provision of optimal walking signals according to real-time traffic weakness. actual traffic data collected from the urban traffic network of Paju-si, Gyeonggi-do were used. The performance of the model was evaluated through seven selected indicators, including correlation coefficient and mean absolute error. The multiple linear regression model had a correlation coefficient of 0.652 and 0.182; the neural network model had a correlation coefficient of 0.823 and 0.105. The neural network model showed higher predictive power.

Analysis of Empirical Multiple Linear Regression Models for the Production of PM2.5 Concentrations (PM2.5농도 산출을 위한 경험적 다중선형 모델 분석)

  • Choo, Gyo-Hwang;Lee, Kyu-Tae;Jeong, Myeong-Jae
    • Journal of the Korean earth science society
    • /
    • v.38 no.4
    • /
    • pp.283-292
    • /
    • 2017
  • In this study, the empirical models were established to estimate the concentrations of surface-level $PM_{2.5}$ over Seoul, Korea from 1 January 2012 to 31 December 2013. We used six different multiple linear regression models with aerosol optical thickness (AOT), ${\AA}ngstr{\ddot{o}}m$ exponents (AE) data from Moderate Resolution Imaging Spectroradiometer (MODIS) aboard Terra and Aqua satellites, meteorological data, and planetary boundary layer depth (PBLD) data. The results showed that $M_6$ was the best empirical model and AOT, AE, relative humidity (RH), wind speed, wind direction, PBLD, and air temperature data were used as input data. Statistical analysis showed that the result between the observed $PM_{2.5}$ and the estimated $PM_{2.5}$ concentrations using $M_6$ model were correlations (R=0.62) and root square mean error ($RMSE=10.70{\mu}gm^{-3}$). In addition, our study show that the relation strongly depends on the seasons due to seasonal observation characteristics of AOT, with a relatively better correlation in spring (R=0.66) and autumntime (R=0.75) than summer and wintertime (R was about 0.38 and 0.56). These results were due to cloud contamination of summertime and the influence of snow/ice surface of wintertime, compared with those of other seasons. Therefore, the empirical multiple linear regression model used in this study showed that the AOT data retrieved from the satellite was important a dominant variable and we will need to use additional weather variables to improve the results of $PM_{2.5}$. Also, the result calculated for $PM_{2.5}$ using empirical multi linear regression model will be useful as a method to enable monitoring of atmospheric environment from satellite and ground meteorological data.

Improvement of Search Method of Genetic Programing for Wind Prediction MOS (풍속 예측 보정을 위한 Genetic Programing 탐색 기법의 개선)

  • Oh, Seungchul;Seo, Kisung
    • Proceedings of the KIEE Conference
    • /
    • 2015.07a
    • /
    • pp.1349-1350
    • /
    • 2015
  • 풍속은 다른 기상요소들보다 순간 변동이 심하고 국지성이 강하여 수치 예보 모델만으로 예측의 정확성을 높이기가 어렵다. 기상청의 단기 풍속 예보는 전 지구적 통합 예보모델인 UM(Unified Model)의 예측값에 MOS(Model Output Statictics)를 통한 보정을 수행하며, 보정식의 생성에 다중선형회귀분석 방법을 사용한다. 본 연구자는 유전프로그래밍(Genetic Programming)을 이용한 비선형 회귀분석 기반의 보정식 생성을 통하여 이를 개선한 바 있는데, 본 연구에서는 보다 향상된 성능을 얻기 위하여 GP 기법 측면에서 Automatically Defined Functions과 다군집(Multiple Populations) 수행을 통해 성능을 높이고자 한다.

  • PDF

N-supplying Capability Evaluation of Corn Field Soils in Pennsylvania (Pennsylvania주 옥수수 재배 토양의 질소공급능력 평가)

  • Hong, Soon-Dal
    • Korean Journal of Soil Science and Fertilizer
    • /
    • v.31 no.4
    • /
    • pp.359-367
    • /
    • 1998
  • In order to determine the nitrogen supplying capabilities (NSC) of corn fields, 47 field experiments were performed in Pennsylvania over 3 year from 1986 and NSCs were estimated by the regression analysis with chemical properties and soil attributes. Although the content of $NO_3-N$ in soil showed the best correlation with NSC ($R^2=0.518$), the standardized partial regression coefficient of $NO_3-N$ for NSC was 0.52, with some variations over the years. This value was slightly higher than those of the other properties which ranged from 0.001 to 0.351. Multiple linear regression with soil attributes for the evaluation of NSC was better than simple regression with $NO_3-N$. The coefficient of determination ($R^2$) for the evaluation of NSC was gradually increased; 0.599 with selected chemical properties, 0.698 with quantitative attributes(chemical properties and depth of Ap horizon), and 0.839 with quantitative and selected qualitative soil attributes. Consequently, in order to evaluate NSC, analysis by multiple linear regression with soil attributes was more reliable and better model than by the simple regression model.

  • PDF

QSPR analysis for predicting heat of sublimation of organic compounds (유기화합물의 승화열 예측을 위한 QSPR분석)

  • Park, Yu Sun;Lee, Jong Hyuk;Park, Han Woong;Lee, Sung Kwang
    • Analytical Science and Technology
    • /
    • v.28 no.3
    • /
    • pp.187-195
    • /
    • 2015
  • The heat of sublimation (HOS) is an essential parameter used to resolve environmental problems in the transfer of organic contaminants to the atmosphere and to assess the risk of toxic chemicals. The experimental measurement of the heat of sublimation is time-consuming, expensive, and complicated. In this study, quantitative structural property relationships (QSPR) were used to develop a simple and predictive model for measuring the heat of sublimation of organic compounds. The population-based forward selection method was applied to select an informative subset of descriptors of learning algorithms, such as by using multiple linear regression (MLR) and the support vector machine (SVM) method. Each individual model and consensus model was evaluated by internal validation using the bootstrap method and y-randomization. The predictions of the performance of the external test set were improved by considering their applicability to the domain. Based on the results of the MLR model, we showed that the heat of sublimation was related to dispersion, H-bond, electrostatic forces, and the dipole-dipole interaction between inter-molecules.

Prediction Model for Specific Cutting Energy of Pick Cutters Based on Gene Expression Programming and Particle Swarm Optimization (유전자 프로그래밍과 개체군집최적화를 이용한 픽 커터의 절삭비에너지 예측모델)

  • Hojjati, Shahabedin;Jeong, Hoyoung;Jeon, Seokwon
    • Tunnel and Underground Space
    • /
    • v.28 no.6
    • /
    • pp.651-669
    • /
    • 2018
  • This study suggests the prediction model to estimate the specific energy of a pick cutter using a gene expression programming (GEP) and particle swarm optimization (PSO). Estimating the performance of mechanical excavators is of crucial importance in early design stage of tunnelling projects, and the specific energy (SE) based approach serves as a standard performance prediction procedure that is applicable to all excavation machines. The purpose of this research, is to investigate the relationship between UCS and BTS, penetration depth, cut spacing, and SE. A total of 46 full-scale linear cutting test results using pick cutters and different values of depth of cut and cut spacing on various rock types was collected from the previous study for the analysis. The Mean Squared Error (MSE) associated with the conventional Multiple Linear Regression (MLR) method is more than two times larger than the MSE generated by GEP-PSO algorithm. The $R^2$ value associated with the GEP-PSO algorithm, is about 0.13 higher than the $R^2$ associated with MLR.

Prediction of Short and Long-term PV Power Generation in Specific Regions using Actual Converter Output Data (실제 컨버터 출력 데이터를 이용한 특정 지역 태양광 장단기 발전 예측)

  • Ha, Eun-gyu;Kim, Tae-oh;Kim, Chang-bok
    • Journal of Advanced Navigation Technology
    • /
    • v.23 no.6
    • /
    • pp.561-569
    • /
    • 2019
  • Solar photovoltaic can provide electrical energy with only radiation, and its use is expanding rapidly as a new energy source. This study predicts the short and long-term PV power generation using actual converter output data of photovoltaic system. The prediction algorithm uses multiple linear regression, support vector machine (SVM), and deep learning such as deep neural network (DNN) and long short-term memory (LSTM). In addition, three models are used according to the input and output structure of the weather element. Long-term forecasts are made monthly, seasonally and annually, and short-term forecasts are made for 7 days. As a result, the deep learning network is better in prediction accuracy than multiple linear regression and SVM. In addition, LSTM, which is a better model for time series prediction than DNN, is somewhat superior in terms of prediction accuracy. The experiment results according to the input and output structure appear Model 2 has less error than Model 1, and Model 3 has less error than Model 2.