• Title/Summary/Keyword: 선형회귀 모델

Search Result 440, Processing Time 0.024 seconds

A Study on the Estimation Possibility of Driver's Stress Degrees with the HRV Analysis (HRV를 통한 운전자 스트레스 정도 추정 가능성 연구)

  • Lee, Jeon
    • Science of Emotion and Sensibility
    • /
    • v.13 no.1
    • /
    • pp.61-68
    • /
    • 2010
  • The objective of this study was to test the estimation possibility of driver's stress degrees with the HRV analysis. For this, first, HRV analysis was applied to the 5 driver's ECG signals which were acquired in 7 different stress situations. From this, the facts that HRV trend was different from that in long-distance driving and 6 parameters - meanRR, sdRR, HF, LF/HF, LFnorm, HFnorm were useful for the stress estimation in stress varying driving situation. Next, we designed 5 personalized linear regression models in which 6 HRV parameters were input and the outcomes were 7 different stress degrees. Finally, we tested each model for 5min-long 16 segments individually. Consequently, the models could not hit the stress degrees exactly in some segments but the correlation coefficients between original stress pattern and estimated stress pattern during entire driving showed reasonably high.

  • PDF

Blood Loss Prediction of Rats in Hemorrhagic Shock Using a Linear Regression Model (출혈성 쇼크를 일으킨 흰쥐에서 선형회귀 분석모델을 이용한 출혈량 추정)

  • Lee, Tak-Hyung;Lee, Ju-Hyung;Choi, Jae-Rim;Yang, Dong-In;Kim, Deok-Won
    • Journal of the Institute of Electronics Engineers of Korea SC
    • /
    • v.47 no.1
    • /
    • pp.56-61
    • /
    • 2010
  • Hemorrhagic shock is a common cause of death in the emergency department. The purpose of this study was to investigate the relationship between blood loss as a percent of the total estimated blood volume (% blood loss) and changes in several physiological parameters. The other goal was to achieve an accurate prediction of percent blood loss for hemorrhagic shock in rats using a linear regression model. We allocated 60 Sprague-Dawley rats into four groups: 0ml, 2ml, 2.5ml, 3 mL/100 g during 15 min. We analyzed the heart rate, systolic and diastolic blood pressure, respiration rate, and body temperature in relation to the percent blood loss. We generated a linear regression model predicting the percent blood loss using a randomly chosen 360 data set and the R-square value of the model was 0.80. Root mean square error of the tested 360 data set using the linear regression was 5.7%. Even though the linear regression model is not directly applicable to clinical situation, our method of predicting % blood loss could be helpful in determining the necessary fluid volume for resuscitation in the future.

A Study on Applying the Nonlinear Regression Schemes to the Low-GloSea6 Weather Prediction Model (Low-GloSea6 기상 예측 모델 기반의 비선형 회귀 기법 적용 연구)

  • Hye-Sung Park;Ye-Rin Cho;Dae-Yeong Shin;Eun-Ok Yun;Sung-Wook Chung
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.16 no.6
    • /
    • pp.489-498
    • /
    • 2023
  • Advancements in hardware performance and computing technology have facilitated the progress of climate prediction models to address climate change. The Korea Meteorological Administration employs the GloSea6 model with supercomputer technology for operational use. Various universities and research institutions utilize the Low-GloSea6 model, a low-resolution coupled model, on small to medium-scale servers for weather research. This paper presents an analysis using Intel VTune Profiler on Low-GloSea6 to facilitate smooth weather research on small to medium-scale servers. The tri_sor_dp_dp function of the atmospheric model, taking 1125.987 seconds of CPU time, is identified as a hotspot. Nonlinear regression models, a machine learning technique, are applied and compared to existing functions conducting numerical operations. The K-Nearest Neighbors regression model exhibits superior performance with MAE of 1.3637e-08 and SMAPE of 123.2707%. Additionally, the Light Gradient Boosting Machine regression model demonstrates the best performance with an RMSE of 2.8453e-08. Therefore, it is confirmed that applying a nonlinear regression model to the tri_sor_dp_dp function during the execution of Low-GloSea6 could be a viable alternative.

Applicability evaluation of aerodynamic approaches for evaporation estimation using pan evaporation data (증발접시 증발량자료를 이용한 공기동력학적 증발량 산정 방법의 적용성 평가)

  • Rim, Chang-Soo
    • Journal of Korea Water Resources Association
    • /
    • v.50 no.11
    • /
    • pp.781-793
    • /
    • 2017
  • In this study, applicabilities of aerodynamic approaches for the estimation of pan evaporation were evaluated on 56 study stations in South Korea. To accomplish this study purpose, previous researchers' evaporation estimation equations based on aerodynamic approaches were grouped into seven generalized evaporation models. Furthermore, four multiple linear regression (MLR) models were developed and tested. The independent variables of MLR models are meteorological variables such as wind speed, vapor pressure deficit, air temperature, and atmospheric pressure. These meteorological variables are required for the application of aerodynamic approaches. In order to consider the effect of autocorrelation, MLR models were developed after differencing variables. The applicability of MLR models with differenced variables was compared with that of MLR models with undifferenced variables and the comparison results showed no significant difference between the two methods. The study results have indicated that there is strong correlation between estimated pan evaporation (using aerodynamic models and MLR models) and measured pan evaporation. However, pan evaporation are overestimated during August, September, October, November, and December. Most of meteorological variables that are used for MLR models show statistical significance in the estimation of pan evaporation. Vapor pressure deficit was turned out to be the most significant meteorological variable. The second most significant variable was air temperature; wind speed was the third most significant variable, followed by atmospheric pressure.

Application of Multiple Linear Regression Analysis and Tree-Based Machine Learning Techniques for Cutter Life Index(CLI) Prediction (커터수명지수 예측을 위한 다중선형회귀분석과 트리 기반 머신러닝 기법 적용)

  • Ju-Pyo Hong;Tae Young Ko
    • Tunnel and Underground Space
    • /
    • v.33 no.6
    • /
    • pp.594-609
    • /
    • 2023
  • TBM (Tunnel Boring Machine) method is gaining popularity in urban and underwater tunneling projects due to its ability to ensure excavation face stability and minimize environmental impact. Among the prominent models for predicting disc cutter life, the NTNU model uses the Cutter Life Index(CLI) as a key parameter, but the complexity of testing procedures and rarity of equipment make measurement challenging. In this study, CLI was predicted using multiple linear regression analysis and tree-based machine learning techniques, utilizing rock properties. Through literature review, a database including rock uniaxial compressive strength, Brazilian tensile strength, equivalent quartz content, and Cerchar abrasivity index was built, and derived variables were added. The multiple linear regression analysis selected input variables based on statistical significance and multicollinearity, while the machine learning prediction model chose variables based on their importance. Dividing the data into 80% for training and 20% for testing, a comparative analysis of the predictive performance was conducted, and XGBoost was identified as the optimal model. The validity of the multiple linear regression and XGBoost models derived in this study was confirmed by comparing their predictive performance with prior research.

Analyzing Significant Variables from a Linear Regression-Based Prediction Model for Rice Prices (선형 회귀를 이용한 쌀 가격 예측 모델의 유의미한 변수 추출)

  • Seo, Jin-kyeong;Choi, Da-jeong;Ko, Kwang-Ho;Paik, Juryon
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2022.07a
    • /
    • pp.39-42
    • /
    • 2022
  • 쌀을 주식으로 하는 우리나라에서, 쌀의 가격에 영향을 미치는 변수를 찾는 것은 유의미한 연구이다. 본 논문에서는 쌀 가격을 예측하는 모델에 포함되는 여러 변수 가운데 상대적인 중요도가 낮은 변수를 제거하고 유의미한 변수만을 남기고자 한다. 이를 위해 기상, 수확량, 소비자물가의 10년 치 정보를 수집하고 정제한 결과 총 2460일, 7개 지역에서 추출된 17,219개의 데이터를 이용하였다. 모델 평가 결과, 모든 변수를 포함한 모델의 RMSE는 166.0759, 단계적으로 계수가 작은 9개의 변수를 제거한 최종적인 모델의 RMSE는 168.5576으로 유의미한 차이를 보이지 않았다. 최종적으로 남은 변수는 총 10개로 평균 기온, 평균 풍속, 합계 일사, 평균 지면 온도, 0.5M 평균 습도, 4.0M 평균 습도, 10CM 일 토양 수분, 30CM 일 토양 수분, 50CM 일 토양 수분, 전년도 생산량이 포함된다.

  • PDF

An Improved Frequency Modeling Corresponding to the Location of the Anjok of the Gayageum (가야금 안족의 위치에 따른 개선된 주파수 모델링)

  • Kwon, Sundeok;Cho, Sangjin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.33 no.2
    • /
    • pp.146-151
    • /
    • 2014
  • This paper analyzes the previous Anjok model of the Gayageum and describes a method to improve the frequency modeling based on previous model. In the previous work, relation between the fundamental frequency and Anjok's location on the body is assumed as an exponential function and these frequencies are integrated by a first-order leaky integrator. Finally, a parameter of the formula to calculate the fundamental frequency is obtained by applying integrated frequencies to the linear regression. This model shows 2.5 Hz absolute deviation on average and has maximum error 7.75 Hz for the low fundamental frequencies. In order to overcome this problem, this paper proposes that the Anjok's locations are grouped according to the rate of error increase and linear regression is applied to each group. To find the optimal parameter, the RMSE(Root Mean Square Error) between measured and calculated fundamental frequencies is used. The proposed model shows substantial reduction in errors, especially maximum three times.

Prediction of the Water Level of the Tidal River using Artificial Neural Networks and Stationary Wavelets Transform (인공신경망과 정상 웨이블렛 변환을 활용한 감조하천 수위 예측)

  • Lee, Jeongha;Hwang, SeokHwan
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2021.06a
    • /
    • pp.357-357
    • /
    • 2021
  • 홍수로 인한 침수피해 발생을 최소화하기 위해 정확한 하천의 수위 예측과 리드타임 확보가 매우 중요하다. 특히 조석현상의 영향을 받는 감조하천의 경우 기존의 물리적 수문모형의 적용이 제한되어 하천수위 예측의 정확도가 떨어지기도 한다. 따라서 본 연구에서는 이러한 감조하천 수위 예측의 정확도를 높이기 위해 조석현상을 분리하고 인공신경망을 활용하는 하이브리드 모델을 제안 하였으며 다중 선형회귀분석과 비교 분석하였다. 감조하천에 위치한 교량의 수위데이터에서 Stationary Wavelet Transform으로 조석현상을 분리하였으며, 이외의 수위에 영향을 주는 time series data와 인공신경망(ANN)을 활용하여 1시간, 2시간, 3시간 후의 수위를 예측하였다. 하이브리드 모델은 96% 이상의 정확도를 보였으며 다중 선형회귀 분석과 비교하여도 높은 정확성을 보여주었다.

  • PDF

Analysis of Eunpyeong New Town Land Price Using Geographically Weighted Regression (지리가중회귀분석을 이용한 은평뉴타운 지가 분석)

  • Jung, Hyo-jin;Lee, Jiyeong
    • Spatial Information Research
    • /
    • v.23 no.5
    • /
    • pp.65-73
    • /
    • 2015
  • Newtown Business of Seoul had been performed to reduce deterioration of Gangbuk and economic inequality between Gangnam and Gangbuk. According to this, Eunpyeong-gu was set as test-bed for Newtown business and Newtown business had been completed until 2013. This study aims to analyze the influence of social and economical factors which affect land price using GWR (Geographically Weighted Regression) considered spatial effect. As a result of analysis, GWR model demonstrated a better goodness-of-fit than OLS (Ordinary least square) model typically used in most study. Furthermore, AIC value and Moran's I of residual prove that GWR model is more suitable than OLS model. GWR model enable to explain more detailed than global regression model as coefficient and sign show different value locally. In future, this research will be helpful to develop Eunpyeong-gu considering spatial characters and strength effectiveness of development.

Software Cost Estimation Model Based on Use Case Points by using Regression Model (회귀분석을 이용한 UCP 기반 소프트웨어 개발 노력 추정 모델)

  • Park, Ju-Seok;Yang, Hea-Sool
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.8
    • /
    • pp.147-157
    • /
    • 2009
  • Recently, there has been continued research on UCP from the development effort estimation method to a software development project applying object oriented development methodology. Current research proposes a linear model estimating the developmenteffort by multiplying a constant to AUCP which applies technical and environmental factors. However, the fact that a non-linear regression model is more appropriate as the software size increases, the development period increases exponentially. In addition, in the UCP calculation process the occurrence of FP errors due to the application of TCF and EF, it is unrealistic to estimate the size with AUCP. This paper presents the issue of current research based on UCP without considering problems of the research, for example, TCF and EF and expresses the models (linear, logarithmic, polynomial, power and exponential type) estimating the development effort directly from UUCP. Consequently, the exponential model within non-linear models exhibit more accurate results than the current linear model. Therefore, after calculating the UUCP of the developing software system, using the proposed model to estimate the development effort, it is possible to estimate the direct cost required in development.