• Title/Summary/Keyword: 다중 선형 회귀

Search Result 417, Processing Time 0.031 seconds

Development of Naïve-Bayes classification and multiple linear regression model to predict agricultural reservoir storage rate based on weather forecast data (기상예보자료 기반의 농업용저수지 저수율 전망을 위한 나이브 베이즈 분류 및 다중선형 회귀모형 개발)

  • Kim, Jin Uk;Jung, Chung Gil;Lee, Ji Wan;Kim, Seong Joon
    • Journal of Korea Water Resources Association
    • /
    • v.51 no.10
    • /
    • pp.839-852
    • /
    • 2018
  • The purpose of this study is to predict monthly agricultural reservoir storage by developing weather data-based Multiple Linear Regression Model (MLRM) with precipitation, maximum temperature, minimum temperature, average temperature, and average wind speed. Using Naïve-Bayes classification, total 1,559 nationwide reservoirs were classified into 30 clusters based on geomorphological specification (effective storage volume, irrigation area, watershed area, latitude, longitude and frequency of drought). For each cluster, the monthly MLRM was derived using 13 years (2002~2014) meteorological data by KMA (Korea Meteorological Administration) and reservoir storage rate data by KRC (Korea Rural Community). The MLRM for reservoir storage rate showed the determination coefficient ($R^2$) of 0.76, Nash-Sutcliffe efficiency (NSE) of 0.73, and root mean square error (RMSE) of 8.33% respectively. The MLRM was evaluated for 2 years (2015~2016) using 3 months weather forecast data of GloSea5 (GS5) by KMA. The Reservoir Drought Index (RDI) that was represented by present and normal year reservoir storage rate showed that the ROC (Receiver Operating Characteristics) average hit rate was 0.80 using observed data and 0.73 using GS5 data in the MLRM. Using the results of this study, future reservoir storage rates can be predicted and used as decision-making data on stable future agricultural water supply.

Estimating the Total Precipitation Amount with Simulated Precipitation for Ungauged Stations in Jeju Island (미계측 관측 강수 자료 생성을 통한 제주도 지역의 수문총량 추정)

  • Kim, Nam-Won;Um, Myoung-Jin;Chung, Il-Moon;Heo, Jun-Haeng
    • Journal of Korea Water Resources Association
    • /
    • v.45 no.9
    • /
    • pp.875-885
    • /
    • 2012
  • In this study, the total precipitation amount in Jeju Island was estimated with the simulated precipitation for ungauged stations missing precipitation data using the spatial precipitation analysis. The missing data were generated through the modified multiple linear regression in this study, and the analysis of spatial precipitation was conducted with the PRISM(Parameter-elevation Regression on Independent Slope Model). The generated data with modified multiple linear regression model have similar pattern with original data. Thus, the model in this study shows good applicability to estimate the missing data. The difference of annual average precipitation between Case 1 (original data) and Case 2 (modified data) appears very small ratio which is about 1.5%. However, the difference of annual average precipitation according to elevation shows the large ratio up to 37.4%. As the results, the method of estimating missing data in this study would be useful to calculate the total precipitation amount at the low station density area and the places with the high spatial variation of precipitation.

Abnormal air temperature prediction of South Korea using multiple linear regression model and Terra/Aqua MODIS LST (다중 선형회귀모형과 Terra/Aqua MODIS 지표면온도를 활용한 우리나라 이상기온 예측)

  • Chung, Jeehun;Lee, Yonggwan;Lee, Jiwan;Kim, Seongjoon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2019.05a
    • /
    • pp.139-139
    • /
    • 2019
  • 지구 온난화 및 기후변화로 인해 비롯된 전 지구적인 기온 상승은 가뭄, 폭염, 한파 등의 이상 기후 현상을 야기하여 인류의 생존을 위협하는 환경 문제로 대두되고 있다. 이와 같은 기후변화 및 이상기후 현상을 이해하고 파악하기 위해서는 정확하고 상세한 기온 정보가 필수적이다. 우리나라는 기상청에서 전국 590개소의 기상관측장비로 기온 정보를 생산하고 있지만 산림이 약 70%를 차지하는 복잡한 지형을 가지고 있어 지상관측밀도의 공간적 제약이 발생해 상세하고 균일한 기온 정보 생산에 제약이 있다. 이러한 단점을 극복하기 위해 본 연구에서는 위성으로 측정한 지표면 온도(Land Surface Temperature, LST) 자료와 다중선형회귀모형(Multiple Linear Regression Model)을 활용해 두 자료간의 상관관계를 파악하고 지상기온을 예측하고자 한다. 위성자료로 Terra 및 Aqua MODIS 위성의 1000m 공간해상도를 가진 일별 LST자료 MOD11A1, MYD11A1의 Daytime 자료를 각각 2000년부터 2018년까지 총 19년의 기간에 대해 구축하였으며, 전국 92개의 기상청 관측소로부터 최고, 최저 기온 자료를 동 기간에 대해 구축하였다. LST를 이용한 이상기온 예측 알고리즘은 python을 이용해 구현하였으며 예측 결과는 실제 기온 자료를 통해 검증하였다. 또한, 예측 기온 자료의 연대별, 순별(상, 중, 하순) 분석을 실시하고, 2018년 극한 폭염 및 한파(2017년 12월~2018년 2월)의 예측 가능성을 검토하여 연구 결과에 대한 다양한 활용방안을 제시하고자 한다.

  • PDF

Estimation of river water depth using UAV-assisted RGB imagery and multiple linear regression analysis (무인기 지원 RGB 영상과 다중선형회귀분석을 이용한 하천 수심 추정)

  • Moon, Hyeon-Tae;Lee, Jung-Hwan;Yuk, Ji-Moon;Moon, Young-Il
    • Journal of Korea Water Resources Association
    • /
    • v.53 no.12
    • /
    • pp.1059-1070
    • /
    • 2020
  • River cross-section measurement data is one of the most important input data in research related to hydraulic and hydrological modeling, such as flow calculation and flood forecasting warning methods for river management. However, the acquisition of accurate and continuous cross-section data of rivers leading to irregular geometric structure has significant limitations in terms of time and cost. In this regard, a primary objective of this study is to develop a methodology that is able to measure the spatial distribution of continuous river characteristics by minimizing the input of time, cost, and manpower. Therefore, in this study, we tried to examine the possibility and accuracy of continuous cross-section estimation by estimating the water depth for each cross-section through multiple linear regression analysis using RGB-based aerial images and actual data. As a result of comparing with the actual data, it was confirmed that the depth can be accurately estimated within about 2 m of water depth, which can capture spatially heterogeneous relationships, and this is expected to contribute to accurate and continuous river cross-section acquisition.

A Study on Estimation of Soil Moisture Multiple Linear Regression Model Using Conditional Merging and MODIS Land Surface Temperature Data (조건부 합성기법과 MODIS LST를 활용한 토양수분 다중선형 회귀모형 산정 연구)

  • Jung, Chung Gil;Lee, Ji Wan;Kim, Da Rae;Kim, Se Hun;Kim, Seong Joon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2017.05a
    • /
    • pp.103-104
    • /
    • 2017
  • 본 연구에서는 다중회귀분석모형(MLRM)과 MODIS (MODerate resolution Imaging Spectroradiometer) LST (Land Surface Temperature) 자료를 이용하여 전국 공간토양수분을 산정하였다. 공간토양수분을 산정하기 위한 과정은 크게 두가지로 구분된다. 첫 번째로 기존의 MODIS LST 자료를 조건부 합성 보정기법을 적용하여 실측 LST 자료와 비교하여 위성 LST 자료가 갖고 있는 오차를 보정하였다. 그 결과, 조건부 합성 보정기법을 적용하기전 전국 71개 지상 관측지점에서 관측한 실측 LST와 MODIS LST의 R2는 전체 평균 0.70으로 어는정도 유의성 있는 상관관계를 나타냈으나 조건부 합성 보정기법을 적용한 후 실측 LST와 MODIS LST의 R2는 전체 평균 0.92로 상당히 크게 향상됨을 알 수 있었다. 두 번째로 보정된 MODIS LST를 이용하여 다중회귀분석 모형을 개발하고 토양수분을 예측하는 단계로 입력자료로 위성영상 자료와 관측자료를 융합하여 사용하였다. 위성영상 자료로는 보정된 MODIS LST와 MODIS NDV를 구축하였고 일단위 강수량 및 일조시간의 기상자료는 기상청으로부터 전국 68개 지점에 대해 구축하여 IDW 공간보간기법을 이용한 공간자료로 구축하였다. 토양수분 결과를 비교하기 위한 관측 토양수분은 자동농업기상관측(Automated Agriculture Observing System, AAOS)지점에서 2013년 1월부터 2015년 12월까지의 실측 일단위 토양수분 자료를 구축하여 사용하였다. 다중회귀분석 모형은 각각의 입력자료를 독립인자로서 조합하여 12개의 시나리오를 만들었다. 시공간적 경향을 고려하기 위하여 계절별, 토양 토성(soil texture)를 구분하여 회귀분석을 실시하였다. 관측 토양수분과 모의 토양수분을 비교한 결과 $R^2$가 0.80 (철원), 0.90 (춘천), 0.80 (수원), 0.63 (서산), 0.77 (청주), 0.82 (전주), 0.52 (순천), 0.63 (진주), 0.99 (보성)로 높은 상관성을 보였다. 본 연구에서는 토양수분을 예측하기 위한 인자 중 가장 민간함 LST를 보정하지 않는 토양수분 예측 방법은 상당한 오차를 포함하게 되어 실측 토양수분 결과와 크게 차이가 나타남을 보여주었다.

  • PDF

Analysis of Accident Characteristics and Development of Accident Models in the Signalized Intersections of Cheongju and Cheongwon (지방부 신호교차로 사고특성분석 및 모형개발 (청주.청원을 중심으로))

  • Park, Byung-Ho;Yoo, Doo-Seon;Yang, Jeong-Mo;Lee, Young-Min
    • Journal of Korean Society of Transportation
    • /
    • v.26 no.2
    • /
    • pp.35-46
    • /
    • 2008
  • The purposes of this study are to analyze the characteristics and to develop the models of traffic accidents. In pursuing the above, this study gives particular attentions to developing the models(multiple linear, poisson and negative binomial regression) using the data of Cheongju and Cheongwon signalized intersections. The main results analyzed are as follows. First, the accident characteristics of rural area were defined by factor. Second, 4 accident models which are all statistically significant were developed. Finally, such the variables as $X_2$ and $X_{11}$ were evaluated to be specific variables which reflect the characteristics of rural area.

Hadi와 Simonoff의 다중이상점 식별방법의 개선과 여러 다중이상점 식별방법의 효율성 비교

  • 유종영;김현철
    • Communications for Statistical Applications and Methods
    • /
    • v.3 no.3
    • /
    • pp.11-23
    • /
    • 1996
  • 본 연구에서는 선형회귀분석에서 Hadi와 Simonoff의 다중이상점 식별방법을 수정하여 새로운 알고리즘을 제시하였다. Hadi와 Simonoff의 알고리즘 첫 단계에서 이상점일 가능성이 없는 점들의 집합을 추출할 때 가장효과와 편승효과에 영향을 받을 수 있음으로, 이 첫 단계를 수정하였다. 우리는 잔차가 일정한 분산을 갖는 정규분포에 다르다는 가정하에서 잔차의 신뢰구간을 생각하고, 이 구간안에서 잔차의 MAD가 최소인 새로운 모형을 탐색하고, 이를 이상점일 가능성이 없는 점들의 집합을 추출하는데 일용하는 새로운 알로리즘을 제시하였다. 제시된 방법은 실제자료에서 다른 방법에 비해 효율적으로 이상점을 식별할 수 있었다.

  • PDF

The Estimation of Software Development Effort Using Multiple Regression Method (다중회귀 분석을 이용한 소프트웨어 개발노력추정)

  • Jung Hye-Jung;Yang Hae-Sool;Shin Seok-Kyoo;Lee Sang-Un
    • The KIPS Transactions:PartD
    • /
    • v.11D no.7 s.96
    • /
    • pp.1483-1490
    • /
    • 2004
  • To accomplish a project successfuly, we have to estimate develpment effort accurately. But, development effort is different to software size and operation environment. Usually, we made use of function point for estimating development effort. In this paper. we make use of 789 project data. It is related to development projects in 1990`s. We investigate the variable affecting development effort. Also, we exedcute multiple liner regression analysis for looking linear relation about variables. We find the regression equation for multistage by dividing PDR that influ-enced development effort step by step.

Estimation of Cerchar abrasivity index based on rock strength and petrological characteristics using linear regression and machine learning (선형회귀분석과 머신러닝을 이용한 암석의 강도 및 암석학적 특징 기반 세르샤 마모지수 추정)

  • Ju-Pyo Hong;Yun Seong Kang;Tae Young Ko
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.26 no.1
    • /
    • pp.39-58
    • /
    • 2024
  • Tunnel Boring Machines (TBM) use multiple disc cutters to excavate tunnels through rock. These cutters wear out due to continuous contact and friction with the rock, leading to decreased cutting efficiency and reduced excavation performance. The rock's abrasivity significantly affects cutter wear, with highly abrasive rocks causing more wear and reducing the cutter's lifespan. The Cerchar Abrasivity Index (CAI) is a key indicator for assessing rock abrasivity, essential for predicting disc cutter life and performance. This study aims to develop a new method for effectively estimating CAI using rock strength, petrological characteristics, linear regression, and machine learning. A database including CAI, uniaxial compressive strength, Brazilian tensile strength, and equivalent quartz content was created, with additional derived variables. Variables for multiple linear regression were selected considering statistical significance and multicollinearity, while machine learning model inputs were chosen based on variable importance. Among the machine learning prediction models, the Gradient Boosting model showed the highest predictive performance. Finally, the predictive performance of the multiple linear regression analysis and the Gradient Boosting model derived in this study were compared with the CAI prediction models of previous studies to validate the results of this research.

A Propose on Seismic Performance Evaluation Model of Slope using Artificial Neural Network Technique (인공신경망 기법을 이용한 사면의 내진성능평가 모델 제안)

  • Kwag, Shinyoung;Hahm, Daegi
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.32 no.2
    • /
    • pp.93-101
    • /
    • 2019
  • The objective of this study is to develop a model which can predict the seismic performance of the slope relatively accurately and efficiently by using artificial neural network(ANN) technique. The quantification of such the seismic performance of the slope is not easy task due to the randomness and the uncertainty of the earthquake input and slope model. Under these circumstances, probabilistic seismic fragility analyses of slope have been carried out by several researchers, and a closed-form equation for slope seismic performance was proposed through a multiple linear regression analysis. However, a traditional statistical linear regression analysis has shown a limit that cannot accurately represent the nonlinearistic relationship between the slope of various conditions and seismic performance. In order to overcome these problems, in this study, we attempted to apply the ANN to generate prediction models of the seismic performance of the slope. The validity of the derived model was verified by comparing this with the conventional multi-linear and multi-nonlinear regression models. As a result, the models obtained through the ANN basically showed excellent performance in predicting the seismic performance of the slope, compared to the models obtained by the statistical regression analyses of the previous study.