• Title/Summary/Keyword: 단계적 회귀분석

Search Result 913, Processing Time 0.033 seconds

A Comparative Analysis of the Forecasting Performance of Coal and Iron Ore in Gwangyang Port Using Stepwise Regression and Artificial Neural Network Model (단계적 회귀분석과 인공신경망 모형을 이용한 광양항 석탄·철광석 물동량 예측력 비교 분석)

  • Cho, Sang-Ho;Nam, Hyung-Sik;Ryu, Ki-Jin;Ryoo, Dong-Keun
    • Journal of Navigation and Port Research
    • /
    • v.44 no.3
    • /
    • pp.187-194
    • /
    • 2020
  • It is very important to forecast freight volume accurately to establish major port policies and future operation plans. Thus, related studies are being conducted because of this importance. In this paper, stepwise regression analysis and artificial neural network model were analyzed to compare the predictive power of each model on Gwangyang Port, the largest domestic port for coal and iron ore transportation. Data of a total of 121 months J anuary 2009-J anuary 2019 were used. Factors affecting coal and iron ore trade volume were selected and classified into supply-related factors and market/economy-related factors. In the stepwise regression analysis, the tonnage of ships entering the port, coal price, and dollar exchange rate were selected as the final variables in case of the Gwangyang Port coal volume forecasting model. In the iron ore volume forecasting model, the tonnage of ships entering the port and the price of iron ore were selected as the final variables. In the analysis using the artificial neural network model, trial-and-error method that various Hyper-parameters affecting the performance of the model were selected to identify the most optimal model used. The analysis results showed that the artificial neural network model had better predictive performance than the stepwise regression analysis. The model which showed the most excellent performance was the Gwangyang Port Coal Volume Forecasting Artificial Neural Network Model. In comparing forecasted values by various predictive models and actually measured values, the artificial neural network model showed closer values to the actual highest point and the lowest point than the stepwise regression analysis.

Development of Variable Selection Technique using Stepwise Regression and Data Envelopment Analysis (단계적 회귀법과 자료봉합분석을 이용한 변수선택기법의 개발)

  • Jeong, Min-Eui;Yu, Song-Jin
    • Journal of KIISE:Software and Applications
    • /
    • v.41 no.8
    • /
    • pp.598-604
    • /
    • 2014
  • In this paper, we develop stepwise regression data envelopment model to select important variables. We formulate null hypothesis to understand the importance of each variable and use Kruskal-Wallis test for this purpose. If the Kruskal-Wallis test does reject the null hypothesis this will imply there is significant fluctuation in the efficiency score relative to base model. And therefore we have to further check the pair of variables that causes the fluctuation in order to determine its importance using Conover-Inman test. The proposed models helps understand the extent of misclassification decision making units as efficient/inefficient when variables are retained or discarded alongside provides useful managerial prescription to make improvement strategies.

A Study on Estimation of Lowflow Ungauged Basin Using Multiple Regression Analysis (다중회귀분석을 이용한 미계측 유역의 갈수유량 산정에 관한 연구)

  • Lim, Ga Kyun;Jeung, Se Jin;Kim, Byung Sik
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2020.06a
    • /
    • pp.133-133
    • /
    • 2020
  • 갈수량이란 1년 중 355일은 유지되는 유량을 말하며 물 공급 계획 및 관리, 저수지 설계, 관개용수의 수량과 수질 관리, 생태계 보존 등에 있어서 갈수량의 크기와 빈도를 파악하는 것은 매우 중요한 과정이다. 갈수량 산정을 위해서는 오랜 기간의 관측 일유량 자료가 필요하지만 우리나라의 경우 관측 유량 자료의 결측자료가 많아 갈수량 산정에 필요한 장기간의 자료가 부족하다. 따라서 본 연구에서는 전국 40개 중권역 유역을 대상으로 갈수 빈도별 갈수량 산정 회귀식 개발을 수행하였다. 갈수량 산정에 적용할 수 있는 18개의 유역인자와 4개의 수문 인자를 상관분석을 통해 다중공선성을 고려하였으며 상관분석 결과를 토대로 미계측 유역에 적용 가능한 인자를 선정하였다. 갈수 빈도 분석과 단계적 회귀분석을 통하여 미계측 유역에 적용할 수 있는 갈수 빈도별 갈수량 산정 회귀식을 개발하였다. 또한 계측 유역을 미계측 유역으로 가정하여 개발된 갈수량 산정 회귀식을 이용하여 갈수량을 산정하고 분석 결과와 실제 갈수량을 비교하여 개발된 회귀식의 적정성을 검토하였다.

  • PDF

Trip Generation Model based on Geographically Weighted Regression (공간가중회귀분석을 이용한 통행발생모형)

  • Kim, Jin-Hui;Park, Il-Seop;Jeong, Jin-Hyeok
    • Journal of Korean Society of Transportation
    • /
    • v.29 no.2
    • /
    • pp.101-109
    • /
    • 2011
  • In most of the urbanized cities, socio-economic attributes tend to cluster as patterns of similarity in space, namely spatial autocorrelation, by agglomeration forces. The classical linear regression model, the most frequently adopted in the trip generation step, cannot sufficiently represent this effect. In order to take into account the effect properly, we need a model which adequately deals with the spatial dependence patterns. In this study, the Geographically Weighted Regression (GWR) model is adopted as an alternative method for the local analysis of relationships in multivariate data sets; that is GWR extends this traditional regression framework by estimating local rather than global parameters. This study shows the existence of spatial effects in the production and attraction of home base/non-home based trips through the GWR model using travel data collected in Daegu metropolitan area. Furthermore, LISA is employed to verify the fact that the local spatial autocorrelation exists.

A study on equating method based on regression analysis (회귀분석에 기초한 균등화 방법에 관한 연구)

  • Cho, Jang-Sik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.3
    • /
    • pp.513-521
    • /
    • 2010
  • Most of universities have carried out course evaluation to apply the performance appraisal for professor. But, course evaluation depends on characteristics of each class such as class size, type of lecture, evaluator's grade and so on. As the results, such characteristics of each class lead to serious bias which makes lecturers distrust the course evaluation results. Hence, we propose a equating method for the course evaluation by regression analysis which use stepwise variable selection. And we compare proposed method with the other method by Cho et al. (2009) with respect to efficiencies. Also we give the example to which the method is applied.

Development of Multiple Linear Regression Model to Predict Agricultural Reservoir Storage based on Naive Bayes Classification and Weather Forecast Data (나이브 베이즈 분류와 기상예보자료 기반의 농업용 저수지 저수율 전망을 위한 저수율 예측 다중선형 회귀모형 개발)

  • Kim, Jin Uk;Jung, Chung Gil;Lee, Ji Wan;Kim, Seong Joon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2018.05a
    • /
    • pp.112-112
    • /
    • 2018
  • 최근 이상기후로 인한 국부적인 혹은 광역적인 가뭄이 빈번하게 발생하고 있는 추세이며 발생횟수 뿐 아니라 가뭄 심도 및 지속기간이 과거보다 크게 증가하여 그에 따른 피해가 커질 것으로 예측되고 있다. 특히, 2014~2015년도의 유례없는 가뭄으로 인해 저수지 용수공급이 제한되면서 많은 농가들이 피해를 입었다. 본 연구의 목적은 전국 농업용 저수지를 대상으로 기상청 3개월 예보자료를 활용 할 수 있는 농업용 저수지 저수율 다중선형 회귀 모형을 개발하여 저수율 전망정보를 생산하는 것이다. 본 연구에서는 전국에 적용 가능한 저수율 다중선형 회귀 모형개발을 위해 5개의 기상요소(강수량, 최고기온, 최저기온, 평균기온, 평균풍속)와 관측 저수지 저수율을 활용했다. 기상자료는 2002년부터 2017년까지의 기상청 63개 지상관측소로부터 기상관측자료를 수집하였다. 본 연구에서는 저수율 전망 단계를 세 단계로 나누었다. 첫 번째 단계로 농어촌공사에서 전국 511개 용수구역을 대상으로 군집분석 및 의사결정나무 분석을 통해 제시한 65개 대표저수지를 대상으로 기상자료 및 관측 저수율 자료를 이용하여 다중선형 회귀분석을 실시하였다. 수집한 기상요소와 저수율을 독립변수로 하여 월별 회귀식을 산정한 결과 결정계수($R^2$)는 0.51~0.95로 나타났다. 두 번째 단계로 대표저수지의 회귀분석 결과를 전국의 저수지로 확대하기 위해 나이브 베이즈 분류법을 적용하여 전국 3098개의 저수지를 65의 군집으로 분류하고 각각의 군집에 해당되는 월별 회귀식을 산정하였다. 마지막으로 전국 저수지로 산정된 회귀식과 농업 가뭄 예측을 위해 기상청의 GS5(Global Seasonal Forecasting System 5) 3개월 예보자료를 수집하여 회귀식에 적용해 2017년 전국 저수지의 3개월 저수율 전망정보를 생산하였다. 본 연구의 전국 저수지 군집결과 기반의 저수율 전망기술은 2017년도 관측 저수율과 비교한 결과 유의한 상관성을 나타냈으며 이 결과는 추후 농업용 저수지의 물 공급 및 농업가뭄 전망 자료로서 이용이 가능할 것으로 판단된다.

  • PDF

An Analysis on Determinants of the Capesize Freight Rate and Forecasting Models (케이프선 시장 운임의 결정요인 및 운임예측 모형 분석)

  • Lim, Sang-Seop;Yun, Hee-Sung
    • Journal of Navigation and Port Research
    • /
    • v.42 no.6
    • /
    • pp.539-545
    • /
    • 2018
  • In recent years, research on shipping market forecasting with the employment of non-linear AI models has attracted significant interest. In previous studies, input variables were selected with reference to past papers or by relying on the intuitions of the researchers. This paper attempts to address this issue by applying the stepwise regression model and the random forest model to the Cape-size bulk carrier market. The Cape market was selected due to the simplicity of its supply and demand structure. The preliminary selection of the determinants resulted in 16 variables. In the next stage, 8 features from the stepwise regression model and 10 features from the random forest model were screened as important determinants. The chosen variables were used to test both models. Based on the analysis of the models, it was observed that the random forest model outperforms the stepwise regression model. This research is significant because it provides a scientific basis which can be used to find the determinants in shipping market forecasting, and utilize a machine-learning model in the process. The results of this research can be used to enhance the decisions of chartering desks by offering a guideline for market analysis.

A Comparative Study on the Genetic Algorithm and Regression Analysis in Urban Population Surface Modeling (도시인구분포모형 개발을 위한 GA모형과 회귀모형의 적합성 비교연구)

  • Choei, Nae-Young
    • Spatial Information Research
    • /
    • v.18 no.5
    • /
    • pp.107-117
    • /
    • 2010
  • Taking the East-Hwasung area as the case, this study first builds gridded population data based on the municipal population survey raw data, and then measures, by way of GIS tools, the major urban spatial variables that are thought to influence the composition of the regional population. For the purpose of comparison, the urban models based on the Genetic Algorithm technique and the regression technique are constructed using the same input variables. The findings indicate that the GA output performed better in differentiating the effective variables among the pilot model variables, and predicted as much consistent and meaningful coefficient estimates for the explanatory variables as the regression models. The study results indicate that GA technique could be a very useful and supplementary research tool in understanding the urban phenomena.

A Development of Formula on Time of Concentration and Storage Constant in Sumjin River Basin (섬진강 유역의 도달시간 및 저류상수 산정공식 개발)

  • 이신재;박양래;김명수;박상우
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2004.05b
    • /
    • pp.1193-1197
    • /
    • 2004
  • 본 연구는 강우에 내한 유역의 반응시간에 관한 연구로써 우리나라 자연하천유역에 적합한 도달시간 및 저류상수 산정공식을 개발하기 위하여 섬진강 유역을 대상으로 유역특성인자 및 강우 특성인자를 분석하고, 이를 다중회귀분석방범 중 최적의 회귀모형을 추출하기 위한 단계별 회귀분석방법을 이용하여 산정공식을 개발하였다. 그리고 개발된 산정공식으로부터의 도달시간 및 저류 상수들을 기존 경험공식의 값들과 비교하였으면, 또한 이를 Clark 모형에 적용하여 실제 호우사상들에 대한 유출수문곡선을 분석하여 관측수문곡선과 비교 검토하였다. 그 걸과 계산된 유출수문곡선과 관측수문곡선은 첨두유량 및 첨두발생시간에서 비교적 적은 오차를 보였으며, 유출수문곡선의 양상에서도 상호 높은 상관성을 보여 개발된 산정공식에 대한 적합성을 잘 나타내주고 있다.

  • PDF

The Effects of School Climate on Peer Victimization for Junior High School Students (학교분위기가 중학생의 또래폭력 피해경험에 미치는 영향)

  • Kim, Eun-Young
    • Journal of the Korean Society of Child Welfare
    • /
    • no.26
    • /
    • pp.87-111
    • /
    • 2008
  • The purpose of this study is to evaluate the actual conditions of peer victimization and to examine how the various factors of school climate influence peer victimization. Analysis on the relationship between various school climate and peer victimization has not been yet dealt with in Korea. Participants in this study were middle school students chosen from 11 middle schools in Seoul, by convenience sampling. A total of 1,204 surveys were then analyzed. Methods for analysis included Frequencies, Descriptives, Pearson's Correlation, Hierarchical Regression. From the result of the analysis, the level of verbal violence came out to be a relatively high form of peer victimization. The hierarchical regression were conducted in two steps. The second model's descriptive variable was higher by 19.6% than the first model. The variables of interaction between teacher and student in peer violence(${\beta}=.130$), of school facility maintenance(${\beta}=.067$), of safety of school environment(${\beta}=.331$), and economic status and sex out of controlled variables were proved to be of significance, and those variables explained 23.0% of the entire model. Based on the results of this study, practical and effective policy solutions to improve the school climate better have been suggested.