• Title/Summary/Keyword: exponential smoothing method


Improving SARIMA model for reliable meteorological drought forecasting

  • Jehanzaib, Muhammad;Shah, Sabab Ali;Son, Ho Jun;Kim, Tae-Woong
    • Proceedings of the Korea Water Resources Association Conference / 2022.05a / pp.141-141 / 2022
  • Drought is a global phenomenon that affects almost all landscapes and causes major damage. Due to the non-linear nature of the contributing factors, drought occurrence and severity are characterized as stochastic. Early warning of impending drought can aid in the development of drought mitigation strategies and measures. Thus, drought forecasting is crucial in the planning and management of water resource systems. The primary objective of this study is to improve existing drought forecasting techniques. Therefore, we propose an improved version of the Seasonal Autoregressive Integrated Moving Average (SARIMA) model, MD-SARIMA, for reliable drought forecasting with a three-year lead time. We selected four watersheds of the Han River basin in South Korea to validate the performance of the MD-SARIMA model. Meteorological data from 8 rain gauge stations were collected for the period 1973-2016 and converted to the watershed scale using the Thiessen polygon method. The Standardized Precipitation Index (SPI) was employed to represent meteorological drought at the seasonal (3-month) time scale. The performance of the MD-SARIMA model was compared with existing models: the Seasonal Naive Bayes (SNB) model, the Exponential Smoothing (ES) model, the TBATS model (Trigonometric seasonality, Box-Cox transformation, ARMA errors, Trend and Seasonal components), and the SARIMA model. The results showed that all the models were able to forecast drought, but MD-SARIMA was more robust than the other statistical models, with Willmott Index (WI) = 0.86, Mean Absolute Error (MAE) = 0.66, and Root Mean Square Error (RMSE) = 0.80 for the 36-month lead time forecast. The outcomes of this study indicate that the MD-SARIMA model can be utilized for drought forecasting.
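
The three scores quoted in this abstract (WI, MAE, RMSE) can be computed from paired observed and forecast SPI values. The sketch below is only illustrative: the abstract does not say which form of the Willmott index was used, so the original index of agreement is assumed, and the function names are hypothetical.

```python
import math

def mae(obs, pred):
    """Mean absolute error between observed and forecast values."""
    return sum(abs(p - o) for o, p in zip(obs, pred)) / len(obs)

def rmse(obs, pred):
    """Root mean square error between observed and forecast values."""
    return math.sqrt(sum((p - o) ** 2 for o, p in zip(obs, pred)) / len(obs))

def willmott_index(obs, pred):
    """Willmott index of agreement (assumed form): 1 is a perfect match."""
    mean_obs = sum(obs) / len(obs)
    num = sum((p - o) ** 2 for o, p in zip(obs, pred))
    den = sum((abs(p - mean_obs) + abs(o - mean_obs)) ** 2
              for o, p in zip(obs, pred))
    return 1.0 - num / den
```

Lower MAE and RMSE and a WI closer to 1 indicate a better forecast, which is how the reported values 0.66, 0.80, and 0.86 should be read.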


A Study on Forecasting Industrial Land Considering Leading Economic Variable Using ARIMA-X (선행경제변수를 고려한 산업용지 수요예측 방법 연구)

  • Byun, Tae-Geun;Jang, Cheol-Soon;Kim, Seok-Yun;Choi, Sung-Hwan;Lee, Sang-Ho
    • The Journal of the Korea Contents Association / v.22 no.1 / pp.214-223 / 2022
  • The purpose of this study is to present a new industrial land demand prediction method that can consider external economic factors. The analysis uses ARIMA-X, a model that can incorporate exogenous variables. The exogenous variables consist of macroeconomic variables, the Business Survey Index, and Composite Economic Index variables, chosen to reflect the economic and industrial structure. Among these, only variables that precede the supply of industrial land are used for prediction. The variables found to lead the supply of industrial land were imports, private and government consumption expenditure, total capital formation, the economic sentiment index, the producer's shipment index, machinery for domestic demand, and the composite leading index. When the ARIMA-X model was estimated using these variables, the ARIMA-X(1,1,0) model including only imports was found to be statistically significant. Industrial land demand from 2021 to 2030 was then forecast under a scenario of change in imports. As a result, future demand for industrial land was predicted to increase by 1.91% annually to 1,030.79 km². Compared with the existing exponential smoothing method, the results of this study were found to be more suitable. The method is expected to be available as a new industrial land forecasting model.
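
For readers unfamiliar with the exponential smoothing baseline mentioned in this abstract, the following is a minimal sketch of simple exponential smoothing. The abstract does not say which smoothing variant or smoothing constant was used, so the simple (level-only) form, the function name, and the `alpha` parameter are all assumptions made for illustration.

```python
def simple_exponential_smoothing(series, alpha):
    """Return the one-step-ahead forecast from simple exponential smoothing.

    The level starts at the first observation and is updated as
    level = alpha * y + (1 - alpha) * level for each later observation.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    level = series[0]
    for y in series[1:]:
        level = alpha * y + (1.0 - alpha) * level
    return level
```

A larger `alpha` weights recent observations more heavily; with `alpha = 1` the forecast is simply the last observed value.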

A Case Study on Crime Prediction using Time Series Models (시계열 모형을 이용한 범죄예측 사례연구)

  • Joo, Il-Yeob
    • Korean Security Journal / no.30 / pp.139-169 / 2012
  • The purpose of this study is to contribute to establishing scientific policing policies by deriving time series models that can forecast the occurrence of major crimes such as murder, robbery, burglary, rape, and violence, and by identifying the occurrence of major crimes using those models. To this end, the monthly incidence of major crimes from 2002 to 2010 was analyzed using IBM PASW(SPSS) 19.0: Generation of Time Series Model(C) was used to identify the forecasting models, and Generation of Time Series Model(C) together with Sequential Chart of Time Series(N) was used to assess their accuracy. The results are as follows. First, the forecasting models for murder, robbery, rape, theft, and violence are Simple Season, Winters Multiplicative, ARIMA(0,1,1)(0,1,1), ARIMA(1,1,0)(0,1,1), and Simple Season, respectively. Second, the time series forecasting models make it possible to forecast the short-term occurrence of major crimes such as murder, robbery, burglary, rape, and violence. Based on these results, further time series forecasting models should continue to be proposed, and long-term forecasting models based on the quarterly and yearly incidence of major crimes deserve attention.


The Effect of Data Size on the k-NN Predictability: Application to Samsung Electronics Stock Market Prediction (데이터 크기에 따른 k-NN의 예측력 연구: 삼성전자주가를 사례로)

  • Chun, Se-Hak
    • Journal of Intelligence and Information Systems / v.25 no.3 / pp.239-251 / 2019
  • Statistical methods such as moving averages, Kalman filtering, exponential smoothing, regression analysis, and ARIMA (autoregressive integrated moving average) have been used for stock market predictions. However, these statistical methods have not produced superior performance. In recent years, machine learning techniques have been widely used in stock market predictions, including artificial neural networks, SVMs, and genetic algorithms. In particular, a case-based reasoning method known as k-nearest neighbor is also widely used for stock price prediction. Case-based reasoning retrieves several similar cases from previous cases when a new problem occurs, and combines the class labels of the similar cases to create a classification for the new problem. However, case-based reasoning has some problems. First, it tends to search for a fixed number of neighbors in the observation space and always selects the same number of neighbors rather than the most similar neighbors for the target case. As a result, it may have to take more cases into account even when fewer cases are applicable, depending on the subject. Second, case-based reasoning may select neighbors that are far away from the target case. Thus, it does not guarantee an optimal pseudo-neighborhood for various target cases, and predictability can be degraded by deviation from the desired similar neighbors. This paper examines how the size of the learning data affects stock price predictability with k-nearest neighbor, and compares the predictability of k-nearest neighbor with the random walk model according to the size of the learning data and the number of neighbors. In this study, Samsung Electronics stock prices were predicted by dividing the learning dataset into two types. For the prediction of the next day's closing price, we used four variables: opening value, daily high, daily low, and daily close.
In the first experiment, data from January 1, 2000 to December 31, 2017 were used for the learning process. In the second experiment, data from January 1, 2015 to December 31, 2017 were used for the learning process. The test data are from January 1, 2018 to August 31, 2018 for both experiments. We compared the performance of k-NN with the random walk model using the two learning datasets. The mean absolute percentage error (MAPE) was 1.3497 for the random walk model and 1.3570 for k-NN in the first experiment, when the learning data was small. However, the MAPE was 1.3497 for the random walk model and 1.2928 for k-NN in the second experiment, when the learning data was large. These results show that prediction power is higher when more learning data are used than when less learning data are used. This paper also shows that k-NN generally produces better predictive power than the random walk model for larger learning datasets, and does not when the learning dataset is relatively small. Future studies need to consider macroeconomic variables related to stock price forecasting alongside the opening, low, high, and closing prices. Also, to produce better results, it is recommended that k-nearest neighbor find nearest neighbors using a second-step filtering method that considers fundamental economic variables, together with a sufficient amount of learning data.
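
The comparison described in this abstract can be sketched as follows: a random walk forecast uses today's close as tomorrow's prediction, a k-NN forecast averages the next-day closes of the k most similar past days, and both are scored with MAPE. This is a minimal illustration only. The Euclidean distance, the unweighted average, and all function names are assumptions; the abstract does not specify the paper's actual k-NN configuration.

```python
import math

def mape(actual, forecast):
    """Mean absolute percentage error, in percent."""
    return 100.0 * sum(abs(a - f) / abs(a)
                       for a, f in zip(actual, forecast)) / len(actual)

def random_walk_forecast(closes):
    """Forecast for each next day is the previous day's close."""
    return closes[:-1]

def knn_forecast(train_x, train_y, query, k):
    """Average the targets of the k training rows nearest to `query`.

    train_x rows are assumed to be (open, high, low, close) tuples and
    train_y the matching next-day closes; distance is Euclidean (assumed).
    """
    order = sorted(range(len(train_x)),
                   key=lambda i: math.dist(train_x[i], query))
    nearest = order[:k]
    return sum(train_y[i] for i in nearest) / k
```

For the random walk, the forecasts `closes[:-1]` line up with the actual values `closes[1:]`, so `mape(closes[1:], random_walk_forecast(closes))` gives the baseline score against which a k-NN forecast would be compared.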