• Title/Summary/Keyword: Time-series Model

A Pre-processing Process Using TadGAN-based Time-series Anomaly Detection (TadGAN 기반 시계열 이상 탐지를 활용한 전처리 프로세스 연구)

  • Lee, Seung Hoon;Kim, Yong Soo
    • Journal of Korean Society for Quality Management / v.50 no.3 / pp.459-471 / 2022
  • Purpose: The purpose of this study was to establish a pre-processing process that increases prediction accuracy for anomaly intervals identified using an artificial intelligence-based time series anomaly detection technique. Methods: Significant variables were extracted by applying feature selection techniques, and anomalies were derived using the TadGAN time series anomaly detection algorithm. After applying machine learning and deep learning methodologies to normal section data (excluding anomaly sections), the explanatory power of the anomaly sections was demonstrated through performance comparison. Results: Among the machine learning methods, performance was best when SHAP and TadGAN were applied; among the deep learning methods, performance was best when the Chi-square Test and TadGAN were applied. Compared with papers that applied a conventional methodology to the same data, performance improved significantly: 15% for MLR, 24% for Random Forest, 30% for XGBoost, 73% for Lasso Regression, 17% for LSTM, and 19% for GRU. Conclusion: When detecting anomalies by unsupervised learning in data that are not actually labeled, in fields such as cyber security, finance, behavior patterns, and SNS, the proposed process is expected to demonstrate the accuracy and explainability of the detected anomaly sections and to improve model performance.

The forecasting evaluation of the high-order mixed frequency time series model to the marine industry (고차원 혼합주기 시계열모형의 해운경기변동 예측력 검정)

  • KIM, Hyun-sok
    • The Journal of shipping and logistics / v.35 no.1 / pp.93-109 / 2019
  • To forecast the maritime economy with a mixed-frequency model, this study applied the statistically significant factors from the existing nonlinear long-run equilibrium relation analysis to the short-run model. Out-of-sample forecasts from the most common univariate AR(1) model are compared with those from the mixed-frequency model using the root mean squared forecasting error, and the predictive power of the mixed-frequency approach is confirmed to be better than that of the AR(1) model. The empirical results suggest that the new high-order mixed-frequency model is useful for forecasting the marine industry. This is consistent with the view that including more information, such as higher-frequency data, in the long-run equilibrium framework is likely to improve the forecasting power of short-run models in multivariate time series analysis.
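
The AR(1) benchmark and the root mean squared forecasting error (RMSFE) mentioned in this abstract can be sketched as follows. This is a minimal illustration, not the paper's procedure: the function names `fit_ar1` and `rolling_rmsfe`, the expanding-window scheme, and the simulated series are all assumptions made for the example.

```python
import numpy as np

def fit_ar1(y):
    """OLS fit of y_t = c + phi * y_{t-1} + e_t; returns (c, phi)."""
    y = np.asarray(y, dtype=float)
    X = np.column_stack([np.ones(len(y) - 1), y[:-1]])
    c, phi = np.linalg.lstsq(X, y[1:], rcond=None)[0]
    return c, phi

def rolling_rmsfe(y, n_train):
    """One-step-ahead out-of-sample RMSFE with an expanding window (assumed scheme)."""
    y = np.asarray(y, dtype=float)
    errors = []
    for t in range(n_train, len(y)):
        c, phi = fit_ar1(y[:t])          # re-estimate using data up to t-1 only
        errors.append(y[t] - (c + phi * y[t - 1]))
    return float(np.sqrt(np.mean(np.square(errors))))

# Simulated AR(1) data (hypothetical, not the shipping data from the paper).
rng = np.random.default_rng(0)
y = np.zeros(300)
for t in range(1, 300):
    y[t] = 0.5 + 0.8 * y[t - 1] + rng.normal(scale=0.1)

c_hat, phi_hat = fit_ar1(y)
rmsfe = rolling_rmsfe(y, n_train=200)
print(f"phi_hat={phi_hat:.3f}, RMSFE={rmsfe:.3f}")
```

A competing model would be evaluated over the same out-of-sample window, and the model with the lower RMSFE would be preferred.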

Time series analysis for Korean COVID-19 confirmed cases: HAR-TP-T model approach (한국 COVID-19 확진자 수에 대한 시계열 분석: HAR-TP-T 모형 접근법)

  • Yu, SeongMin;Hwang, Eunju
    • The Korean Journal of Applied Statistics / v.34 no.2 / pp.239-254 / 2021
  • This paper studies estimation and forecasting for Korean COVID-19 confirmed cases, based on a heterogeneous autoregressive (HAR) model with two-piece t (TP-T) distributed errors. We consider HAR-TP-T time series models and suggest a step-by-step method to estimate the HAR coefficients as well as the TP-T distribution parameters. In the proposed step-by-step estimation, the ordinary least squares method is used to estimate the HAR coefficients, while maximum likelihood estimation (MLE) is adopted to estimate the TP-T error parameters. A simulation study of the step-by-step method shows good performance. For the empirical analysis of the Korean COVID-19 confirmed cases, estimates in the HAR-TP-T models of order p = 2, 3, 4 are computed for several selected lags, which include the optimal lags chosen by minimizing the mean squared errors of the models. The estimation results from the proposed method and from MLE alone are compared using several criteria. The proposed step-by-step method outperforms MLE in two respects: the mean squared error of the HAR model and the mean squared difference between the TP-T residuals and their densities. Moreover, forecasting for the Korean COVID-19 confirmed cases is discussed with the optimally selected HAR-TP-T model. The mean absolute percentage error of one-step-ahead out-of-sample forecasts is evaluated as 0.0953% in the proposed model. We conclude that the proposed HAR-TP-T time series model with optimally selected lags and its step-by-step estimation provide accurate forecasting performance for the Korean COVID-19 confirmed cases.
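
The first step of the step-by-step estimation, fitting the HAR coefficients by ordinary least squares, might look like the sketch below. It assumes the usual HAR form, in which each regressor is the average of the previous h observations for a chosen lag h. The names `har_design` and `fit_har_ols` and the default lags are hypothetical, and the second step (MLE of the TP-T parameters on the residuals) is not shown.

```python
import numpy as np

def har_design(y, lags):
    """Build the HAR regressors: an intercept plus, for each lag h,
    the mean of the previous h observations y_{t-h}, ..., y_{t-1}."""
    y = np.asarray(y, dtype=float)
    h_max = max(lags)
    rows = [[1.0] + [y[t - h:t].mean() for h in lags]
            for t in range(h_max, len(y))]
    return np.array(rows), y[h_max:]

def fit_har_ols(y, lags=(1, 7, 14)):
    """OLS estimate of the HAR coefficients; the lags are assumed values."""
    X, target = har_design(y, lags)
    beta = np.linalg.lstsq(X, target, rcond=None)[0]
    residuals = target - X @ beta   # these would feed the TP-T MLE step
    return beta, residuals

# Tiny check on a known series: y_t = y_{t-1} + 1, so lags=(1,) fits exactly.
beta_demo, resid_demo = fit_har_ols(np.arange(10.0), lags=(1,))
print(beta_demo)
```

Selecting the lags by minimizing the model's mean squared error, as the abstract describes, would amount to calling `fit_har_ols` over candidate lag sets and comparing `np.mean(residuals**2)`.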

A Study on the Data Driven Neural Network Model for the Prediction of Time Series Data: Application of Water Surface Elevation Forecasting in Hangang River Bridge (시계열 자료의 예측을 위한 자료 기반 신경망 모델에 관한 연구: 한강대교 수위예측 적용)

  • Yoo, Hyungju;Lee, Seung Oh;Choi, Seohye;Park, Moonhyung
    • Journal of Korean Society of Disaster and Security / v.12 no.2 / pp.73-82 / 2019
  • Recently, as sudden floods due to climate change have become more frequent, flood damage to riverside social infrastructure has grown and the threat of overflow has increased. Administrators therefore need rapid predictions of potential flooding of riverside social infrastructure. However, most current flood forecasting models, including hydraulic models, offer highly accurate numerical results but require long simulation times. To alleviate this limitation, data-driven models using artificial neural networks have been widely used. However, the existing models cannot consider time-series parameters. In this study, the water surface elevation at the Hangang River bridge was predicted using the NARX model, which considers time-series parameters. The results of the ANN and RNN models are compared with those of the NARX model to determine the suitability of the NARX model. Of the 10 years of hydrological data from 2009 to 2018, 70% were used for training and 15% each for testing and evaluation. In predicting the water surface elevation at the Hangang River bridge 3 hours ahead in 2018, the ANN, RNN, and NARX models gave RMSE values of 0.20 m, 0.11 m, and 0.09 m, MAE values of 0.12 m, 0.06 m, and 0.05 m, and peak errors of 1.56 m, 0.55 m, and 0.10 m, respectively. Based on the prediction errors, the NARX model, which considers the time-series parameters, is the most suitable for predicting water surface elevation. This is because the NARX model can learn the trend of the time series data and, by using the hyperbolic tangent and Rectified Linear Unit functions as activation functions, can produce accurate predictions even at high water surface elevations. However, the NARX model suffers from vanishing gradients as the sequence length becomes longer. In the future, the accuracy of the water surface elevation prediction will be examined using the LSTM model.
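
The three error measures reported in this abstract can be computed as in the sketch below. RMSE and MAE follow their standard definitions. The abstract does not define "peak error", so the sketch assumes it is the absolute difference between the observed and predicted maximum water levels. The function name `elevation_errors` and the sample values are hypothetical.

```python
import numpy as np

def elevation_errors(observed, predicted):
    """Return RMSE, MAE, and peak error (all in the units of the inputs).

    Peak error is ASSUMED here to be |max(observed) - max(predicted)|;
    the source abstract does not state its definition.
    """
    obs = np.asarray(observed, dtype=float)
    pred = np.asarray(predicted, dtype=float)
    diff = pred - obs
    rmse = float(np.sqrt(np.mean(diff ** 2)))
    mae = float(np.mean(np.abs(diff)))
    peak = float(abs(obs.max() - pred.max()))
    return rmse, mae, peak

# Hypothetical water levels in metres, not data from the paper.
rmse, mae, peak = elevation_errors([1.0, 2.0, 3.0, 4.0], [1.0, 2.0, 3.0, 2.0])
print(rmse, mae, peak)
```

Reporting the peak error separately is useful for flood forecasting because a model can have a low average error while still underestimating the highest water level.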

The Development of the Short-Term Predict Model for Solar Power Generation (태양광발전 단기예측모델 개발)

  • Kim, Kwang-Deuk
    • Journal of the Korean Solar Energy Society / v.33 no.6 / pp.62-69 / 2013
  • This paper proposes a solar power generation forecast model that uses data from the building-integrated renewable energy monitoring system of the Korea Institute of Energy Research. A database was built from the monitoring data of the real-time integrated renewable energy monitoring system, and it was structured so that correlations with weather conditions could be studied. Models to forecast solar power were developed by applying data mining and time series analysis to weather-forecast cloud cover data, generation data, and solar radiation data. Securing weather forecast data is important for developing a solar power forecast model. To this end, the three-hourly, three-day forecast data (including the current day) provided on the KMA (Korea Meteorological Administration) site were used. To verify accuracy, each forecast was compared with actual generation, and the model's applicability to a real environment was analyzed.

MAGRU: Multi-layer Attention with GRU for Logistics Warehousing Demand Prediction

  • Ran Tian;Bo Wang;Chu Wang
    • KSII Transactions on Internet and Information Systems (TIIS) / v.18 no.3 / pp.528-550 / 2024
  • Warehousing demand prediction is an essential part of the supply chain, providing a fundamental basis for product manufacturing, replenishment, warehouse planning, etc. Existing forecasting methods cannot produce accurate forecasts because warehouse demand is affected by external factors such as holidays and seasons. Some aspects, such as consumer psychology and producer reputation, are challenging to quantify. The data can fluctuate widely or show no obvious trend or cycle. We introduce a new model for warehouse demand prediction called MAGRU, which stands for Multi-layer Attention with GRU. In the model, we first perform an embedding operation on the input sequence to quantify the external influences; we then implement an encoder using GRU and the attention mechanism. The hidden state of the GRU captures the essential time series information. In the decoder, we use attention again to select the key hidden states among all time slices as the data to be fed into the GRU network. Experimental results show that this model has higher accuracy than RNN, LSTM, GRU, Prophet, XGBoost, and DARNN. Evaluated with mean absolute error (MAE), root mean squared error (RMSE), and symmetric mean absolute percentage error (SMAPE), MAGRU's MAE, RMSE, and SMAPE decreased by 7.65%, 10.03%, and 8.87% relative to GRU-LSTM, the current best model for this type of problem.
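
For readers unfamiliar with SMAPE, the sketch below shows one common definition, the mean of 2|F - A| / (|A| + |F|) expressed as a percentage. Several variants of SMAPE exist and the abstract does not say which one the authors used, so this form is an assumption. The function name `smape` and the sample values are hypothetical.

```python
import numpy as np

def smape(actual, forecast):
    """Symmetric MAPE in percent, using the 2|F-A| / (|A|+|F|) variant (assumed).

    Terms where both actual and forecast are zero contribute 0.
    """
    a = np.asarray(actual, dtype=float)
    f = np.asarray(forecast, dtype=float)
    num = 2.0 * np.abs(f - a)
    denom = np.abs(a) + np.abs(f)
    # Divide only where the denominator is non-zero; other terms stay at 0.
    ratio = np.divide(num, denom, out=np.zeros_like(num), where=denom != 0)
    return float(100.0 * ratio.mean())

# Hypothetical demand values, not data from the paper.
score = smape([100.0, 200.0], [110.0, 180.0])
print(round(score, 3))
```

Unlike plain MAPE, this form is bounded (between 0% and 200%) and remains defined when an actual value is zero, which matters for demand series with idle periods.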

Accuracy Assessment of Forest Degradation Detection in Semantic Segmentation based Deep Learning Models with Time-series Satellite Imagery

  • Woo-Dam Sim;Jung-Soo Lee
    • Journal of Forest and Environmental Science / v.40 no.1 / pp.15-23 / 2024
  • This research aimed to assess the possibility of detecting forest degradation using time-series satellite imagery and three different deep learning-based change detection techniques. The dataset used for the deep learning models was composed of two sets, one based on surface reflectance (SR) spectral information from satellite imagery, combined with texture information (GLCM; Gray-Level Co-occurrence Matrix) and terrain information. The deep learning models employed for land cover change detection included image differencing using the Unet semantic segmentation model, the multi-encoder Unet model, and the multi-encoder Unet++ model. The study found no significant difference in accuracy between the deep learning models for forest degradation detection. Training and validation accuracies were approximately 89% and 92%, respectively. Among the three deep learning models, the multi-encoder Unet model showed the most efficient analysis time and comparable accuracy. Moreover, models that incorporated both texture and gradient information in addition to spectral information had a higher classification accuracy than models that used only spectral information. Overall, the accuracy of forest degradation extraction was outstanding, achieving 98%.

GENERALISED PARAMETERS TECHNIQUE FOR IDENTIFICATION OF SEASONAL ARMA (SARMA) AND NON SEASONAL ARMA (NSARMA) MODELS

  • M. Sreenivasan;K. Sumathi
    • Journal of applied mathematics & informatics / v.4 no.1 / pp.135-135 / 1997
  • Time series modeling plays an important role in fields such as engineering, statistics, and biomedicine. Model identification is one of the crucial steps in modeling an AutoRegressive Moving Average (ARMA(p, q)) process for real-world problems. Many techniques have been developed in the literature (Salas et al., McLeod et al., etc.) for the identification of an ARMA(p, q) model. In this paper, a new technique called the Generalised Parameters Technique is formulated for seasonal and non-seasonal ARMA model identification. This technique is very simple and can be applied to any given time series. Initial estimates of the AR parameters of the ARMA model are also obtained by this method. This model identification technique is validated through many theoretical and simulated examples.

A systematic review of studies using time series analysis of health and welfare in Korea (체계적 문헌고찰을 통한 국내 보건복지 분야의 시계열 분석 연구 동향)

  • Woo, Kyung-Sook;Shin, Young-Jeon
    • Journal of the Korean Data and Information Science Society / v.25 no.3 / pp.579-599 / 2014
  • The purpose of this study was to identify the trends and risk of bias of research using time series analysis on health and welfare in Korea and to suggest a direction for future health and welfare research. The database searches identified 6,543 papers. Following screening and selection, a total of 91 papers were included in the systematic review. The number of articles using time series analysis increased steadily from 1987 to 2013. Time series analysis was applied in medicine and health science journals. The main goals were explanation and description. Most of the subjects were health status and utilization of healthcare services. The main model used in the time series analysis was ARIMA, followed by time series regression. The data were gathered from various sources, including the national statistical office and government agencies. In the risk of bias assessment, some studies were found to have inadequate sample sizes or to show no time series graphs or plots. These findings suggest that time series analysis should be used more widely in the field of health and welfare, with appropriate analysis methods and statistical procedures, to obtain more reliable results and improve the quality of research.

Intelligent Digital Redesign of Uncertain Nonlinear Systems : Global approach (불확실성이 포함된 비선형 시스템에 대한 전역적 접근의 지능형 디지털 재설계)

  • Sung Hwachang;Joo Younghoon;Park Jinbae;Kim Dowan
    • Proceedings of the Korean Institute of Intelligent Systems Conference / 2005.11a / pp.95-98 / 2005
  • This paper presents a global-approach intelligent digital redesign method for hybrid state-space fuzzy-model-based controllers. To achieve effectiveness and stabilization of continuous-time uncertain nonlinear systems under a discrete-time controller, the Takagi-Sugeno (TS) fuzzy model is used to represent the complex system. The global design problem is viewed as a convex optimization problem in which the norm bound of the error between the nonlinearly interpolated linear operators to be matched is minimized. In addition, by using the power series, we analyze the uncertain parts of the nonlinear system more precisely. When the sampling period is sufficiently small, the conversion of a continuous-time structured uncertain nonlinear system to an equivalent discrete-time system is reasonable. Sufficient conditions for the global state-matching of the digitally controlled system are formulated in terms of linear matrix inequalities (LMIs). Finally, we demonstrate the effectiveness and stabilization of the proposed intelligent digital redesign method by applying it to the chaotic Lorenz system.
