• Title/Summary/Keyword: Time-series data prediction

Implementation of Fund Recommendation System Using Machine Learning

  • Park, Chae-eun;Lee, Dong-seok;Nam, Sung-hyun;Kwon, Soon-kak
    • Journal of Multimedia Information System / v.8 no.3 / pp.183-190 / 2021
  • In this paper, we implement a system that recommends funds based on the user's investment propensity and predicts future fund prices. The investment propensity is classified by scoring the user's responses to a series of questions. The proposed system recommends funds whose risk rating suits the user's investment propensity. Future fund prices are predicted by the Prophet model, a machine learning method for time-series data prediction. The Prophet model predicts future fund prices by learning the parameters related to trend changes. Prediction with the Prophet model is simple and fast because the temporal dependency in predicting time-series data can be removed. We implement web pages for the fund recommendation and for the future fund price prediction.

Forecasting Baltic Dry Index by Implementing Time-Series Decomposition and Data Augmentation Techniques (시계열 분해 및 데이터 증강 기법 활용 건화물운임지수 예측)

  • Han, Min Soo;Yu, Song Jin
    • Journal of Korean Society for Quality Management / v.50 no.4 / pp.701-716 / 2022
  • Purpose: This study aims to predict the state of the dry cargo transportation market. The subject of the study is the BDI (Baltic Dry Index) time-series, an index representing the dry cargo transport market. Methods: To increase the prediction accuracy for the BDI time-series, we pre-processed the original time-series with time-series decomposition and data augmentation techniques and used the results for ANN training. The ANN algorithms used are the Multi-Layer Perceptron (MLP), Recurrent Neural Network (RNN), and Long Short-Term Memory (LSTM), and we compare and analyze how each learns and predicts when time-series decomposition and data augmentation are applied. The forecast targets short-term prediction at time t+1. The study period is from '22. 01. 07 to '22. 08. 26. Results: For the MAPE (Mean Absolute Percentage Error) indicator only, all ANN models used in the research achieved higher accuracy (1.422% on average) in multivariate prediction. Although this is not a remarkable improvement over the univariate prediction results, it indicates that the time-series decomposition and data augmentation techniques, which were the focus of this study, improved ANN prediction performance. Conclusion: Given the nature of ANNs, further performance improvements can be expected from hyper-parameter tuning. It is therefore necessary to try various combinations of learning algorithms and ANN optimization techniques. Such an approach would help with problems where little data is available, such as the rapidly changing business environment or the current shipping market.
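As a rough illustration of how decomposition can turn a univariate series into multivariate input for one-step-ahead (t+1) prediction, the sketch below splits a series into a moving-average trend and a residual. The abstract does not say which decomposition or augmentation method the authors used, so the moving-average approach, the function names, and the sample values are all assumptions made for illustration.

```python
def moving_average_trend(series, window=3):
    """Centered moving average; the window shrinks at the edges (an assumption)."""
    half = window // 2
    trend = []
    for i in range(len(series)):
        lo = max(0, i - half)
        hi = min(len(series), i + half + 1)
        trend.append(sum(series[lo:hi]) / (hi - lo))
    return trend


def decompose(series, window=3):
    """Additive split: series = trend + residual."""
    trend = moving_average_trend(series, window)
    residual = [x - t for x, t in zip(series, trend)]
    return trend, residual


def make_supervised(series, window=3):
    """Pair (value, trend, residual) at time t with the target value at t+1."""
    trend, residual = decompose(series, window)
    features = list(zip(series, trend, residual))
    return features[:-1], series[1:]


# Hypothetical index values, not BDI data.
values = [1.0, 2.0, 3.0, 4.0, 5.0]
X, y = make_supervised(values)
for row, target in zip(X, y):
    print(row, "->", target)
```

Each row of `X` could then be fed to an MLP, RNN, or LSTM in place of the raw value alone, which is one way to read "multivariate prediction" in the abstract.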

A Study on the Health Index Based on Degradation Patterns in Time Series Data Using ProphetNet Model (ProphetNet 모델을 활용한 시계열 데이터의 열화 패턴 기반 Health Index 연구)

  • Sun-Ju Won;Yong Soo Kim
    • Journal of Korean Society of Industrial and Systems Engineering / v.46 no.3 / pp.123-138 / 2023
  • The Fourth Industrial Revolution and sensor technology have led to increased utilization of sensor data. In our modern society, data complexity is rising, and the extraction of valuable information has become crucial with the rapid changes in information technology (IT). Recurrent neural networks (RNN) and long short-term memory (LSTM) models have shown remarkable performance in natural language processing (NLP) and time series prediction. Consequently, there is a strong expectation that models excelling in NLP will also excel in time series prediction. However, current research on Transformer models for time series prediction remains limited. Traditional RNN and LSTM models have demonstrated superior performance compared to Transformers in big data analysis. Nevertheless, with continuous advancements in Transformer models, such as GPT-2 (Generative Pre-trained Transformer 2) and ProphetNet, they have gained attention in the field of time series prediction. This study aims to evaluate the classification performance and interval prediction of remaining useful life (RUL) using an advanced Transformer model. The performance of each model will be utilized to establish a health index (HI) for cutting blades, enabling real-time monitoring of machine health. The results are expected to provide valuable insights for machine monitoring, evaluation, and management, confirming the effectiveness of advanced Transformer models in time series analysis when applied in industrial settings.

A Fuzzy Time-Series Prediction with Preprocessing (전처리과정을 갖는 시계열데이터의 퍼지예측)

  • Yoon, Sang-Hun;Lee, Chul-Hee
    • Proceedings of the KIEE Conference / 2000.11d / pp.666-668 / 2000
  • In this paper, a fuzzy prediction method is proposed for time series data with uncertainty and non-stationary characteristics. Conventional methods, which use past data directly in the prediction procedure, cannot properly handle non-stationary data whose long-term mean drifts. To cope with this problem, a data preprocessing technique that uses the differences of the original time series data is suggested. Difference sets are established from the data, and the optimal difference set is selected as input to the fuzzy predictor. The proposed method is based on the Takagi-Sugeno-Kang (TSK or TS) fuzzy rule. Computer simulations show improved results for various time series.
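The differencing step described above can be sketched as follows. This is a minimal illustration of building difference sets at several intervals and mapping a predicted difference back to the original scale. The function names and sample data are hypothetical, and the fuzzy predictor itself and the paper's criterion for choosing the optimal difference set are not reproduced here.

```python
def difference(series, interval=1):
    """Difference set: d[t] = x[t] - x[t - interval]."""
    return [series[i] - series[i - interval] for i in range(interval, len(series))]


def restore_next(series, predicted_diff, interval=1):
    """Map a predicted difference for the next step back to the original scale.

    The next value x[n] satisfies x[n] - x[n - interval] = d, so
    x[n] = series[-interval] + d.
    """
    return series[-interval] + predicted_diff


# Hypothetical series with a drifting mean.
x = [10, 12, 15, 19]
candidates = {k: difference(x, k) for k in (1, 2)}
print(candidates)              # difference sets for intervals 1 and 2
print(restore_next(x, 5, 1))   # next value if the predictor outputs a difference of 5
```

A predictor trained on one of the candidate sets outputs a difference, and `restore_next` converts that output into a forecast of the series itself.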

Prediction on Clusters by using Information Criterion and Multiple Seeds (정보기준과 다중 중심점을 활용한 클러스터별 예측)

  • Cho, Young-Hee;Lee, Gye-Sung
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.10 no.6 / pp.145-152 / 2010
  • The Bayesian information criterion is used to cluster time series data. To obtain more stable clusters, multiple seeds are first chosen for the algorithm. Once the clusters are set up, the time series in the cluster most similar to the one under consideration are chosen for the prediction test. These chosen time series are used to extract valid Markov rules, with which we test the prediction accuracy. We confirmed that clustering with multiple seeds led to better prediction performance.

A Study on Improving Prediction Accuracy by Modeling Multiple Similar Time Series (다중 유사 시계열 모델링 방법을 통한 예측정확도 개선에 관한 연구)

  • Cho, Young-Hee;Lee, Gye-Sung
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.10 no.6 / pp.137-143 / 2010
  • This research studies a method for improving prediction accuracy through the processing of time series data. We designed techniques to model multiple similar time series and thereby avoid the shortcomings of a single prediction model. We predicted future changes using effective rules derived from these models. Three methods were used to test prediction accuracy: fixed interval, sliding, and cumulative. Among the three, the cumulative method produced the highest accuracy.
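The abstract names three test schemes but does not define them. One common reading, assumed here and not confirmed by the source, is a single fixed train/test split, a sliding window of constant training length, and a cumulative (expanding) window that keeps all past data. The sketch below generates index ranges for each. All function names and sizes are hypothetical.

```python
def fixed_split(n, train_size):
    """One split: train on the first train_size points, test on the rest."""
    return [(range(0, train_size), range(train_size, n))]


def sliding_splits(n, train_size, test_size):
    """Training window of constant length that moves forward by test_size."""
    splits = []
    start = 0
    while start + train_size + test_size <= n:
        train = range(start, start + train_size)
        test = range(start + train_size, start + train_size + test_size)
        splits.append((train, test))
        start += test_size
    return splits


def cumulative_splits(n, initial_size, test_size):
    """Training window that always starts at 0 and grows by test_size."""
    splits = []
    end = initial_size
    while end + test_size <= n:
        splits.append((range(0, end), range(end, end + test_size)))
        end += test_size
    return splits


for name, splits in [
    ("fixed", fixed_split(10, 6)),
    ("sliding", sliding_splits(10, 4, 2)),
    ("cumulative", cumulative_splits(10, 4, 2)),
]:
    print(name, [(list(tr), list(te)) for tr, te in splits])
```

Under this reading, the cumulative scheme gives each later test block more training history, which is consistent with, though not proof of, the reported accuracy ranking.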

Development of the Roundwood Demand Prediction Model

  • Kim, Dong-Jun
    • Journal of Korean Society of Forest Science / v.95 no.2 / pp.203-208 / 2006
  • This study compared the roundwood demand prediction accuracy of econometric and time-series models using Korean data. Roundwood was divided into softwood and hardwood by species. The econometric model of roundwood demand was specified with four explanatory variables: own price, substitute price, gross domestic product, and a dummy variable. The time-series model was specified with a lagged endogenous variable. The dummy variable reflected the abrupt decrease in roundwood demand in the late 1990s for softwood roundwood, and the boom in plywood exports in the late 1970s for hardwood roundwood. Prediction accuracy was evaluated using the root mean square error (RMSE). The results showed that softwood roundwood demand can be predicted more accurately by the econometric model than by the time-series model. However, for hardwood roundwood, the prediction accuracy of the econometric and time-series models was similar.
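A time-series model with a lagged endogenous variable, scored by RMSE, can be sketched as below. The single-lag form y[t] = a + b * y[t-1], the least-squares fit, the in-sample evaluation, and the demand figures are all assumptions for illustration. The abstract does not give the paper's actual specification or data.

```python
import math


def fit_lagged_model(series):
    """Fit y[t] = a + b * y[t-1] by ordinary least squares; returns (a, b)."""
    x = series[:-1]
    y = series[1:]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def rmse(actual, predicted):
    """Root mean square error between two equal-length sequences."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


# Made-up demand figures, not the study's data.
demand = [100.0, 104.0, 110.0, 113.0, 120.0, 126.0]
a, b = fit_lagged_model(demand)
fitted = [a + b * prev for prev in demand[:-1]]
print("a =", a, "b =", b)
print("in-sample RMSE =", rmse(demand[1:], fitted))
```

The same `rmse` function could score an econometric model's predictions, which is how the two model families would be compared on a common scale.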

A Study on the Demand Prediction Model for Repair Parts of Automotive After-sales Service Center Using LSTM Artificial Neural Network (LSTM 인공신경망을 이용한 자동차 A/S센터 수리 부품 수요 예측 모델 연구)

  • Jung, Dong Kun;Park, Young Sik
    • The Journal of Information Systems / v.31 no.3 / pp.197-220 / 2022
  • Purpose: This study identifies the demand pattern categories of repair parts for automotive after-sales service (A/S) and proposes a demand prediction model for auto repair parts using Long Short-Term Memory (LSTM), an artificial neural network (ANN). The optimal parts inventory quantity prediction model is implemented by applying daily, weekly, and monthly parts demand data to the LSTM model for lumpy demand, which occurs irregularly in specific periods, among automotive A/S repair parts. Design/methodology/approach: This study classified parts into four demand pattern categories using 2 years of demand time-series data for repair parts, according to the average demand interval (ADI) and the squared coefficient of variation (CV2) of demand size. Of the 16,295 parts in the A/S service shop studied, 96.5% had a lumpy demand pattern, in which large quantities were demanded in specific periods. Demand over the last three years for repair parts with a lumpy pattern was predicted by applying the LSTM to daily, weekly, and monthly time-series data. MAPE, RMSE, and RMSLE, which measure the error between predicted and actual values, were used as the model performance evaluation indices. Findings: Daily time-series data were predicted best, with the lowest MAPE, RMSE, and RMSLE values, followed by weekly and monthly time-series data. This is due to the smaller amount of training data for the weekly and monthly series. Even if the demand period is extended to obtain more training data, prediction performance remains low because the discontinuation of current vehicle models and the use of alternative parts eliminate further demand. Therefore, sufficient training data is important, but the selection of the demand period used for prediction is also a critical factor.
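The four-way classification by ADI and CV2 can be sketched as below. The cutoffs 1.32 (ADI) and 0.49 (CV2) are the values commonly cited for this scheme in the intermittent-demand literature. The abstract does not state which thresholds the authors used, so treat them, the use of the population standard deviation, and the sample data as assumptions.

```python
from statistics import mean, pstdev


def classify_demand(demand, adi_cut=1.32, cv2_cut=0.49):
    """Classify a demand history as smooth, intermittent, erratic, or lumpy.

    ADI = number of periods / number of periods with nonzero demand.
    CV2 = (std / mean) ** 2 of the nonzero demand sizes.
    The default cutoffs are assumed, not taken from the abstract.
    """
    nonzero = [d for d in demand if d > 0]
    if not nonzero:
        raise ValueError("demand history contains no nonzero periods")
    adi = len(demand) / len(nonzero)
    cv2 = (pstdev(nonzero) / mean(nonzero)) ** 2
    if adi < adi_cut:
        label = "smooth" if cv2 < cv2_cut else "erratic"
    else:
        label = "intermittent" if cv2 < cv2_cut else "lumpy"
    return label, adi, cv2


# Hypothetical daily demand for one part: rare orders of very different sizes.
history = [0, 0, 10, 0, 0, 1, 0, 0]
print(classify_demand(history))
```

A part like the one in `history`, with long gaps and highly variable order sizes, falls in the lumpy quadrant, the category the abstract reports for 96.5% of parts.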

Multiple Model Prediction System Based on Optimal TS Fuzzy Model and Its Applications to Time Series Forecasting (최적 TS 퍼지 모델 기반 다중 모델 예측 시스템의 구현과 시계열 예측 응용)

  • Bang, Young-Keun;Lee, Chul-Heui
    • Journal of Industrial Technology / v.28 no.B / pp.101-109 / 2008
  • In general, forecasting non-stationary or chaotic time series is very difficult because they contain drift and/or nonlinearities. To overcome this, we suggest a new prediction method based on multiple-model TS fuzzy predictors combined with preprocessing of the time series data, in which the differences of the data, rather than the data themselves, are applied to the predictors as input. In the preprocessing procedure, candidates for the optimal difference interval are determined by correlation analysis, and the corresponding difference data are generated. Then, for each candidate, a TS fuzzy predictor is constructed using the k-means clustering algorithm and the least squares method. Finally, the predictor that minimizes the performance index is selected and used for prediction thereafter. Computer simulations show the effectiveness and usefulness of our method.

A Study on Prediction the Movement Pattern of Time Series Data using Information Criterion and Effective Data Length (정보기준과 효율적 자료길이를 활용한 시계열자료 운동패턴 예측 연구)

  • Jeon, Jin-Ho;Kim, Min-Soo
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.13 no.1 / pp.101-107 / 2013
  • In the real world, large amounts of time series data are generated in real time across a wide range of business areas. However, it is not easy to determine the optimal model for describing and understanding time series data, which exhibit dynamic characteristics. In this study, we used a hidden Markov model (HMM), which is suitable for both short- and long-term forecasting of time-series data, to estimate a model that explains the characteristics of such data and to predict future movement patterns. Using various actual stock market data, we found that estimating the optimal model with an information criterion and the most efficient data length allowed the state of the model to be estimated accurately. The experimental results showed that predicted movement patterns were more similar to the actual patterns in short-term prediction than in long-term prediction.