• Title/Summary/Keyword: Time series method

Search Result 2,385, Processing Time 0.032 seconds

Clustering Algorithm for Time Series with Similar Shapes

  • Ahn, Jungyu;Lee, Ju-Hong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.7
    • /
    • pp.3112-3127
    • /
    • 2018
  • Since time series clustering is performed without prior information, it is used for exploratory data analysis. In particular, clusters of time series with similar shapes can be used in various fields, such as business, medicine, finance, and communications. However, existing time series clustering algorithms have a problem in that time series with different shapes are included in the clusters. The reason for such a problem is that the existing algorithms do not consider the limitations on the size of the generated clusters, and use a dimension reduction method in which the information loss is large. In this paper, we propose a method to alleviate the disadvantages of existing methods and to find a better quality of cluster containing similarly shaped time series. In the data preprocessing step, we normalize the time series using z-transformation. Then, we use piecewise aggregate approximation (PAA) to reduce the dimension of the time series. In the clustering step, we use density-based spatial clustering of applications with noise (DBSCAN) to create a precluster. We then use a modified K-means algorithm to refine the preclusters containing differently shaped time series into subclusters containing only similarly shaped time series. In our experiments, our method showed better results than the existing method.

Time Series Representation Combining PIPs Detection and Persist Discretization Techniques for Time Series Classification (시계열 분류를 위한 PIPs 탐지와 Persist 이산화 기법들을 결합한 시계열 표현)

  • Park, Sang-Ho;Lee, Ju-Hong
    • The Journal of the Korea Contents Association
    • /
    • v.10 no.9
    • /
    • pp.97-106
    • /
    • 2010
  • Various time series representation methods have been suggested in order to process time series data efficiently and effectively. SAX is the representative time series representation method combining segmentation and discretization techniques, which has been successfully applied to the time series classification task. But SAX requires a large number of segments in order to represent the meaningful dynamic patterns of time series accurately, since it loss the dynamic property of time series in the course of smoothing the movement of time series. Therefore, this paper suggests a new time series representation method that combines PIPs detection and Persist discretization techniques. The suggested method represents the dynamic movement of high-diemensional time series in a lower dimensional space by detecting PIPs indicating the important inflection points of time series. And it determines the optimal discretizaton ranges by applying self-transition and marginal probabilities distributions to KL divergence measure. It minimizes the information loss in process of the dimensionality reduction. The suggested method enhances the performance of time series classification task by minimizing the information loss in the course of dimensionality reduction.

Irregularly-Sampled Time Series Correction Method for Anomaly Detection in Manufacturing Facility (생산 설비의 이상탐지를 위한 불규칙 샘플링 시계열 데이터 보정 기법)

  • Shin, Kang-hyeon;Jin, Kyo-hong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.10a
    • /
    • pp.85-88
    • /
    • 2021
  • There are many irregularly-sampled time series in the manufacturing data which are collected from manufacturing facilities by short intervals. Those time series often have large variance. In this paper, we propose irregularly-sampled time series correction method based on simple moving average. This method corrects time intervals between neighboring values in time series regularly and reduces the variance of the values at the same time. We examine that this method improves performance of anomaly detection in manufacturing facility.

  • PDF

Predicting Nonstationary Time Series with Fuzzy Learning Based on Consecutive Data (연속된 데이터의 퍼지학습에 의한 비정상 시계열 예측)

  • Kim, In-Taek
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.50 no.5
    • /
    • pp.233-240
    • /
    • 2001
  • This paper presents a time series prediction method using a fuzzy rule-based system. Extracting fuzzy rules by performing a simple one-pass operation on the training data is quite attractive because it is easy to understand, verify, and extend. The simplest method is probably to relate an estimate, x(n+k), with past data such as x(n), x(n-1), ..x(n-m), where k and m are prefixed positive integers. The relation is represented by fuzzy if-then rules, where the past data stand for premise part and the predicted value for consequence part. However, a serious problem of the method is that it cannot handle nonstationary data whose long-term mean is varying. To cope with this, a new training method is proposed, which utilizes the difference of consecutive data in a time series. In this paper, typical previous works relating time series prediction are briefly surveyed and a new method is proposed to overcome the difficulty of prediction nonstationary data. Finally, computer simulations are illustrated to show the improved results for various time series.

  • PDF

A Procedure for Computing Conduction Time Series Factors by Numerical Method (전도 시계열 계수를 수치해석으로 구하는 방법)

  • Byun, Ki-Hong
    • Journal of the Korean Solar Energy Society
    • /
    • v.37 no.5
    • /
    • pp.77-84
    • /
    • 2017
  • The purpose of this paper is to propose the way of computing conduction time series factors (CTSF) using numerical method. After the accuracy of the numerical solution procedure being verified, the method is applied to the wall type 24 and roof type 14 of ASHARE to find the conduction time series coefficients, so called conduction time series factors. The results agree well with the values presented in the ASHRAE handbook. The method proposed can be easily applied to find unknown CTSF for more complex structures. It provides information about the temperature changes at a given location and time, thus validity of generated CTSF can be checked easily.

Chaotic Forecast of Time-Series Data Using Inverse Wavelet Transform

  • Matsumoto, Yoshiyuki;Yabuuchi, Yoshiyuki;Watada, Junzo
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.09a
    • /
    • pp.338-341
    • /
    • 2003
  • Recently, the chaotic method is employed to forecast a near future of uncertain phenomena. This method makes it possible by restructuring an attractor of given time-series data in multi-dimensional space through Takens' embedding theory. However, many economical time-series data are not sufficiently chaotic. In other words, it is hard to forecast the future trend of such economical data on the basis of chaotic theory. In this paper, time-series data are divided into wave components using wavelet transform. It is shown that some divided components of time-series data show much more chaotic in the sense of correlation dimension than the original time-series data. The highly chaotic nature of the divided component enables us to precisely forecast the value or the movement of the time-series data in near future. The up and down movement of TOPICS value is shown so highly predicted by this method as 70%.

  • PDF

Comparison of prediction methods for Nonlinear Time series data with Intervention1)

  • Lee, Sung-Duck;Kim, Ju-Sung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.2
    • /
    • pp.265-274
    • /
    • 2003
  • Time series data are influenced by the external events such as holiday, strike, oil shock, and political change, so the external events cause a sudden change to the time series data. We regard the observation as outlier that occurred as a result of external events. In general, it is called intervention if we know the period and the reason of external events, and it makes an analyst difficult to establish a time series model. Therefore, it is important that we analyze the styles and effects of intervention. In this paper, we considered the linear time series model with invention and compared with nonlinear time series models such as ARCH, GARCH model and also we compared with the combination prediction method that Tong(1990) introduced. In the practical case study, we compared prediction power with RMSE among linear, nonlinear time series model with intervention and combination prediction method.

  • PDF

Efficient Time-Series Similarity Measurement and Ranking Based on Anomaly Detection (이상탐지 기반의 효율적인 시계열 유사도 측정 및 순위화)

  • Ji-Hyun Choi;Hyun Ahn
    • Journal of Internet Computing and Services
    • /
    • v.25 no.2
    • /
    • pp.39-47
    • /
    • 2024
  • Time series analysis is widely employed by many organizations to solve business problems, as it extracts various information and insights from chronologically ordered data. Among its applications, measuring time series similarity is a step to identify time series with similar patterns, which is very important in time series analysis applications such as time series search and clustering. In this study, we propose an efficient method for measuring time series similarity that focuses on anomalies rather than the entire series. In this regard, we validate the proposed method by measuring and analyzing the rank correlation between the similarity measure for the set of subsets extracted by anomaly detection and the similarity measure for the whole time series. Experimental results, especially with stock time series data and an anomaly proportion of 10%, demonstrate a Spearman's rank correlation coefficient of up to 0.9. In conclusion, the proposed method can significantly reduce computation cost of measuring time series similarity, while providing reliable time series search and clustering results.

QP-DTW: Upgrading Dynamic Time Warping to Handle Quasi Periodic Time Series Alignment

  • Boulnemour, Imen;Boucheham, Bachir
    • Journal of Information Processing Systems
    • /
    • v.14 no.4
    • /
    • pp.851-876
    • /
    • 2018
  • Dynamic time warping (DTW) is the main algorithms for time series alignment. However, it is unsuitable for quasi-periodic time series. In the current situation, except the recently published the shape exchange algorithm (SEA) method and its derivatives, no other technique is able to handle alignment of this type of very complex time series. In this work, we propose a novel algorithm that combines the advantages of the SEA and the DTW methods. Our main contribution consists in the elevation of the DTW power of alignment from the lowest level (Class A, non-periodic time series) to the highest level (Class C, multiple-periods time series containing different number of periods each), according to the recent classification of time series alignment methods proposed by Boucheham (Int J Mach Learn Cybern, vol. 4, no. 5, pp. 537-550, 2013). The new method (quasi-periodic dynamic time warping [QP-DTW]) was compared to both SEA and DTW methods on electrocardiogram (ECG) time series, selected from the Massachusetts Institute of Technology - Beth Israel Hospital (MIT-BIH) public database and from the PTB Diagnostic ECG Database. Results show that the proposed algorithm is more effective than DTW and SEA in terms of alignment accuracy on both qualitative and quantitative levels. Therefore, QP-DTW would potentially be more suitable for many applications related to time series (e.g., data mining, pattern recognition, search/retrieval, motif discovery, classification, etc.).

Dimension Analysis of Chaotic Time Series Using Self Generating Neuro Fuzzy Model

  • Katayama, Ryu;Kuwata, Kaihei;Kajitani, Yuji;Watanabe, Masahide;Nishida, Yukiteru
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 1993.06a
    • /
    • pp.857-860
    • /
    • 1993
  • In this paper, we apply the self generating neuro fuzzy model (SGNFM) to the dimension analysis of the chaotic time series. Firstly, we formulate a nonlinear time series identification problem with nonlinear autoregressive (NARMAX) model. Secondly, we propose an identification algorithm using SGNFM. We apply this method to the estimation of embedding dimension for chaotic time series, since the embedding dimension plays an essential role for the identification and the prediction of chaotic time series. In this estimation method, identification problems with gradually increasing embedding dimension are solved, and the identified result is used for computing correlation coefficients between the predicted time series and the observed one. We apply this method to the dimension estimation of a chaotic pulsation in a finger's capillary vessels.

  • PDF