Search | Korea Science

Clustering Algorithm for Time Series with Similar Shapes

Ahn, Jungyu;Lee, Ju-Hong
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.12 no.7
- /
- pp.3112-3127
- /
- 2018
Since time series clustering is performed without prior information, it is used for exploratory data analysis. In particular, clusters of time series with similar shapes can be used in various fields, such as business, medicine, finance, and communications. However, existing time series clustering algorithms have a problem in that time series with different shapes are included in the clusters. The reason for such a problem is that the existing algorithms do not consider the limitations on the size of the generated clusters, and use a dimension reduction method in which the information loss is large. In this paper, we propose a method to alleviate the disadvantages of existing methods and to find a better quality of cluster containing similarly shaped time series. In the data preprocessing step, we normalize the time series using z-transformation. Then, we use piecewise aggregate approximation (PAA) to reduce the dimension of the time series. In the clustering step, we use density-based spatial clustering of applications with noise (DBSCAN) to create a precluster. We then use a modified K-means algorithm to refine the preclusters containing differently shaped time series into subclusters containing only similarly shaped time series. In our experiments, our method showed better results than the existing method.
https://doi.org/10.3837/tiis.2018.07.008 인용 PDF KSCI

Efficient Time-Series Similarity Measurement and Ranking Based on Anomaly Detection (이상탐지 기반의 효율적인 시계열 유사도 측정 및 순위화)

Ji-Hyun Choi;Hyun Ahn
- Journal of Internet Computing and Services
- /
- v.25 no.2
- /
- pp.39-47
- /
- 2024
Time series analysis is widely employed by many organizations to solve business problems, as it extracts various information and insights from chronologically ordered data. Among its applications, measuring time series similarity is a step to identify time series with similar patterns, which is very important in time series analysis applications such as time series search and clustering. In this study, we propose an efficient method for measuring time series similarity that focuses on anomalies rather than the entire series. In this regard, we validate the proposed method by measuring and analyzing the rank correlation between the similarity measure for the set of subsets extracted by anomaly detection and the similarity measure for the whole time series. Experimental results, especially with stock time series data and an anomaly proportion of 10%, demonstrate a Spearman's rank correlation coefficient of up to 0.9. In conclusion, the proposed method can significantly reduce computation cost of measuring time series similarity, while providing reliable time series search and clustering results.
https://doi.org/10.7472/jksii.2024.25.2.39 인용 PDF HTML

Dimension Analysis of Chaotic Time Series Using Self Generating Neuro Fuzzy Model

Katayama, Ryu;Kuwata, Kaihei;Kajitani, Yuji;Watanabe, Masahide;Nishida, Yukiteru
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 1993.06a
- /
- pp.857-860
- /
- 1993
In this paper, we apply the self generating neuro fuzzy model (SGNFM) to the dimension analysis of the chaotic time series. Firstly, we formulate a nonlinear time series identification problem with nonlinear autoregressive (NARMAX) model. Secondly, we propose an identification algorithm using SGNFM. We apply this method to the estimation of embedding dimension for chaotic time series, since the embedding dimension plays an essential role for the identification and the prediction of chaotic time series. In this estimation method, identification problems with gradually increasing embedding dimension are solved, and the identified result is used for computing correlation coefficients between the predicted time series and the observed one. We apply this method to the dimension estimation of a chaotic pulsation in a finger's capillary vessels.
PDF

How to Measure Nonlinear Dependence in Hydrologic Time Series (시계열 수문자료의 비선형 상관관계)

Mun, Yeong-Il
- Journal of Korea Water Resources Association
- /
- v.30 no.6
- /
- pp.641-648
- /
- 1997
Mutual information is useful for analyzing nonlinear dependence in time series in much the same way as correlation is used to characterize linear dependence. We use multivariate kernel density estimators for the estimation of mutual information at different time lags for single and multiple time series. This approach is tested on a variety of hydrologic data sets, and suggested an appropriate delay time $ au$ at which the mutual information is almost zerothen multi-dimensional phase portraits could be constructed from measurements of a single scalar time series.
PDF

Comparison of prediction methods for Nonlinear Time series data with Intervention1)

Lee, Sung-Duck;Kim, Ju-Sung
- Journal of the Korean Data and Information Science Society
- /
- v.14 no.2
- /
- pp.265-274
- /
- 2003
Time series data are influenced by the external events such as holiday, strike, oil shock, and political change, so the external events cause a sudden change to the time series data. We regard the observation as outlier that occurred as a result of external events. In general, it is called intervention if we know the period and the reason of external events, and it makes an analyst difficult to establish a time series model. Therefore, it is important that we analyze the styles and effects of intervention. In this paper, we considered the linear time series model with invention and compared with nonlinear time series models such as ARCH, GARCH model and also we compared with the combination prediction method that Tong(1990) introduced. In the practical case study, we compared prediction power with RMSE among linear, nonlinear time series model with intervention and combination prediction method.
PDF

QP-DTW: Upgrading Dynamic Time Warping to Handle Quasi Periodic Time Series Alignment

Boulnemour, Imen;Boucheham, Bachir
- Journal of Information Processing Systems
- /
- v.14 no.4
- /
- pp.851-876
- /
- 2018
Dynamic time warping (DTW) is the main algorithms for time series alignment. However, it is unsuitable for quasi-periodic time series. In the current situation, except the recently published the shape exchange algorithm (SEA) method and its derivatives, no other technique is able to handle alignment of this type of very complex time series. In this work, we propose a novel algorithm that combines the advantages of the SEA and the DTW methods. Our main contribution consists in the elevation of the DTW power of alignment from the lowest level (Class A, non-periodic time series) to the highest level (Class C, multiple-periods time series containing different number of periods each), according to the recent classification of time series alignment methods proposed by Boucheham (Int J Mach Learn Cybern, vol. 4, no. 5, pp. 537-550, 2013). The new method (quasi-periodic dynamic time warping [QP-DTW]) was compared to both SEA and DTW methods on electrocardiogram (ECG) time series, selected from the Massachusetts Institute of Technology - Beth Israel Hospital (MIT-BIH) public database and from the PTB Diagnostic ECG Database. Results show that the proposed algorithm is more effective than DTW and SEA in terms of alignment accuracy on both qualitative and quantitative levels. Therefore, QP-DTW would potentially be more suitable for many applications related to time series (e.g., data mining, pattern recognition, search/retrieval, motif discovery, classification, etc.).
https://doi.org/10.3745/JIPS.02.0090 인용 PDF KSCI

Research on data augmentation algorithm for time series based on deep learning

Shiyu Liu;Hongyan Qiao;Lianhong Yuan;Yuan Yuan;Jun Liu
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.17 no.6
- /
- pp.1530-1544
- /
- 2023
Data monitoring is an important foundation of modern science. In most cases, the monitoring data is time-series data, which has high application value. The deep learning algorithm has a strong nonlinear fitting capability, which enables the recognition of time series by capturing anomalous information in time series. At present, the research of time series recognition based on deep learning is especially important for data monitoring. Deep learning algorithms require a large amount of data for training. However, abnormal sample is a small sample in time series, which means the number of abnormal time series can seriously affect the accuracy of recognition algorithm because of class imbalance. In order to increase the number of abnormal sample, a data augmentation method called GANBATS (GAN-based Bi-LSTM and Attention for Time Series) is proposed. In GANBATS, Bi-LSTM is introduced to extract the timing features and then transfer features to the generator network of GANBATS.GANBATS also modifies the discriminator network by adding an attention mechanism to achieve global attention for time series. At the end of discriminator, GANBATS is adding averagepooling layer, which merges temporal features to boost the operational efficiency. In this paper, four time series datasets and five data augmentation algorithms are used for comparison experiments. The generated data are measured by PRD(Percent Root Mean Square Difference) and DTW(Dynamic Time Warping). The experimental results show that GANBATS reduces up to 26.22 in PRD metric and 9.45 in DTW metric. In addition, this paper uses different algorithms to reconstruct the datasets and compare them by classification accuracy. The classification accuracy is improved by 6.44%-12.96% on four time series datasets.
https://doi.org/10.3837/tiis.2023.06.002 인용 PDF HTML

The usefulness of overfitting via artificial neural networks for non-stationary time series

Ahn Jae-Joon;Oh Kyong-Joo;Kim Tae-Yoon
- Proceedings of the Korean Operations and Management Science Society Conference
- /
- 2006.05a
- /
- pp.1221-1226
- /
- 2006
The use of Artificial Neural Networks (ANN) has received increasing attention in the analysis and prediction of financial time series. Stationarity of the observed financial time series is the basic underlying assumption in the practical application of ANN on financial time series. In this paper, we will investigate whether it is feasible to relax the stationarity condition to non-stationary time series. Our result discusses the range of complexities caused by non-stationary behavior and finds that overfitting by ANN could be useful in the analysis of such non-stationary complex financial time series.
PDF

Time-Series Forecasting Based on Multi-Layer Attention Architecture

Na Wang;Xianglian Zhao
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.18 no.1
- /
- pp.1-14
- /
- 2024
Time-series forecasting is extensively used in the actual world. Recent research has shown that Transformers with a self-attention mechanism at their core exhibit better performance when dealing with such problems. However, most of the existing Transformer models used for time series prediction use the traditional encoder-decoder architecture, which is complex and leads to low model processing efficiency, thus limiting the ability to mine deep time dependencies by increasing model depth. Secondly, the secondary computational complexity of the self-attention mechanism also increases computational overhead and reduces processing efficiency. To address these issues, the paper designs an efficient multi-layer attention-based time-series forecasting model. This model has the following characteristics: (i) It abandons the traditional encoder-decoder based Transformer architecture and constructs a time series prediction model based on multi-layer attention mechanism, improving the model's ability to mine deep time dependencies. (ii) A cross attention module based on cross attention mechanism was designed to enhance information exchange between historical and predictive sequences. (iii) Applying a recently proposed sparse attention mechanism to our model reduces computational overhead and improves processing efficiency. Experiments on multiple datasets have shown that our model can significantly increase the performance of current advanced Transformer methods in time series forecasting, including LogTrans, Reformer, and Informer.
https://doi.org/10.3837/tiis.2024.01.001 인용 PDF HTML

Time Series Representation Combining PIPs Detection and Persist Discretization Techniques for Time Series Classification (시계열 분류를 위한 PIPs 탐지와 Persist 이산화 기법들을 결합한 시계열 표현)

Park, Sang-Ho;Lee, Ju-Hong
- The Journal of the Korea Contents Association
- /
- v.10 no.9
- /
- pp.97-106
- /
- 2010
Various time series representation methods have been suggested in order to process time series data efficiently and effectively. SAX is the representative time series representation method combining segmentation and discretization techniques, which has been successfully applied to the time series classification task. But SAX requires a large number of segments in order to represent the meaningful dynamic patterns of time series accurately, since it loss the dynamic property of time series in the course of smoothing the movement of time series. Therefore, this paper suggests a new time series representation method that combines PIPs detection and Persist discretization techniques. The suggested method represents the dynamic movement of high-diemensional time series in a lower dimensional space by detecting PIPs indicating the important inflection points of time series. And it determines the optimal discretizaton ranges by applying self-transition and marginal probabilities distributions to KL divergence measure. It minimizes the information loss in process of the dimensionality reduction. The suggested method enhances the performance of time series classification task by minimizing the information loss in the course of dimensionality reduction.
https://doi.org/10.5392/JKCA.2010.10.9.097 인용 PDF KSCI

Search Result 1,889, Processing Time 0.039 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)