http://dx.doi.org/10.7469/JKSQM.2022.50.3.459

A Pre-processing Process Using TadGAN-based Time-series Anomaly Detection  

Lee, Seung Hoon (Department of Industrial and Management Engineering, Kyonggi University Graduate School)
Kim, Yong Soo (Department of Industrial System Engineering, Kyonggi University)
Abstract
Purpose: This study aimed to increase prediction accuracy by establishing a pre-processing process built on anomaly intervals identified with an artificial intelligence-based time-series anomaly detection technique. Methods: Significant variables were extracted by applying feature selection techniques, and anomalies were derived using the TadGAN time-series anomaly detection algorithm. Machine learning and deep learning models were then trained on normal-section data (excluding the anomaly sections), and the explanatory power of the anomaly sections was demonstrated through performance comparison. Results: Among the machine learning models, performance was best when SHAP and TadGAN were applied; among the deep learning models, performance was excellent when the Chi-square test and TadGAN were applied. Compared with papers that applied a conventional methodology to the same data, performance improved significantly, with reported improvements of 15% for MLR, 24% for Random Forest, 30% for XGBoost, 73% for Lasso Regression, 17% for LSTM, and 19% for GRU. Conclusion: The proposed process is expected to help demonstrate the accuracy and explanatory power of detected anomaly sections, and to improve model performance, when unsupervised anomaly detection is applied to unlabeled data in fields such as cyber security, finance, behavior pattern analysis, and SNS.
Keywords
Pre-processing Process; Time-series Anomaly Detection; TadGAN; Unsupervised Learning
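
The pre-processing idea in the Methods can be illustrated with a minimal sketch: drop the intervals flagged as anomalous, then fit a model on the remaining normal-section data and compare it with a model fitted on everything. This is not the authors' implementation. The toy series, the `(start, end)` interval format, and the helper names `normal_mask`, `fit_line`, and `rmse` are all assumptions made for illustration, and the `detected` list is a hand-written stand-in for the output of a detector such as TadGAN. A simple least-squares line replaces the regression and deep learning models named in the abstract.

```python
from typing import List, Sequence, Tuple

# Hypothetical interval format: inclusive (start_index, end_index).
Interval = Tuple[int, int]


def normal_mask(n: int, anomalies: Sequence[Interval]) -> List[bool]:
    """Return True for every index that lies outside all anomaly intervals."""
    mask = [True] * n
    for start, end in anomalies:
        for i in range(max(start, 0), min(end, n - 1) + 1):
            mask[i] = False
    return mask


def fit_line(x: Sequence[float], y: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def rmse(x: Sequence[float], y: Sequence[float],
         slope: float, intercept: float) -> float:
    """Root mean squared error of the fitted line on (x, y)."""
    errors = [(slope * xi + intercept - yi) ** 2 for xi, yi in zip(x, y)]
    return (sum(errors) / len(errors)) ** 0.5


# Toy series: y = 2x + 1 with a spike injected at indices 10..14.
x = [float(i) for i in range(30)]
y = [2.0 * xi + 1.0 for xi in x]
for i in range(10, 15):
    y[i] += 50.0

# Stand-in for the intervals an anomaly detector might flag (assumed, not computed).
detected: List[Interval] = [(10, 14)]

mask = normal_mask(len(x), detected)
x_normal = [xi for xi, keep in zip(x, mask) if keep]
y_normal = [yi for yi, keep in zip(y, mask) if keep]

baseline = fit_line(x, y)                # trained on all data, anomalies included
cleaned = fit_line(x_normal, y_normal)   # trained on normal-section data only

# Both models are scored on the normal-section data.
rmse_baseline = rmse(x_normal, y_normal, *baseline)
rmse_cleaned = rmse(x_normal, y_normal, *cleaned)
print(f"baseline RMSE: {rmse_baseline:.4f}, cleaned RMSE: {rmse_cleaned:.4f}")
```

In this toy setting the model fitted without the flagged interval recovers the underlying line, while the baseline is pulled toward the spike. On real data the size of the gain would depend on how well the detected intervals match true anomalies.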