Browse > Article
http://dx.doi.org/10.22156/CS4SMB.2022.12.04.260

A Study on Atmospheric Data Anomaly Detection Algorithm based on Unsupervised Learning Using Adversarial Generative Neural Network  

Yang, Ho-Jun (Department of Electric Computer Engineering, Inha University)
Lee, Seon-Woo (Department of Electric Computer Engineering, Inha University)
Lee, Mun-Hyung (Department of Electric Computer Engineering, Inha University)
Kim, Jong-Gu (Department of Electric Computer Engineering, Inha University)
Choi, Jung-Mu (Department of Computer Engineering, Inha University)
Shin, Yu-mi (Department of Computer Engineering, Inha University)
Lee, Seok-Chae (Department of Public Administration, Inha University)
Kwon, Jang-Woo (Department of Computer Engineering, Inha University)
Park, Ji-Hoon (Air Quality Research Department, Air Quality Research Division)
Jung, Dong-Hee (Air Quality Research Department, Air Quality Research Division)
Shin, Hye-Jung (Air Quality Research Department, Air Quality Research Division)
Publication Information
Journal of Convergence for Information Technology / v.12, no.4, 2022 , pp. 260-269 More about this Journal
Abstract
In this paper, We propose an anomaly detection model using deep neural network to automate the identification of outliers of the national air pollution measurement network data that is previously performed by experts. We generated training data by analyzing missing values and outliers of weather data provided by the Institute of Environmental Research and based on the BeatGAN model of the unsupervised learning method, we propose a new model by changing the kernel structure, adding the convolutional filter layer and the transposed convolutional filter layer to improve anomaly detection performance. In addition, by utilizing the generative features of the proposed model to implement and apply a retraining algorithm that generates new data and uses it for training, it was confirmed that the proposed model had the highest performance compared to the original BeatGAN models and other unsupervised learning model like Iforest and One Class SVM. Through this study, it was possible to suggest a method to improve the anomaly detection performance of proposed model while avoiding overfitting without additional cost in situations where training data are insufficient due to various factors such as sensor abnormalities and inspections in actual industrial sites.
Keywords
Air Quality; Machine Learning; Deep Learning; Unsupervised Learning; Anomaly Detection;
Citations & Related Records
연도 인용수 순위
  • Reference
1 A. Deng & B. Hooi. (2021). Graph Neural Network-Based Anomaly Detection in Multivariate Time Series. Proceedings of the AAAI Conference on Artificial Intelligence (Vol. 35, No. 5, pp. 4027-4035). ArXiv:2106.06947 DOI : 10.48550/arXiv.2106.06947   DOI
2 L. Ruthotto & E. Haber. (2021). An Introduction to Deep Generative Modeling. GAMM-Mitteilungen, 44(2), e202100008. ArXiv:2103.05180 DOI : 10.1002/gamm.202100008   DOI
3 S. G. K. Patro & K. K. sahu. (2015). Normalization: A Preprocessing Stage. IARJSET, 2(3), 20-22. DOI : 10.17148/IARJSET.2015.2305   DOI
4 K. Cho et al. (2014). Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation. ArXiv:1406.1078 DOI : 10.48550/arXiv.1406.1078
5 J. Davis & M. Goadrich. (2006). The relationship between Precision-Recall and ROC curves. Proceedings of the 23rd International Conference on Machine Learning - ICML '06, 233-240. DOI : 10.1145/1143844.1143874   DOI
6 H. Ren et al. (2019). Time-Series Anomaly Detection Service at Microsoft. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, (pp.3009-3017). DOI : 10.1145/3292500.3330680   DOI
7 Air Korea. (2021). Annual report of the Atmospheric Environment 2020(Online). https://www.airkorea.or.kr/web/detailViewDown?pMENU_NO=125
8 J. Zhou et al. (2020). Graph neural networks: A review of methods and applications. AI Open, 1, 57-81. DOI : 10.1016/j.aiopen.2021.01.001   DOI
9 J. Li, H. Izakian, W. Pedrycz & I. Jamal. (2020). Clustering-based anomaly detection in multivariate time series data. Applied Soft Computing, 100, 106919. DOI : 10.1016/j.asoc.2020.106919   DOI
10 B. Zhou, S. Liu, B. Hooi, X. Cheng & J. Ye. (2019). BeatGAN: Anomalous Rhythm Detection using Adversarially Generated Time Series. Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, (pp. 4433-4439). DOI : 10.24963/ijcai.2019/616   DOI
11 L. M. Manevitz & M. Yousef. (2001). One-Class SVMs for Document Classification. Journal of Machine Learning Research, 2(Dec), 139-154.
12 F. T. Liu, K. M. Ting & Z.-H. Zhou. (2008). Isolation Forest. 2008 Eighth IEEE International Conference on Data Mining, (pp. 413-422). DOI : 10.1109/ICDM.2008.17   DOI