Browse > Article
http://dx.doi.org/10.13088/jiis.2019.25.3.043

Anomaly Detection for User Action with Generative Adversarial Networks  

Choi, Nam woong (Department of Industrial Engineering, Yonsei University)
Kim, Wooju (Graduate School of Industrial Engineering, Yonsei University)
Publication Information
Journal of Intelligence and Information Systems / v.25, no.3, 2019 , pp. 43-62 More about this Journal
Abstract
At one time, the anomaly detection sector dominated the method of determining whether there was an abnormality based on the statistics derived from specific data. This methodology was possible because the dimension of the data was simple in the past, so the classical statistical method could work effectively. However, as the characteristics of data have changed complexly in the era of big data, it has become more difficult to accurately analyze and predict the data that occurs throughout the industry in the conventional way. Therefore, SVM and Decision Tree based supervised learning algorithms were used. However, there is peculiarity that supervised learning based model can only accurately predict the test data, when the number of classes is equal to the number of normal classes and most of the data generated in the industry has unbalanced data class. Therefore, the predicted results are not always valid when supervised learning model is applied. In order to overcome these drawbacks, many studies now use the unsupervised learning-based model that is not influenced by class distribution, such as autoencoder or generative adversarial networks. In this paper, we propose a method to detect anomalies using generative adversarial networks. AnoGAN, introduced in the study of Thomas et al (2017), is a classification model that performs abnormal detection of medical images. It was composed of a Convolution Neural Net and was used in the field of detection. On the other hand, sequencing data abnormality detection using generative adversarial network is a lack of research papers compared to image data. Of course, in Li et al (2018), a study by Li et al (LSTM), a type of recurrent neural network, has proposed a model to classify the abnormities of numerical sequence data, but it has not been used for categorical sequence data, as well as feature matching method applied by salans et al.(2016). So it suggests that there are a number of studies to be tried on in the ideal classification of sequence data through a generative adversarial Network. In order to learn the sequence data, the structure of the generative adversarial networks is composed of LSTM, and the 2 stacked-LSTM of the generator is composed of 32-dim hidden unit layers and 64-dim hidden unit layers. The LSTM of the discriminator consists of 64-dim hidden unit layer were used. In the process of deriving abnormal scores from existing paper of Anomaly Detection for Sequence data, entropy values of probability of actual data are used in the process of deriving abnormal scores. but in this paper, as mentioned earlier, abnormal scores have been derived by using feature matching techniques. In addition, the process of optimizing latent variables was designed with LSTM to improve model performance. The modified form of generative adversarial model was more accurate in all experiments than the autoencoder in terms of precision and was approximately 7% higher in accuracy. In terms of Robustness, Generative adversarial networks also performed better than autoencoder. Because generative adversarial networks can learn data distribution from real categorical sequence data, Unaffected by a single normal data. But autoencoder is not. Result of Robustness test showed that he accuracy of the autocoder was 92%, the accuracy of the hostile neural network was 96%, and in terms of sensitivity, the autocoder was 40% and the hostile neural network was 51%. In this paper, experiments have also been conducted to show how much performance changes due to differences in the optimization structure of potential variables. As a result, the level of 1% was improved in terms of sensitivity. These results suggest that it presented a new perspective on optimizing latent variable that were relatively insignificant.
Keywords
Autoencoder; Anomaly Score; Feature matching; Generative Adversarial Nets-Anomaly Detection; Optimizing latent variable;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Sun, B., P. B. Luh, Q. S. Jia, Z. O'Neill, and F. Song, "Building energy doctors: An spc and kalman filter-based method for system-level fault detection in hvac systems", IEEE Transactions on Automation Science and Engineering, Vol.11, No.1, (2014), 215-229.   DOI
2 Du, Z., B. Fan, X. Jin and J. Chi, "Fault detection and diagnosis for buildings and hvac systems using combined neural networks and subtractive clustering analysis", building and environment, Vol.73 (2014), 1-11   DOI
3 Koturwar, P., D. Mukhopadhyay and S. Griase, A survey of classification techniques in the area of big data, Department of Information Technology Maharashtra Instititute of Techonology, 2014, Available at https://arxiv.org/abs/1503.07477 (Downloaded 10 June, 2019)
4 Pimentel, A.F M., D. A. Clifton, L. Clifton and L. Tarassenko, "A review of novelty detection", Signal Processing, Vol.99, (2014), 215-249   DOI
5 Ye, N., S. Vilbert and Q. Chen, "Computer intrusion detection through ewma for autocorrelated and uncorrelated data", IEEE transactions on reliability, Vol.52, No.1, (2003).
6 He, X., Z. Wang, Y. Liu, and D. H. Zhou, "Least-squares fault detection and diagnosis for networked sensing systems using a direct state estimation approach", IEEE Transactions on Industrial Informatics, Vol.9, No.3, (2013), 1670-1679.   DOI
7 Ye, N. and Q. Chen, "An anomaly detection technique based on a chisquare statistic for detecting intrusions into information systems", Quality and Reliability Engineering International, Vol.17, No.2, (2001), 105-112.   DOI
8 Dai, X. and Z. Gao, "From model, signal to knowledge: A data-driven perspective of fault detection and diagnosis", IEEE Transactions on Industrial Informatics, Vol. 9, No. 4, (2013), 2226-2238.   DOI
9 Esteban, C., S. L. Hyland and G. Ratsch, Real-valued (medical) time series generation with recurrent conditional gans, Tri-Institutional Training Program in Computational Biology and Medicine Weill Cornell Medical, 2017. Available at https://arxiv.org/abs/1706.02633 (Downloaded 13 June, 2019)
10 Goh, J., S. Adepu, M. Tan and Z. S. Lee, Anomaly Deetection in cyber physical systems using recurrent neural networks, IEEE, Sigarpore, 2017.
11 Zenati, H., C. S. Foo, B. Lecouat, G. Manek and V.R Chandrasekhar, Efficient gan-based anomaly detection, ICDM, 2018. Available at https://arxiv.org/abs/1802.06222 (Downloaded 1 May 2019)
12 Goodfellow, I. J., J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville and Y. Bengio, "Generative adversarial nets", Advances in neural information processing systems, Vol. ACM, (2014)
13 Salimans, T., I. Goodfellow, W. Zaremba, V. Cheung, A. Radford, and X. Chen, "Improved techniques for training gans", In Advances in Neural Information Processing Systems, Vol.29, (2016), 2226-2234.
14 Hinton, G. E and R. R. salakhutdinov, "Reducing the dimensionality of Data with Neural Network", Science, Vol.313, No. 5786, (2006), 504-507.   DOI
15 Sakurada, M. and T. Yairi, Anomaly detection using autoencoders with nonlinear dimensionality reduction, Machine Learning for Sensory Data Analysis, Dunedin, 2014.
16 Fraink, J., Artificial intelligence and intrusion detection: Current and future directions, Division of Computer Science, University of California, 1994. Available at https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.42.5769&rank=1
17 Iwan, S., A. Prugel-Bennett and G. Wills, Networked digital Technologies, Springer, Dubai, 2012.
18 Deecke, L., R. Vandermeulen, L. Ruff, S. Mandt and M. Kloft, Anomaly Detection with Generative Adversarial Networks, 2018. Available at https://openreview.net/forum?id=S1EfylZ0Z (Downloaded 13 June 2019)
19 Li, D., D. Chen, J. Goh and S-K. Ng, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications, DBLP, London, 2018.
20 Schlegl T., P. Seebock, S. M. Waldstein, U. Schmidt and G. Langs, Computer vision and pattern Recognition, IPMI, North Carolina, 2017.