Fig. 1. An Illustration of a Clustering-Based Ensemble Method for Outlier Detection
Fig. 2. The Process of a Clustering-Based Ensemble Method for Outlier Detection
Fig. 3. An Illustration of Clustering-Based Ensemble Method for Outlier Detection on Streaming Data
Fig. 4. The Performance Comparison with Respect to the k Value in k-means Clustering and the Number of Ensemble Members
Fig. 5. The Effects of Window Size in the Ensemble Outlier Detection Method Using RBFevents_drift and RBFevents_no_drift data
Table 1. The Description of Data Sets
Table 2. Performance Comparison by Precision(P), Recall(R), and F1-measure of the Compared Methods
Table 3. Performance Comparison of the Ensemble Outlier Detection Method Using RBFevents_drift and RBFevents_no_drift Data
References
- E. M. Knorr and R. T. Ng, "Finding intensional knowledge of distance-based outliers," in Proceedings of 25th International Conference on Very Large Databases, 1999.
- M. Markou and A. Singh, "Novelty detection: a review-part 1: statistical approaches," Signal Processing, Vol.83, No.12, pp.2481-2497, 2003. https://doi.org/10.1016/j.sigpro.2003.07.018
- M. Markou and A. Singh, "Novelty detection: a review - part 2: neural network based approaches," Signal Processing, Vol.83, No.12, pp.2499-2521, 2003. https://doi.org/10.1016/j.sigpro.2003.07.019
- R. Domingues and M. Filippone, P. Michiardi, and J. Zouaoui, "A comparative evaluation of outlier detection algorithms: Experiments and analyses," Pattern Recognition, Vol.74, pp.406-421, 2018. https://doi.org/10.1016/j.patcog.2017.09.037
- C. Aggarwal, "Outlier analysis," Springer, Switzerlnd, 2017.
- K. Wu, K. Zhang, W. Fan, A. Edwards, and P. Yu, "RSForest: A rapid density estimator for streaming anomaly detection," In Proceedings of the 14th International Conference on Data Mining, 2014.
- F. Angiulli and F. Fassetti, "Detecting distance-based outlies in streams of data," in Proceedings of CIKM, 2007.
- M. M. Breunig. H-P. Kriegel. R. T. Ng, and J. Sander, "LOF: Identifying density-based local outliers," in Proceedings of the 2000 ACM Sigmod International Conference on Management of Data, 2000.
- K. Ting, F. Liu, and Z. Zhou, "Isolation forest," In Proceedings of the 8th International Conference on Data Mining, 2008.
- S. Zhai, Y. Cheng, W. Lu, and Z. Zhang, "Deep structured energy based models for anomaly detection," in Proceedings of the ICML, 2016.
- X. Xu, Z. He, and S. Deng, "Discovering cluster-based local outliers," Pattern Recognition Letters, Vol.24, pp.1641-1650, 2003. https://doi.org/10.1016/S0167-8655(03)00003-5
- E. Spinosa, A. Carvalho, and J. Gama, "Olindda: A clusterbased approach for detecting novelty and concept drift in data streams," In Proceedings of the SAC, 2007.
- Creditcard/Kaggle [Internet], https://www.kaggle.com/ arvindratan/creditcard/version/1.
- S. Hawkins, H. Hongxing, G. Williams, and R. Baxter, "Outlier detection using replicator neural networks," In Proceedings of the International Conference on Data Warehousing and Knowledge Discovery, 2002.
- A. Bifet, G. Holmes, R. Kirkby, and B. Pfahringer, "MOA: Massive online analysis," Journal of Machine Learning Research, Vol.11, pp.1601-1604, 2010.
- M. Steinbach, P. Tan, and V. Kumar, "Introduction to Data Mining," Addison Wesley, Boston, 2006.
- B. Scholkopf, J. Platt, J. Shawe-Taylor, A. Smola, and R. Williamson, "Estimating the support of a high-dimensional distribution," Neural Computation, Vol.13, No.7, pp.1443- 1471, 2001. https://doi.org/10.1162/089976601750264965
- F. Pedregosa et al, "Scikit-learn: Machine Learning in Python," Journal of Machine Learning Research, Vol.12, pp. 2825-2830, 2011.