DOI QR코드

DOI QR Code

kNNDD-based One-Class Classification by Nonparametric Density Estimation

비모수 추정방법을 활용한 kNNDD의 이상치 탐지 기법

  • Son, Jung-Hwan (School of Industrial Management Engineering, Korea University) ;
  • Kim, Seoung-Bum (School of Industrial Management Engineering, Korea University)
  • 손정환 (고려대학교 산업경영공학부) ;
  • 김성범 (고려대학교 산업경영공학부)
  • Received : 2012.02.20
  • Accepted : 2012.06.12
  • Published : 2012.09.01

Abstract

One-class classification (OCC) is one of the recent growing areas in data mining and pattern recognition. In the present study we examine a k-nearest neighbors data description (kNNDD) algorithm, one of the OCC algorithms widely used. In particular, we propose to use nonparametric estimation methods to determine the threshold of the kNNDD algorithm. A simulation study has been conducted to explore the characteristics of the proposed approach and compare it with the existing approach that determines the threshold. The results demonstrate the usefulness and flexibility of the proposed approach.

Keywords

References

  1. Alireza, Tavakkoli., Amol, Ambardekar., Mircea, Nicolescu., and Sushil, J. Louiset. (2007), A Genetic Approach to Training Support Vector Data Descriptors for Background Modeling in Video Data, International Symposium on Visual Computing, Lake Tahoe, 318-327.
  2. Berry, M. J. A. and Linoff, G. (1997), Data Mining Techniques, John Wiley and Sons, Inc.
  3. Breunig, M. M., Kriegel, H. P., Ng, R. T., and Sander, J. (2000), LOF : Identifying density-based local outliers. in Proceedings of the ACM SIGMOD 2000 international conference on management of data, 29, 93-104.
  4. Burden, R. L. and Faires, J. D. (2000), Numerical Analysis, Seventh Edition, Brooks/Core, Parcific Grove, CA.
  5. Duda, R. and Hart, P. (1973), Pattern Classification and Scene Analysis, John Wiley and Sons, New York.
  6. Efron, B. and Tibshirani, R. (1993), An Introduction to the Bootstrap, Chapman and Hall/CRC, Boca Raton, FL.
  7. Hand, David., Mannila, Heikki., and Smyth, Padhraic. (2001), Principles of data mining, Adaptive Computation and Machine Learning Series, MIT Press.
  8. Khan, S. S. and Madden, M. G. (2010), A survey of recent trends in one class classication, Articial Intelligence and Cognitive Science-20th Irish Conference, Lecture Notes in Computer Science, 6206, 188-197, Springer.
  9. Kim, S. B., Sukchotrat, T., and Park, S. K. (2011), A nonparametric fault isolation approach through one-class classification algorithms, IIE Transactions, 43, 505-517. https://doi.org/10.1080/0740817X.2010.523769
  10. Koppel, M. and Schler, J. (2004), Authorship verification as a one-class classification problem, in Proceedings of 21st International Conference on Machine Learning.
  11. Mason, R. L. and Young, J. C. (2002), Multivariate Statistical Process Control With Industrial Applications, American Statistical Association and the Society for Industrial and Applied Mathematics, Philadelphia, PA.
  12. Manevitz, L. M. and Yousef, M. (2001), One-class svms for document classification, Journal of Machine Learning Research, 2, 139-154.
  13. Sanchez-Yanez, R. E., Kurmyshev, E. V., and Fernandex, A. (2003), One-class texture classifier in the CCR feature space, Pattern Recognition Letter, 24.
  14. Runger, G. C., Alt, F. B., and Montgomery, D. C. (1996), "Contributors to a multivariate statistical process control chart signal," Communications in Statistics : Theory and Methods, 25(10), 2203-2213. https://doi.org/10.1080/03610929608831832
  15. Silverman, B. W. (1986), Density Estimation for Statistics and Data Analysis. Chapmand and Hall, London, United Kingdom.
  16. Song, S. I., Cho, Y. C., and Park, H. K. (2003), "Robust Control Chart using Bootstrap Method," Journal of the Society of Korea Industrial and Systems Engineering, 26(3), 39-49.
  17. Sukchotrat, T., Kim, S. B., and Tsung, F. (2010), "One-class classificationbased control charts for multivariate process monitoring," IIE Transactions, 42, 107-120.
  18. Tan, P. N., Stein, M., and Kumar, V. (2007), Introduction to datamining, Infinity books, Seoul, Korea.
  19. Tax, D. M. J. (2001), One-class classification : Concept-learning in the absence of counter-examples, PHD thesis, Delf University of Technology, Netherlands.
  20. Wasserman, Larry. (2006), All of nonparametric statistics, Springer Texts in Statistics.
  21. Woodall, W. H. and Montgomery, D. C. (1999), "Research issues and ideas in statistical process control," Journal of Quality Technology, 31(4), 376-386.