Browse > Article
http://dx.doi.org/10.5351/KJAS.2022.35.5.593

Fraud detection support vector machines with a functional predictor: application to defective wafer detection problem  

Park, Minhyoung (Department of Statistics, Korea University)
Shin, Seung Jun (Department of Statistics, Korea University)
Publication Information
The Korean Journal of Applied Statistics / v.35, no.5, 2022 , pp. 593-601 More about this Journal
Abstract
We call "fruad" the cases that are not frequently occurring but cause significant losses. Fraud detection is commonly encountered in various applications, including wafer production in the semiconductor industry. It is not trivial to directly extend the standard binary classification methods to the fraud detection context because the misclassification cost is much higher than the normal class. In this article, we propose the functional fraud detection support vector machine (F2DSVM) that extends the fraud detection support vector machine (FDSVM) to handle functional covariates. The proposed method seeks a classifier for a function predictor that achieves optimal performance while achieving the desired sensitivity level. F2DSVM, like the conventional SVM, has piece-wise linear solution paths, allowing us to develop an efficient algorithm to recover entire solution paths, resulting in significantly improved computational efficiency. Finally, we apply the proposed F2DSVM to the defective wafer detection problem and assess its potential applicability.
Keywords
fraud detection; functional data; piece-wise linear solution paths; support vector machine;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Chawla NV, Bowyer KW, Hall LO, and Kegelmeyer WP (2002). SMOTE: synthetic minority over-sampling technique, Journal of Artificial Intelligence Research, 16, 321-357.   DOI
2 Feng Y, Zhou M, and Tong X (2021). Imbalanced classification: A paradigm-based review, Statistical Analysis and Data Mining: The ASA Data Science Journal, 14, 383-406.   DOI
3 James GM (2002). Generalized linear models with functional predictors, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 64, 411-432.   DOI
4 James GM and Hastie TJ (2001). Functional linear discriminant analysis for irregularly sampled curves, Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63, 533-550.   DOI
5 Olszewski and Thomas R (2001). Generalized Feature Extraction for Structural Pattern Recognition in Timeseries Data, Carnegie Mellon University, Pennsylvania.
6 Park C, Koo JY, Kim S, Sohn I, and Lee JW (2008). Classification of gene functions using support vector machine for time-course gene expression data, Computational Statistics & Data Analysis, 52, 2578-2587.   DOI
7 Vapnik V (1999). The Nature of Statistical Learning Theory, Springer science & business media, Berlin.
8 Woodall WH, Spitzner DJ, Montgomery DC, and Gupta S (2004). Using control charts to monitor process and product quality profiles, Journal of Quality Technology, 36, 309-320.   DOI
9 Yao L, Tang J, and Li J (2009). Entire solution path for support vector machine for positive and unlabeled classification, Tsinghua Science and Technology, 14, 242-251,   DOI
10 Biau G, Florentina B, and Wegkamp MH (2005). Functional classification in H ilbert spaces, IEEE Transactions on Information Theory, 51, 2163-2172.   DOI
11 Bolton RJ and Hand DJ (2002). Statistical fraud detection: A review, Statistical Science, 17, 235-255.
12 Wahba G (1990). Spline Models for OIbservational Data, SIAM, Pennsylvania.
13 Hastie T, Rosset S, Tibshirani R, and Zhu J (2004). The entire regularization path for the support vector machine, Journal of Machine Learning Research, 5, 1391-1415.
14 Kimeldorf G and Wahba G (1971). Some results on Tchebycheffian spline functions, Journal of Mathematical Analysis and Applications, 33, 82-95.   DOI
15 Ramsay JO and Silverman BW (2002). Applied Functional Data Analysis: Methods and Case Studies, Springer, New York.
16 Rossi F and Villa N (2006). Support vector machine for functional data classification, Neurocomputing, 69, 730-742.   DOI