Search | Korea Science

Optimization of Classifier Performance at Local Operating Range: A Case Study in Fraud Detection

Park Lae-Jeong;Moon Jung-Ho
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- v.5 no.3
- /
- pp.263-267
- /
- 2005
Building classifiers for financial real-world classification problems is often plagued by severely overlapping and highly skewed class distribution. New performance measures such as receiver operating characteristic (ROC) curve and area under ROC curve (AUC) have been recently introduced in evaluating and building classifiers for those kind of problems. They are, however, in-effective to evaluation of classifier's discrimination performance in a particular class of the classification problems that interests lie in only a local operating range of the classifier, In this paper, a new method is proposed that enables us to directly improve classifier's discrimination performance at a desired local operating range by defining and optimizing a partial area under ROC curve or domain-specific curve, which is difficult to achieve with conventional classification accuracy based learning methods. The effectiveness of the proposed approach is demonstrated in terms of fraud detection capability in a real-world fraud detection problem compared with the MSE-based approach.
https://doi.org/10.5391/IJFIS.2005.5.3.263 인용 PDF KSCI

Semi-Supervised Learning for Fault Detection and Classification of Plasma Etch Equipment (준지도학습 기반 반도체 공정 이상 상태 감지 및 분류)

Lee, Yong Ho;Choi, Jeong Eun;Hong, Sang Jeen
- Journal of the Semiconductor & Display Technology
- /
- v.19 no.4
- /
- pp.121-125
- /
- 2020
With miniaturization of semiconductor, the manufacturing process become more complex, and undetected small changes in the state of the equipment have unexpectedly changed the process results. Fault detection classification (FDC) system that conducts more active data analysis is feasible to achieve more precise manufacturing process control with advanced machine learning method. However, applying machine learning, especially in supervised learning criteria, requires an arduous data labeling process for the construction of machine learning data. In this paper, we propose a semi-supervised learning to minimize the data labeling work for the data preprocessing. We employed equipment status variable identification (SVID) data and optical emission spectroscopy data (OES) in silicon etch with SF6/O2/Ar gas mixture, and the result shows as high as 95.2% of labeling accuracy with the suggested semi-supervised learning algorithm.
PDF KSCI

A Study proposal for URL anomaly detection model based on classification algorithm (분류 알고리즘 기반 URL 이상 탐지 모델 연구 제안)

Hyeon Wuu Kim;Hong-Ki Kim;DongHwi Lee
- Convergence Security Journal
- /
- v.23 no.5
- /
- pp.101-106
- /
- 2023
Recently, cyberattacks are increasing in social engineering attacks using intelligent and continuous phishing sites and hacking techniques using malicious code. As personal security becomes important, there is a need for a method and a solution for determining whether a malicious URL exists using a web application. In this paper, we would like to find out each feature and limitation by comparing highly accurate techniques for detecting malicious URLs. Compared to classification algorithm models using features such as web flat panel DB and based URL detection sites, we propose an efficient URL anomaly detection technique.
https://doi.org/10.33778/kcsa.2023.23.5.101 인용 PDF HTML

A Study on Plagiarism Detection and Document Classification Using Association Analysis (연관분석을 이용한 효과적인 표절검사 및 문서분류에 관한 연구)

Hwang, Insoo
- The Journal of Information Systems
- /
- v.23 no.3
- /
- pp.127-142
- /
- 2014
Plagiarism occurs when the content is copied without permission or citation, and the problem of plagiarism has rapidly increased because of the digital era of resources available on the World Wide Web. An important task in plagiarism detection is measuring and determining similar text portions between a given pair of documents. One of the main difficulties of this task is that not all similar text fragments are examples of plagiarism, since thematic coincidences also tend to produce portions of similar text. In order to handle this problem, this paper proposed association analysis in data mining to detect plagiarism. This method is able to detect common actions performed by plagiarists such as word deletion, insertion and transposition, allowing to obtain plausible portions of plagiarized text. Experimental results employing an unsupervised document classification strategy showed that the proposed method outperformed traditionally used approaches.
https://doi.org/10.5859/KAIS.2014.23.3.127 인용 PDF KSCI

One-class Classification based Fault Classification for Semiconductor Process Cyclic Signal (단일 클래스 분류기법을 이용한 반도체 공정 주기 신호의 이상분류)

Cho, Min-Young;Baek, Jun-Geol
- IE interfaces
- /
- v.25 no.2
- /
- pp.170-177
- /
- 2012
Process control is essential to operate the semiconductor process efficiently. This paper consider fault classification of semiconductor based cyclic signal for process control. In general, process signal usually take the different pattern depending on some different cause of fault. If faults can be classified by cause of faults, it could improve the process control through a definite and rapid diagnosis. One of the most important thing is a finding definite diagnosis in fault classification, even-though it is classified several times. This paper proposes the method that one-class classifier classify fault causes as each classes. Hotelling T2 chart, kNNDD(k-Nearest Neighbor Data Description), Distance based Novelty Detection are used to perform the one-class classifier. PCA(Principal Component Analysis) is also used to reduce the data dimension because the length of process signal is too long generally. In experiment, it generates the data based real signal patterns from semiconductor process. The objective of this experiment is to compare between the proposed method and SVM(Support Vector Machine). Most of the experiments' results show that proposed method using Distance based Novelty Detection has a good performance in classification and diagnosis problems.
https://doi.org/10.7232/IEIF.2012.25.2.170 인용 PDF KSCI

Survey on Detection and Recognition of Road Marking

Vokhidov, Husan;Hong, Hyung Gil;Hoang, Toan Minh;Kang, JinKyu;Park, Kang Ryoung;Cho, Hyeong Oh
- Proceedings of the Korea Information Processing Society Conference
- /
- 2015.10a
- /
- pp.1408-1410
- /
- 2015
Information about the painted road markings and other painted road objects play an important part in keeping safety of drivers. Some researchers have presented research approaches and dealt with road markings detection. In this paper, we present comprehensive survey of these techniques, and review some of them like a machine learning method, template matching method for road markings detection and classification, method of detection and classification of road markings using curve-based prototype fitting, signed edge signature method.
https://doi.org/10.3745/PKIPS.y2015m10a.1408 인용 PDF

On the Performance of Cuckoo Search and Bat Algorithms Based Instance Selection Techniques for SVM Speed Optimization with Application to e-Fraud Detection

AKINYELU, Andronicus Ayobami;ADEWUMI, Aderemi Oluyinka
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.12 no.3
- /
- pp.1348-1375
- /
- 2018
Support Vector Machine (SVM) is a well-known machine learning classification algorithm, which has been widely applied to many data mining problems, with good accuracy. However, SVM classification speed decreases with increase in dataset size. Some applications, like video surveillance and intrusion detection, requires a classifier to be trained very quickly, and on large datasets. Hence, this paper introduces two filter-based instance selection techniques for optimizing SVM training speed. Fast classification is often achieved at the expense of classification accuracy, and some applications, such as phishing and spam email classifiers, are very sensitive to slight drop in classification accuracy. Hence, this paper also introduces two wrapper-based instance selection techniques for improving SVM predictive accuracy and training speed. The wrapper and filter based techniques are inspired by Cuckoo Search Algorithm and Bat Algorithm. The proposed techniques are validated on three popular e-fraud types: credit card fraud, spam email and phishing email. In addition, the proposed techniques are validated on 20 other datasets provided by UCI data repository. Moreover, statistical analysis is performed and experimental results reveals that the filter-based and wrapper-based techniques significantly improved SVM classification speed. Also, results reveal that the wrapper-based techniques improved SVM predictive accuracy in most cases.
https://doi.org/10.3837/tiis.2018.03.021 인용 PDF KSCI

The Damage Classification by Periodicity Detection of Ultrasonic Wave Signal to Occur at the Tire (타이어에서 발생하는 초음파 신호의 주기성 검출에 의한 손상 분별)

Oh, Young-Dal;Kang, Dae-Soo
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.10 no.6
- /
- pp.107-111
- /
- 2010
The damage of tire by damage material classification method is researched as used ultrasonic wave signal to occur at a tire during vehicle driving. Auto-correlation function after having passed through an envelope detecting preprocess is used for detecting periodicity because of occurring periodic ultrasonic waves signal with tire revolution. One revolution cycle time of a damaged tire and period that calculated auto-correlation function appeared equally in experiment. The result that can classification whether or not there was a tire damage is established.
PDF KSCI

Obstacle Detection and Classification Algorithm using a Laser Scanner (레이저 스캐너를 이용한 장애물 탐색 및 분리 알고리즘 개발)

Lee, Gi-Roung;Hong, Suk-Kyo;Chwa, Dong-Kyoung
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.57 no.4
- /
- pp.677-685
- /
- 2008
This paper proposes algorithm for the obstacle detection and classification using a single laser scanner. In a measurement data from a laser scanner, there exist points with large differential value called singular points, which can be used to obtain the boundary of an obstacle such that obstacle information can be analyzed. On the other hand, measurement data include a lot of measurement error, which makes it difficult to analyze the accurate obstacle information. To solve this problem, the least square estimation algorithm is used to obtain the accurate information using a single laser scanner, by compensation for the measurement error. This algorithm can be used for the effective obstacle avoidance of mobile robots, and the experimental results are included to demonstrate the effectiveness of the propose algorithm.
PDF KSCI

Lane Detection and Tracking Using Classification in Image Sequences

Lim, Sungsoo;Lee, Daeho;Park, Youngtae
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.8 no.12
- /
- pp.4489-4501
- /
- 2014
We propose a novel lane detection method based on classification in image sequences. Both structural and statistical features of the extracted bright shape are applied to the neural network for finding correct lane marks. The features used in this paper are shown to have strong discriminating power to locate correct traffic lanes. The traffic lanes detected in the current frame is also used to estimate the traffic lane if the lane detection fails in the next frame. The proposed method is fast enough to apply for real-time systems; the average processing time is less than 2msec. Also the scheme of the local illumination compensation allows robust lane detection at nighttime. Therefore, this method can be widely used in intelligence transportation systems such as driver assistance, lane change assistance, lane departure warning and autonomous vehicles.
https://doi.org/10.3837/tiis.2014.12.014 인용 PDF KSCI KPUBS HTML

Search Result 1,931, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)