DOI QR코드

DOI QR Code

머신러닝 기법을 이용한 한국어 보이스피싱 텍스트 분류 성능 분석

Korean Voice Phishing Text Classification Performance Analysis Using Machine Learning Techniques

  • 발행 : 2021.11.04

초록

Text classification is one of the popular tasks in Natural Language Processing (NLP) used to classify text or document applications such as sentiment analysis and email filtering. Nowadays, state-of-the-art (SOTA) Machine Learning (ML) and Deep Learning (DL) algorithms are the core engine used to perform these classification tasks with high accuracy, and they show satisfying results. This paper conducts a benchmarking performance's analysis of multiple SOTA algorithms on the first known labeled Korean voice phishing dataset called KorCCVi. Experimental results reveal performed on a test set of 366 samples reveal which algorithm performs the best considering the training time and metrics such as accuracy and F1 score.

키워드

과제정보

This research was supported by the MSIT(Ministry of Science and ICT), Korea, under the National Program for Excellence in SW)(2018-0-00209) supervised by the IITP(Institute of Information & communications Technology Planning & Evaluation).