Browse > Article

Development of e-Mail Classifiers for e-Mail Response Management Systems  

Kim, Kuk-Pyo (동국대학교 산업공학과)
Kwon, Young-S. (동국대학교 산업공학과)
Publication Information
Journal of Information Technology Services / v.2, no.2, 2003 , pp. 87-95 More about this Journal
Abstract
With the increasing proliferation of World Wide Web, electronic mail systems have become very widely used communication tools. Researches on e-mail classification have been very important in that e-mail classification system is a major engine for e-mail response management systems which mine unstructured e-mail messages and automatically categorize them. in this research we develop e-mail classifiers for e-mail Response Management Systems (ERMS) using naive bayesian learning and centroid-based classification. We analyze which method performs better under which conditions, comparing classification accuracies which may depend on the structure, the size of training data set and number of classes, using the different data set of an on-line shopping mall and a credit card company. The developed e-mail classifiers have been successfully implemented in practice. The experimental results show that naive bayesian learning performs better, while centroid-based classification is more robust in terms of classification accuracy.
Keywords
Text Categorization; Text Mining; e-Mail Response Management System; Machine Learning;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Diao. Y., Lu. H. and Wu. D., A Comparative Study of Classification Based Personal E-mail Filtering, PAKDD, 2000
2 Mitchell. T. M., Machine Learning, The McGraw-Hill Company, 1997
3 Yang. Y. and Pedersen. J., A Comparative Study On Feature Selection in Text Categorization,' ICML, 1997
4 윤종식, '배깅과 부스팅을 이용한 나이브 베이지안 이메일 분류기의 성능향상', '동국대학교 석사학위논문', 2001
5 황호순, '프론트 앤드 e-CRM을 위한 전자메일 분류기 개발', '동국대학교 석사학위논문', 2001
6 Han(Sam). E. H. and Karypis. G., Centroid -Based Document Classification:Analysis & Experimental Results,' PAKDD, 2000
7 Lewis. D. and Ringuette. M., Comparison of Two Learning Algorithms for Text Categorization, In Tenth European Conference on Machine Learning, 1998
8 Dietterich. T. G., 'Approximate Statistical tests for Comparing Supervised Classification Learning Algorithms,' Neural Computation, Vol.10, No.7(1998)
9 Salton. G., Automatic Text Processing:The Transformation, Analysis, and Retrieval of Information by Computer, Addison Wesley, 1989
10 McCallum. A. and Nigam. K., A Comparison of Event Models for Naive Bayes Text Classification, In AAAI-98 Workshop on Learning for Text Categorization, 1998
11 Dumais. S. S., Heckerman. D. and Horvitz. E., A Bayesian Approach to Filtering Junk e-Mail, AAAI Technical Report WS-98-05, 1998