Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2002.9B.6.759

A Machine Learning Approach to Web Image Classification  

Cho, Soo-Sun (한국전자통신연구원 정보가전연구부)
Lee, Dong-Woo (한국전자통신연구원 정보가전연구부)
Han, Dong-Won (한국전자통신연구원)
Hwang, Chi-Jung (충남대학교 컴퓨터과학과)
Abstract
Although image occupies a large part of importance on the Web documents, there have not been many researches for analyzing and understanding it. Many Web images are used for carrying important information but others are not used for it. In this paper classify the Web images from presently served Web sites to erasable or non-erasable classes. based on machine learning methods. For this research, we have detected 16 special and rich features for Web images and experimented by using the Baysian and decision tree methods. As the results, F-measures of 87.09%, 82.72% were achived for each method and particularly, from the experiments to compare the effects of feature groups, it has proved that the added features on this study are very useful for Web image classification.
Keywords
Image Classification; Machine Learning; Features of Web Images; Bayes Classifier; Decision Tree;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 ADEW, 'HTML Analyser,' http://www.htmlanalyser.com/
2 Knowledge Media Institute and The Open University, 'RoC: The Robust Bayesian Classifier,' http://kmi.open.ac.uk/projects/bkd/
3 S. Chandra, A. Gehani, C. S. Ellis and A. Vahdat, 'Transcoding Characteristics of Web Images,' In Proc. Multimedia Computing and Networking, San Jose, CA, Vol.4312, pp. 135-149, January, 2001
4 T. M. Mitchell, 'Machine Learning', McGraw-Hill, 1997
5 G. Penn, J. Hu, H. Luo and R. McDonald, 'Flexible web document analysis for delivery to narrow-bandwidth devices,' In Proc. 6th International Conference on Document Analysis and Recognition, Seattle, WA, USA, pp.1074-1078, September, 2001   DOI
6 J. R. Smith, R. Mohan and C-S Li, 'Content-Based Transcoding of Images In The Internet,' In Proc. IEEE Inter. Conf. Image Processing, October, 1998   DOI
7 S. Paek, 'Detecting image purpose in World-Wide Web documents,' In Proc. IS&T/SPIE Symposium on Electronic Imaging: Science and Technology Document Recognition, San Jose, CA, USA, January, 1998
8 M. J Swain, C. Frankel and V. Athitsos, 'WebSeer : An Image Search Engine for the World Wide Web,' In Proc. IEEE Computer Vision and Pattern Recognition Conference, June, 1997
9 김명관, '2단계 분류기법을 이용한 영상분류기 개발', 한국컴퓨터산업교육학회논문집, Vol.3., No.5, pp.605-610, 2002   과학기술학회마을
10 Rulequest Research, 'Data Mining Tools See5 and C5.0,' http://www.rulequest.com/see5-info.html
11 S. Chandra and C. S. Ellis, ']pEG Compression Metric as a Quality Aware Image Transcoding,' In Proc. USENIX 2nd Symposium on Internet Technologies and Systems, Boulder, CO, pp.81-92, October, 1999
12 Y. Wang and]. Hu, 'A Machine Learning Based Approach for Table Detection on The Web,' In Proc. The n' International World Wide Web Conference, Honolulu, Hawaii, USA, pp.242-250, May, 2002   DOI