Browse > Article

A SVM-based Method for Classifying Tagged Web Resources using Tag Stability of Folksonomy in Categories  

Koh, Byung-Gul (서울대학교 컴퓨터공학부)
Lee, Kang-Pyo (서울대학교 컴퓨터공학부)
Kim, Hyoung-Joo (서울대학교 컴퓨터공학부)
Abstract
Folksonomy, which is collaborative classification created by freely selected keywords, is one of the driving factors of the web 2.0. Folksonomy has advantage of being built at low cost while its weakness is lack of hierarchical or systematic structure in comparison with taxonomy. If we can build classifier that is able to classify web resources from collective intelligence in taxonomy, we can build taxonomy at low cost. In this paper, targeting folksonomy in Slashdot.org, we define a general model and show that collective intelligence, which can build classifier, really exists in folksonomy using a stability value. We suggest method that builds SVM classifier using stability that is result from this collective intelligence. The experiment shows that our proposed method managed to build taxonomy from folksonomy with high accuracy.
Keywords
Folksonomy; Tag; Collective intelligence; Taxonomy; Classification; SVM;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Mathes, A., Folksonomies-Cooperative Classifica-tion and Communication Through Shared Meta-data. Computer Mediated Communication, LIS590 CMC (Doctoral Seminar), Graduate School of Lib-rary and Information Science, University of Illinois Urbana-Champaign, December, 2004
2 Sinha, R., A cognitive analysis of tagging. Rashmi Sinha's weblog, available at: www. rashmisinha.com/archivcs/05_09/tagging-cognitive.html, 2005
3 Brooks, C.H. and N. Montanez, Improved annota-tion of the blogosphere via autotagging and hiera-rchical clustering. Proceedings of the 15th inter-national conference on World Wide Web, pp. 625-632, 2006   DOI
4 Halpin, H., V. Rohu, and H. Shepherd, The com-plex dynamics of collaborative tagging. Proceed-ings of the 16th international conference on World Wide Web, pp. 211-220, 2007   DOI
5 Joachims, T., Text categorization with support vector machines: Learning with many relevant features. Proceedings of ECML-98, 10th European Conference on Machine Learning, 1398, pp. 137-142, 1998
6 Sebastiani, F., Machine learning in automated text categorization. ACM Computing Surveys (CSUR), 34(1), pp. 1-47, 2002   DOI   ScienceOn
7 Kohavi, R., A study of cross-validation and boot-strap for accuracy estimation and model selection. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, 2(12), pp. 1137-1143, 1995
8 Chirita, P.A., et al., P-TAG: large scale automatic generation of personalized annotation tags for the web. Proceedings of the 16th international con-ference on World Wide Web, pp. 845-854, 2007   DOI
9 Golder, S. and B.A. Huberman, Usage Patterns of Collaborative Tagging Systems. Journal of Infor-mation Science, 32(2), pp. 198-208, 2006   DOI   ScienceOn
10 Smith, G., Atomiq: Folksonomy: social classifica-tion. Information Architecture, 2004. 3
11 Schmitz, P. Inducing ontology from Flickr tags. in Proceedings the Collaborative Web Tagging Workshop, WWW. 2006
12 Slashdot. httpt/slashdot.org
13 Li, X., L. Guo, and Y. Zhao. Tag-based Social Interest Discovery. in Proceedings of the 17th International World Wide Web Conference. 2008   DOI
14 Boser, B.E., I.M. Guyon, and V.N. Vapnik, A training algorithm for optimal margin classifiers. Proceedings of the fifth annual workshop on Com-putational learning theory, pp. 144-152, 1992   DOI
15 Yang, Y. and J.O. Pedersen, A Comparative Study on Feature Selection in Text Categorization. Pro-ceedings of the Fourteenth International Confer-ence on Machine Learning table of contents, pp. 412-420, 1997