http://dx.doi.org/10.7838/jsebs.2011.16.4.301

Post Clustering Method using Tag Hierarchy for Blog Search  

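Lee, Ki-Jun (Central Research Institute, Pantech)
Kim, Kyung-Min (Department of Information and Industrial Engineering, College of Engineering, Yonsei University)
Lee, Myung-Jin (Department of Information and Industrial Engineering, College of Engineering, Yonsei University)
Kim, Woo-Ju (Department of Information and Industrial Engineering, College of Engineering, Yonsei University)
Hong, June-S. (Department of Management Information Systems, College of Economics and Business Administration, Kyonggi University)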
Publication Information
The Journal of Society for e-Business Studies / v.16, no.4, 2011, pp. 301-319
Abstract
Blogs play an important role as a new type of knowledge base, distinct from traditional web resources. Whereas information resources on conventional websites cover a wide range of topics, information resources on blogs are concentrated in specific units of information that reflect the user's interests, and the tags attached to published resources provide criteria for classifying them. In this research, we build a tag hierarchy from the title keywords and tags of blog posts, and propose a post clustering methodology that applies this hierarchy. We generate a tag hierarchy that reflects the relationships between tags and develop a tag clustering methodology based on tag similarity. We then analyze the applicability of the proposed methodology using real-world examples and evaluate its performance with a prototype system.
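The abstract does not specify how the hierarchy is built or how similarity is measured, so the following is only an illustrative sketch under stated assumptions, not the authors' method. It assumes a co-occurrence subsumption rule for parent selection (with a hypothetical `threshold` parameter), a Wu-Palmer-style depth ratio for tag similarity, and grouping of posts by the root tag most common among their tags. All function names and the sample data are hypothetical.

```python
from collections import Counter
from itertools import combinations


def build_tag_hierarchy(posts, threshold=0.8):
    """Map each tag to a parent tag (or None for a root).

    Assumed rule: a more frequent tag P is a candidate parent of tag C when
    at least `threshold` of the posts tagged C are also tagged P. Among the
    candidates, the least frequent (most specific) one is chosen.
    """
    freq = Counter(t for tags in posts.values() for t in sorted(set(tags)))
    co = Counter()  # co[(a, b)] = number of posts tagged with both a and b
    for tags in posts.values():
        for a, b in combinations(sorted(set(tags)), 2):
            co[(a, b)] += 1
            co[(b, a)] += 1
    parent = {}
    for child in freq:
        best = None
        for cand in freq:
            # Requiring a strictly higher frequency rules out cycles.
            if cand == child or freq[cand] <= freq[child]:
                continue
            if co[(child, cand)] / freq[child] >= threshold:
                if best is None or freq[cand] < freq[best]:
                    best = cand
        parent[child] = best
    return parent


def path_to_root(tag, parent):
    """Return the list [tag, parent, grandparent, ..., root]."""
    path = [tag]
    while parent.get(path[-1]) is not None:
        path.append(parent[path[-1]])
    return path


def tag_similarity(a, b, parent):
    """Wu-Palmer-style score: 2 * depth(lca) / (depth(a) + depth(b))."""
    path_a, path_b = path_to_root(a, parent), path_to_root(b, parent)
    ancestors_b = set(path_b)
    for node in path_a:  # the first shared node is the lowest common ancestor
        if node in ancestors_b:
            lca_depth = len(path_to_root(node, parent))
            return 2 * lca_depth / (len(path_a) + len(path_b))
    return 0.0  # the tags sit in different trees of the hierarchy


def cluster_posts(posts, parent):
    """Group post ids by the root tag most common among each post's tags."""
    clusters = {}
    for post_id, tags in posts.items():
        if not tags:
            continue
        roots = Counter(path_to_root(t, parent)[-1] for t in tags)
        clusters.setdefault(roots.most_common(1)[0][0], []).append(post_id)
    return clusters


# Hypothetical sample data: post id -> tags
posts = {
    "p1": ["travel", "japan", "tokyo"],
    "p2": ["travel", "japan"],
    "p3": ["travel", "korea"],
    "p4": ["food", "recipe"],
    "p5": ["food"],
    "p6": ["travel"],
}
hierarchy = build_tag_hierarchy(posts)
clusters = cluster_posts(posts, hierarchy)
```

With this sample data, `tokyo` is placed under `japan`, which is placed under `travel`, and the posts fall into a `travel` group and a `food` group. A real system would also need to fold in title keywords, which this sketch leaves out.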
Keywords
Blog Search; Tag Hierarchy; Post Clustering