Browse > Article
http://dx.doi.org/10.5391/JKIIS.2011.21.5.564

A Study on Graph-based Topic Extraction from Microblogs  

Choi, Don-Jung (성균관대학교 전자전기컴퓨터공학과)
Lee, Sung-Woo (성균관대학교 전자전기컴퓨터공학과)
Kim, Jae-Kwang (성균관대학교 전자전기컴퓨터공학과)
Lee, Jee-Hyong (성균관대학교 전자전기컴퓨터공학과)
Publication Information
Journal of the Korean Institute of Intelligent Systems / v.21, no.5, 2011 , pp. 564-568 More about this Journal
Abstract
Microblogs became popular information delivery ways due to the spread of smart phones. They have the characteristic of reflecting the interests of users more quickly than other medium. Particularly, in case of the subject which attracts many users, microblogs can supply rich information originated from various information sources. Nevertheless, it has been considered as a hard problem to obtain useful information from microblogs because too much noises are in them. So far, various methods are proposed to extract and track some subjects from particular documents, yet these methods do not work effectively in case of microblogs which consist of short phrases. In this paper, we propose a graph-based topic extraction and partitioning method to understand interests of users about a certain keyword. The proposed method contains the process of generating a keyword graph using the co-occurrences of terms in the microblogs, and the process of splitting the graph by using a network partitioning method. When we applied the proposed method on some keywords. our method shows good performance for finding a topic about the keyword and partitioning the topic into sub-topics.
Keywords
Microblogs; Twitter; Keyword graph; Network partitioning;
Citations & Related Records
연도 인용수 순위
  • Reference
1 http://en.wikipedia.org/wiki/Twitter
2 A. Java, X, Song, T. Finin and B. Tseng, "Why We Twitter: Understanding Microblogging Usage and Communities," Joint 9th WEBKDD and 1st SNA-KDD Workshop, 2007.
3 O. Phelan, K. McCarthy and B. Smyth, "Using Twitter to Recommend Real-Time Topical News," Proceedings of the 3th ACM conference on Recommender systems, 2009.
4 M. Michelson and S. A. Macskassy, "Discovering Users' Topics of Interest on Twitter: A First Look," Proceedings of the 4th workshop on Analytics for noisy unstructured text data, 2010.
5 M. Grineva, M. Grinev and D. Lizorkin, "Extra cting Key Terms From Noisy and Multi-theme Documents," Proceedings of the 18th Internatio nal Conference on World Wide Web, 2009.
6 J. Zeng, C. Wu and W. Wang, "Multi-grain Hierarchical Topic Extraction Algorithm for Text Mining," Expert Systems with Applications, Vol. 37(4), pp. 3202-3208, 2010.   DOI   ScienceOn
7 M. E. J. Newman, M. Girvan, "Finding and Evaluating Community Structure in Networks," Journal of Physical Review E, Vol. 69, 2004.