Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2012.19D.4.271

TK-Indexing : An Indexing Method for SNS Data Based on NoSQL  

Shim, Hyung-Nam (고려대학교 컴퓨터.전파통신공학과)
Kim, Jeong-Dong (고려대학교 컴퓨터.전파통신공학과)
Seol, Kwang-Soo (고려대학교 컴퓨터.전파통신공학과)
Baik, Doo-Kwon (고려대학교 컴퓨터.전파통신공학과)
Abstract
Currently, contents generated by SNS services are increasing exponentially, as the number of SNS users increase. The SNS is commonly used to post personal status and individual interests. Also, the SNS is applied in socialization, entertainment, product marketing, news sharing, and single person journalism. As SNS services became available on smart phones, the users of SNS services can generate and spread the social issues and controversies faster than the traditional media. The existing indexing methods for web contents have limitation in terms of real-time indexing for SNS contents, as they usually focus on diversity and accuracy of indexing. To overcome this problem, there are real-time indexing techniques based on RDBMSs. However, these techniques suffer from complex indexing procedures and reduced indexing targets. In this regard, we introduce the TK-Indexing method to improve the previous indexing techniques. Our method indexes the generation time of SNS contents and keywords by way of NoSQL to indexing SNS contents in real-time.
Keywords
NoSQL; Indexing Method; MapReduce; Social Network Service; Real-time Search;
Citations & Related Records
연도 인용수 순위
  • Reference
1 R. Chirkova, C. Li, and J. Li. "Answering queries using materialized views with minimum size", The VLDB Journal, 15(3):191-210, 2006.   DOI
2 me2day, http://www.me2day.net, 2012.
3 V. Hristidis and Y. Papakonstantinou, "Algorithms and applications for answering ranked queries using ranked views," The VLDB Journal, Vol.13, No.1, 2004.
4 C. Li, K. C.-C. chang, I. F. Ilyas, and S. Song, "RankSQL; Query Algebra and Opimization for Realational Top-k Queries", In Proc. Int'l Conf. on Management of Data, ACM SIGMOD, Baltimore, Maryland, June, 2005.
5 Jansen, B.j., et al, "Real Time search on the web: Queries, topics, and economic value", Information Processing and Management, Vol.47, Issue.4, pp.491-506, 2011.   DOI   ScienceOn
6 Facebook, http://www.facebook.com, 2012.
7 Twitter, http://www.twitter.com, 2012.
8 David Geer, "Is It Really Time for Real-Time Search", IEEE Computer Society, Vol.43, Issue.3, pp.16-19, 2010.   DOI   ScienceOn
9 Shim H, Kim J, Baik D, "TK-Indexing method based on NoSQL for real-time search", KIISE 2011 Fall Conference, Vol.28, Issue2, Seoul National Univ., 2011.
10 Bernard J.jansen, "Real time search user behavior", In: CHI Extended Abstracts, pp.3961-3966, 2010.
11 Jeffrey Dean, Sanjay Ghemawat, "MapReduce: simplified data processing on large clusters" the 6th conference on Symposium on Operating Systems Design & Implementation, 2004.
12 B. F. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R. Sears, "Benchmarking cloud serving systems with YCSB", In SoCC '10: Proceedings of the 1st ACM symposium on Cloud computing, pp.143-154, 2010.
13 Das, G., Gunopulos, D., Koudas, N., and Tsirogiannis, D, "Answering top-k queries using Views", International Conference on Very Large Data Bases, pp.451-462, 2006.
14 Chun chen, Feng Li, Beng chin Ooi, Sai Wu, "TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets", SIGMOD/PODS 2011, ACM Press(2011), pp.649-660, 2011.
15 Fay Chang, Jeffrey Dean, Sanjay Ghemawat, et al., "Bigtable: A distributed storage system for structured data", OSDI'06 Proceedings of the 7th conference on Symposium on Operating Systems Design and Implementation, 2006.
16 G. DeCandia, D.Hastorun, M. Jampani, G. Kakulapati, A.Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels, "Dynamo: Amazon's Highly Available Key-Value Store", ACM Symposium on Operating Systems Principles, 2007.