Browse > Article
http://dx.doi.org/10.12672/ksis.2015.23.2.069

Location Inference of Twitter Users using Timeline Data  

Kang, Ae Tti (Department of Social Studies, Ewha Womans University)
Kang, Young Ok (Department of Social Studies, Ewha Womans University)
Publication Information
Abstract
If one can infer the residential area of SNS users by analyzing the SNS big data, it can be an alternative by replacing the spatial big data researches which result from the location sparsity and ecological error. In this study, we developed the way of utilizing the daily life activity pattern, which can be found from timeline data of tweet users, to infer the residential areas of tweet users. We recognized the daily life activity pattern of tweet users from user's movement pattern and the regional cognition words that users text in tweet. The models based on user's movement and text are named as the daily movement pattern model and the daily activity field model, respectively. And then we selected the variables which are going to be utilized in each model. We defined the dependent variables as 0, if the residential areas that users tweet mainly are their home location(HL) and as 1, vice versa. According to our results, performed by the discriminant analysis, the hit ratio of the two models was 67.5%, 57.5% respectively. We tested both models by using the timeline data of the stress-related tweets. As a result, we inferred the residential areas of 5,301 users out of 48,235 users and could obtain 9,606 stress-related tweets with residential area. The results shows about 44 times increase by comparing to the geo-tagged tweets counts. We think that the methodology we have used in this study can be used not only to secure more location data in the study of SNS big data, but also to link the SNS big data with regional statistics in order to analyze the regional phenomenon.
Keywords
Location sparsity; Location inference; Timeline data; Daily life activity; Discriminant analysis;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Achrekar, H; Gandhe, A; Lazarus, R; Yu, S. H; Liu, B. 2011, Predicting flu trends using twitter data. Paper presented at the Computer Communications Workshops (INFOCOM WKSHPS), 2011 IEEE Conference on.
2 Backstrom, L; Sun, E; Marlow, C. 2010, Find me if you can: improving geographical prediction with social and spatial proximity. Paper presented at the Proceedings of the 19th international conference on World wide web.
3 Bae, H. W; Bang, S. W. 2013, (with R) Discriminant analysis and Logistic Regression analysis, Kyowoosa, Seoul
4 Bollen, J; Mao, H; Pepe, A. 2011, Modeling public mood and emotion: Twitter sentiment and socio-economic phenomena. Paper presented at the ICWSM.
5 Cheng, Z; Caverlee, J; Lee, K. 2010, You are where you tweet: a content-based approach to geo-locating twitter users. Paper presented at the Proceedings of the 19th ACM international conference on Information and knowle
6 Choi, H; Varian, H. 2012, Predicting the present with google trends. Economic Record, 88(s1): 2-9.   DOI   ScienceOn
7 Davis Jr, C. A; Pappa, G. L; de Oliveira; D. R. R; de L Arcanjo, F. 2011, Inferring the location of Twitter messages based on user relationships. Transactions in GIS, 15(6):735-751.   DOI
8 Fujisaka, T; Lee, R; Sumiya, K. 2010, Discovery of user behavior patterns from geo-tagged microblogs. Paper presented at the Proceedings of the 4th International Conference on Uniquitous Information Management and Commdgemanagement.
9 Ghosh, D; Guha, R. 2013, What are we 'tweeting' about obesity? Mapping tweets with topic modeling and Geographic Information System. Cartography and Geographic Information Science, 40(2):90-102.   DOI
10 Hecht, B; Hong, L; Suh, B; Chi, E. H. 2011, Tweets from Justin Bieber's heart: the dynamics of the location field in user profiles. Paper presented at the Proceedings of the SIGCHI Conference on Human Factors in Computing.
11 Ikawa, Y; Enoki, M; Tatsubori, M. 2012, Location inference using microblog messages. Paper presented at the Proceedings of the 21st international conference companion on World Wide Web.
12 Kent, J. D; Capello Jr, H. T. 2013, Spatial patterns and demographic indicators of effective social media content during theHorsethief Canyon fire of 2012. Cartography and Geographic Information Science, 40(2):78-89.   DOI
13 Kim, K. S; Kim, H.T. 2008, Analysis on the Excess Commuting Travel Time, Korea Development Institute.
14 Kim, S. J. 2013, Analyzing political attitudes of Twitter users by extracting sentiment from user timeline, The Catholic University of Korea, Bucheon.
15 Kim, Y. H; Shin, S. 2013, The current status of use SNS in Korea, Korea Information Society Development Institute.
16 Kwak, H; Lee, C; Park, H ; Moon, S. 2010, What is Twitter, a social network or a news media? Paper presented at the Proceedings of the 19th international conference on World wide web.
17 Lee, D. W; Kang, H. K; Kim, S. H; Lee, C. M. 2013, Autocorrelation Analysis of the Sentiment with Stock Information Appearing on Big-Data. The Korean Journal Of Financial Engineering, 12(2):79-96.
18 Lee, H. S; Lim, J. H. 2013, SPSS 20.0 Manual, Zip-hyunjae, Seoul.
19 Li, L; Goodchild, M. F; Xu, B. 2013, Spatial, temporal, and socioeconomic patterns in the use of Twitter and Flickr. Cartography and Geographic Information Science, 40(2):61-77.   DOI
20 Mayer-Schonberger, V; Cukier, K. 2013, Big data: A revolution that will transform how we live, work, and think: Houghton Mifflin Harcourt.
21 Mitchell, L; Frank, M. R; Harris, K. D; Dodds, P. S; Danforth, C. M. 2013, The Geography of Happiness: Connecting Twitter sentiment and expression, demographics, and objective characteristics of place. PloS one, 8(5):e64417.   DOI
22 Noulas, A; Scellato, S; Mascolo, C; Pontil, M. 2011, An Empirical Study of Geographic User Activity Patterns in Foursquare. ICWSM, 11:70-573.
23 Park, D. Y; Park, D. J. 2013, R and Statistical analysis, Jayu Academy, Paju.
24 Roick, O; Heuser, S. 2013, Location Based Social Networks-Definition, Current State of the Art and Research Agenda. Transactions in GIS.
25 Seo, T. W. 2012, A Study of Real-time Disaster Information Extraction and Displayusing the Mash-up based on SNS, Bukyung University, Busan.
26 Sung, T. J. 2014, (Using SPSS/AMOS/HLM) Easy Statistical Analysis, HakJeeSa, Seoul.