Browse > Article
http://dx.doi.org/10.3837/tiis.2019.04.005

Detection of Political Manipulation through Unsupervised Learning  

Lee, Sihyung (Department of Information Security, Seoul Women's University)
Publication Information
KSII Transactions on Internet and Information Systems (TIIS) / v.13, no.4, 2019 , pp. 1825-1844 More about this Journal
Abstract
Political campaigns circulate manipulative opinions in online communities to implant false beliefs and eventually win elections. Not only is this type of manipulation unfair, it also has long-lasting negative impacts on people's lives. Existing tools detect political manipulation based on a supervised classifier, which is accurate when trained with large labeled data. However, preparing this data becomes an excessive burden and must be repeated often to reflect changing manipulation tactics. We propose a practical detection system that requires moderate groundwork to achieve a sufficient level of accuracy. The proposed system groups opinions with similar properties into clusters, and then labels a few opinions from each cluster to build a classifier. It also models each opinion with features deduced from raw data with no additional processing. To validate the system, we collected over a million opinions during three nation-wide campaigns in South Korea. The system reduced groundwork from 200K to nearly 200 labeling tasks, and correctly identified over 90% of manipulative opinions. The system also effectively identified transitions in manipulative tactics over time. We suggest that online communities perform periodic audits using the proposed system to highlight manipulative opinions and emerging tactics.
Keywords
Political manipulation; fake news; online communities; WWW; unsupervised learning;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 "South Korea's spy agency admits trying to influence 2012 poll," BBC News, 2017.
2 A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," Communications of the ACM, vol. 60, no. 6, pp. 84-90, 2017.   DOI
3 C. Chang and C. Lin, "LIBSVM-a library for support vector machines," Retrieved August 24, 2018.
4 "Weka 3: data mining software in Java," Retrieved August 24, 2018.
5 "TensorFlow: an open source machine learning library for research and production," Retrieved August 24, 2018.
6 "Russian Twitter political protests swamped by spam," BBC News, 2012.
7 R. K. Garrett and B. E. Weeks, "The promise and peril of real-time corrections to political misperceptions," in Proc. of ACM CSCW, pp. 1047-1058, 2013.
8 "Manipulation of online public opinion and the battle of Naver's real-time search words," The Kyunghyang Shinmun, 2018.
9 B. Pang and L. Lee, "Opinion mining and sentiment analysis," Now Publishers, 2008.
10 H. Oh and S. Kim, "Identifying and exploiting trustable users with robust features in online rating systems," KSII Tranactions on Internet and Information Systems, vol. 11, no. 4, pp. 2171-2195, 2017.
11 A. J. Minnich, N. Chavoshi, A. Mueen, S. Luan, M. Faloutsos, "TrueView: harnessing the power of multiple review sites," in Proc. of WWW, pp.787-797, 2015.
12 S. Choe, "Prosecutors detail attempt to sway South Korean election," The New York Times, 2013.
13 H. Li, G. Fei, S. Wang, B. Liu, W. Shao, A. Mukherjee, and J. Shao, "Bimodal distribution and co-bursting in review spam detection," in Proc. of WWW, pp.1063-1072, 2017.
14 S. Kwon, M. Cha, K. Jung, W. Chen, and Y. Wang, "Prominent features of rumor propagation in online social media," in Proc. of IEEE ICDM, pp. 1103-1108, 2013.
15 G. Wang, T. Konolige, C. Wilson, X. Wang, H. Zheng, and B. Y. Zhao, "You are how you click: clickstream analysis for sybil detection," in Proc. of USENIX Security Symposium, pp.241-256, 2013.
16 D. Mocanu, L. Rossi, Q. Zhang, M. Karsai, and W. Quattrociocchi, "Collective attention in the age of misinformation," CoRR, 2014.
17 H. Fawcett, "South Korea's political cyber war," Aljazeera, 2013.
18 "KoreanClick: Nielsen KoreanClick syndicated reports," Nielsen KoreanClick, 2015.
19 A. Shin, "Opposition party apologizes for spreading fake news on president's son during election," Arirang, 2017.
20 H. Olsen, "North Korean weighs in on South Korean presidential election," KoreaBANG, 2012.
21 "Manipulation of recommendation counts by military and government agencies," Media Today, 2013.
22 "Responsibility of users for their postings," Nate, 2016.
23 A. Joy, "How South Korean intelligence interfered in election," KoreaBANG, 2013.
24 J. Fleiss, "Measuring nominal scale agreement among many raters," Psychological Bulletin, vol. 76, no. 5, pp. 378-382, 1971.   DOI
25 R. Landis and G. Koch, "The measurement of observer agreement for categorical data," Biometrics, vol. 33, no.1, pp. 159-174, 1977.   DOI
26 B. Liu, "Web data mining: exploring hyperlinks, contents, and usage data," Springer, 2011.
27 S. P. Lloyd, "Least squares quantization in PCM," IEEE Transactions on Information Theory, vol. 28, no. 2, pp. 129-137, 1982.   DOI
28 Y. Freund and R. E. Schapire, "A short introduction to boosting," Journal of Japanese Society for Artificial Intelligence, vol. 14, no. 5, pp. 1-14, 1999.
29 C. Fellbaum and G. A. Miller, "Wordnet: an electronic lexical database (language, speech, and communication)," MIT Press, 1998.
30 V. Vapnik, "The nature of statistical learning theory," Springer, 2000.
31 L. Breiman, "Random forests," Springer Machine Learning, vol. 45, no. 1, pp. 5-32, 2001.   DOI
32 S. Haykin, "Neural networks and learning machines," Pearson, 2009.
33 S. Shane and M. Mazzetti, "Inside a 3-year Russian campaign to influence US voters," The New York Times, 2018.
34 K. Becker, "The handbook of political manipulation," Conservative Daily News, 2012.
35 W. H. Riker, "The art of political manipulation," Yale University Press, 1986.
36 S. Lee, "Detection of political manipulation in online communities through measures of effort and collaboration," ACM Transactions on the Web, vol. 9, no. 3, article 16, 2015.
37 R. Bond, et al., "A 61-million-person experiment in social influence and political mobilization," Nature, vol. 489, no. 7415, pp. 295-298, 2012.   DOI
38 J. Ratkiewicz, M. D. Conover, M. Meiss, B. Goncalves, A. Flammini, and F. Menczer, "Detecting and tracking political abuse in social media," in Proc. of ICWSM, pp.294-304, 2011.