[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.3745/KTSDE.2022.11.4.149

CoAID⁺ : COVID-19 News Cascade Dataset for Social Context Based Fake News Detection

Han, Soeun (한양대학교 컴퓨터소프트웨어학과)
Kang, Yoonsuk (한양대학교 컴퓨테이셔널 사회과학연구센터)
Ko, Yunyong (한양대학교 인공지능 혁신인재교육 연구단)
Ahn, Jeewon (한양대학교 컴퓨터소프트웨어학과)
Kim, Yushim (Arizona State University 행정학과)
Oh, Seongsoo (한양대학교 행정학과)
Park, Heejin (한양대학교 정보통신학부)
Kim, Sang-Wook (한양대학교 정보통신학부)

Publication Information

KIPS Transactions on Software and Data Engineering / v.11, no.4, 2022 , pp. 149-156 More about this Journal

Abstract

In the current COVID-19 pandemic, fake news and misinformation related to COVID-19 have been causing serious confusion in our society. To accurately detect such fake news, social context-based methods have been widely studied in the literature. They detect fake news based on the social context that indicates how a news article is propagated over social media (e.g., Twitter). Most existing COVID-19 related datasets gathered for fake news detection, however, contain only the news content information, but not its social context information. In this case, the social context-based detection methods cannot be applied, which could be a big obstacle in the fake news detection research. To address this issue, in this work, we collect from Twitter the social context information based on CoAID, which is a COVID-19 news content dataset built for fake news detection, thereby building CoAID⁺ that includes both the news content information and its social context information. The CoAID⁺ dataset can be utilized in a variety of methods for social context-based fake news detection, thus would help revitalize the fake news detection research area. Finally, through a comprehensive analysis of the CoAID⁺ dataset in various perspectives, we present some interesting features capable of differentiating real and fake news.

Keywords

Fake News Detection; Propagation; Coronavirus; Social Context Based Detection;

Citations & Related Records

Reference

1	Y. Liu and Y. F. Wu, "Early detection of fake news on social media through propagation path classification with recurrent and convolutional networks," In Proceeding of the AAAI Conference on Artificial Intelligence, 2018.
2	K. Shu, D. Mahudeswaran, S. Wang, and H. Liu, "Hierarchical propagation networks for fake news detection: Investigation and exploitation," In Proceeding of the International AAAI Conference on Web and Social Media, Vol.14, pp.626-637, 2020.
3	K. Shu, S. Wang, and H. Liu, "Beyond news contents: The role of social context for fake news detection," In Proceeding of the Twelfth ACM International Conference on Web Search and Data Mining, pp.312-320, 2019.
4	S. Vosoughi, D. Roy, and S. Aral, "The spread of true and false news online," Science, Vol.359, No.6380, pp.1146-1151, 2018. DOI
5	K. Shu, A. Sliva, S. Wang, J. Tang, and H. Liu, "Fake news detection on social media: A data mining perspective," In Proceeding of the ACM SIGKDD Explorations Newsletter, Vol.19, No.1, pp.22-36, 2017. DOI
6	F. Monti, F. Frasca, D. Eynard, D. Mannion, and M. M. Bronstein, "Fake news detection on social media using geometric deep learning," arXiv preprint arXiv:1902.06673, 2019.
7	L. Cui and D. Lee, "Coaid: Covid-19 healthcare misinformation dataset," arXiv preprint arXiv:2006.00885, 2020.
8	X. Zhou, A. Mulay, E. Ferrara, and R. Zafarani, "Recovery: A multimodal repository for covid-19 news credibility research," In Proceeding of the 29th ACM International Conference on Information & Knowledge Management, pp.3205-3212, 2020.
9	G. K. Shahi and D. Nandini, "FakeCovid--A multilingual cross-domain fact check news dataset for COVID-19," arXiv preprint arXiv:2006.11343, 2020.
10	C. Castillo, M. Marcelo, and B. Poblete, "Predicting information credibility in time-sensitive social media," Internet Research, 2013.
11	Z. Jin, J. Cao, Y. Zhang, and J. Luo, "News verification by exploiting conflicting social viewpoints in microblogs," In Proceeding of the AAAI Conference on Artificial Intelligence, Vol.30. No.1, 2016.
12	E. Tacchini, G. Ballarin, ML.Vedova, S. Moret and L. Alfaro, "Some like it hoax: Automated fake news detection in social networks," arXiv preprint arXiv:1704.07506, 2017.
13	S. Han, Y. Kang, Y. Ko, J. Ahn, Y. Kim, S. Oh, H. Park, and S. Kim, "COVID-19 Cascade Dataset for Fake News Detection," The KIPS Spring Conference 2021, Vol.28, No.1, pp.312-313, 2021.
14	S. Badaskar, S. Agarwal, and S. Arora, "Identifying real or fake articles: Towards better language modeling," In Proceeding of the Third International Joint Conference on Natural Language Processing: Volume-II, 2008.
15	M. Abdul-Mageed, A. Elmadany, E. M. B. Nagoudi, D. Paddi, K. Verma, and R. Lin, "Mega-cov: A billion-scale dataset of 100+ languages for covid-19," arXiv preprint arXiv:2005.06012, 2020.
16	B. Riedel, et al. "A simple but tough-to-beat baseline for the Fake News Challenge stance detection task," arXiv preprint arXiv:1707.03264, 2017.
17	H. Ahmed, I. Traore, and S. Saad, "Detection of online fake news using n-gram analysis and machine learning techniques," International Conference on Intelligent, Secure, and Dependable Systems in Distributed and Cloud Environments, Springer, Cham, 2017.
18	M. Potthast, J. Kiesel, K. Reinartz, J. Bevendorff, and B. Stein, "A stylometric inquiry into hyperpartisan and fake news," arXiv preprint arXiv:1702.05638, 2017.
19	A. Gupta, P. Kumaraguru, C. Castillo, and P. Meier, "Tweetcred: Real-time credibility assessment of content on twitter," International conference on social informatics, Springer, Cham, 2014.

KSCI

CoAID+ : COVID-19 News Cascade Dataset for Social Context Based Fake News Detection CoAID+ : 소셜 컨텍스트 기반 가짜뉴스 탐지를 위한 COVID-19 뉴스 파급 데이터

CoAID⁺ : COVID-19 News Cascade Dataset for Social Context Based Fake News Detection