DOI QR코드

DOI QR Code

Cyberbullying Detection by Sentiment Analysis of Tweets' Contents Written in Arabic in Saudi Arabia Society

  • Received : 2021.03.05
  • Published : 2021.03.30

Abstract

Social media has become a global means of communication in people's lives. Most people are using Twitter for communication purposes and its inappropriate use, which has negative effects on people's lives. One of the widely common misuses of Twitter is cyberbullying. As the resources of dialectal Arabic are rare, so for cyberbullying most people are using dialectal Arabic. For this reason, the ultimate goal of this study is to detect and classify cyberbullying on Twitter in the Arabic context in Saudi Arabia. To help in the detection and classification of tweets, Pointwise Mutual Information (PMI) to generate a lexicon, and Support Vector Machine (SVM) algorithms are used. The evaluation is performed on both methods in terms of the F1-score. However, the F1-score after applying the PMI is 50%, while after the SVM application on the resampling data it is 82%. The analysis of the results shows that the SVM algorithm outperforms better.

Keywords

References

  1. S. Poland, "Cyberbullying Continues to Challenge Educators," District Administration, 2010.
  2. A. M. Al-Zahrani, "Cyberbullying among Saudi's Higher-Education Students: Implications for Educators and Policymakers," World J. Educ., vol. 5, no. 3, p. n/a, 2015.
  3. D. Farid and N. El-Tazi, "Detection of Cyberbullying in Tweets in Egyptian Dialects," vol. 18, no. 7, pp. 34-41, 2020.
  4. A. H. Alduailej, "The challenge of cyberbullying and its automatic detection in Arabic text," pp. 389-394, 2017.
  5. H. Sanchez, "Twitter Bullying Detection," Homo, 2011.
  6. P. Burnap and M. L. Williams, "Cyber Hate Speech on Twitter: An Application of Machine Classification and Statistical Modeling for Policy and Decision Making," Policy & Internet, vol. 7, no. 2, pp. 223-242, 2015. https://doi.org/10.1002/poi3.85
  7. N. Al-Twairesh, H. Al-Khalifa, A. Al-Salman, and Y. Al-Ohali, "AraSenTi-Tweet: A Corpus for Arabic Sentiment Analysis of Saudi Tweets," Procedia Comput. Sci., vol. 117, no. December, pp. 63-72, 2017. https://doi.org/10.1016/j.procs.2017.10.094
  8. H. A.-D. Adel Assiri, Ahmed Emam and International, "Saudi Twitter Corpus for Sentiment Analysis," Int. J. Comput. Inf. Eng., vol. 10, no. 2, pp. 272-275, 2016.
  9. M. El Kourdi, A. Bensaid, and T. Rachidi, "Automatic Arabic document categorization based on the Naive Bayes algorithm," Proc. Work. Comput. Approaches to Arab. Script-based Lang. - Semit. '04, p. 51, 2004.
  10. M. Oakleaf, "Writing information literacy assessment plans: A guide to best practice," Commun. Inf. Lit., vol. 3, no. 2, pp. 80-90, 2009.
  11. L. Albraheem and H. S. Al-Khalifa, "Exploring the problems of sentiment analysis in informal Arabic," Proc. 14th Int. Conf. Inf. Integr. Web-based Appl. Serv. - IIWAS '12, p. 415, 2012.
  12. O. Erdur-Baker, "Cyberbullying and its correlation to traditional bullying, gender and frequent and risky usage of internet-mediated communication tools," New Media and Society, 2010. .
  13. W. A. Al-harbi, "E Ffect of S Audi Dialect P Reprocessing on," Int. J. Adv. Comput. Technol., pp. 91-99, 2016.
  14. "11 Facts About Cyber Bullying," DoSomething.org | Volunt. Soc. Chang., pp. 5-7.
  15. A. El-Halees, "Arabic Opinion Mining Using Combined Classification Approach," Int. Arab Conf. Inf. Technol. ACIT2011, 2011.
  16. S. Wakade, C. Shekar, K. J. Liszka, and C. Chan, "Text Mining for Sentiment Analysis of Twitter Data."
  17. W. Alabbas, M. Haider, A. Mansour, G. Epiphaniou, and I. Frommholz, "Classification of Colloquial Arabic Tweets in real-time to detect high-risk floods," Proc. Int. Conf. Soc. Media, Wearable Web Anal. (Social Media 2017), 2017.
  18. P. B. Dastanwala and V. Patel, "A review on social audience identification on twitter using text mining methods," Proc. 2016 IEEE Int. Conf. Wirel. Commun. Signal Process. Networking, WiSPNET 2016, pp. 1917-1920, 2016.
  19. H. Al-Rubaiee, R. Qiu, K. Alomar, and D. Li, "Sentiment Analysis of Arabic Tweets in e-Learning," J. Comput. Sci., vol. 12, no. 11, pp. 553-563, 2016. https://doi.org/10.3844/jcssp.2016.553.563
  20. J. Ben Salamah and A. Elkhlifi, "Microblogging Opinion Mining Approach for Kuwaiti Dialect," Comput. Technol. Inf. Manag., vol. 1, no. 1, p. 9, 2016.
  21. R. Sonykrishna, L. Priyanka, K. Vijayalakshmi, and M. Sowmya, "A Text Mining Application Of Emotion Classification Of Twitter ' s Users," no. April, pp. 203-207, 2017.
  22. R. Balabantaray, "Multi-class twitter emotion classification: A new approach," Int. J. ..., vol. 4, no. 1, pp. 48-53, 2012.
  23. M. Hasan, E. Rundensteiner, and E. Agu, "EMOTEX: Detecting Emotions in Twitter Messages," ASE BIGDATA/SOCIALCOM/CYBERSECURITY Conf., pp. 27-31, 2014.
  24. F. M. Al-kharboush and M. A. Al-hagery, "Features Extraction Effect on the Accuracy of Sentiment Classification Using Ensemble Models," vol. 10, no. 3, pp. 2019-2022, 2021.
  25. R. Rana and V. Kolhe, "Analysis of Students Emotion for Twitter Data using Naive Bayes and Non Linear Support Vector Machine Approachs," Int. J. Recent Innov. Trends Comput. Commun., vol. 3, no. 5, pp. 3211-3217, 2015.
  26. M. A. H. Al-Hagery, "Classifiers' Accuracy Based on Breast Cancer Medical Data and Data Mining Techniques," Int. J. Adv. Biotechnol. Res., vol. 7, no. 2, pp. 760-772, 2016.
  27. S. Al-qarzaie, S. Al-odhaibi, B. Al-saeed, and M. Al-hagery, "Using the Data Mining Techniques for Breast Cancer Early Prediction," Symp. Data Min. Appl., vol. 1, no. May, 2014.
  28. A. Abdulrahman Al-Noshan, M. Abdullah Al-Hagery, H. Abdulaziz Al-Hodathi, and M. Sulaiman Al-Quraishi, "Performance Evaluation and Comparison of Classification Algorithms for Students at Qassim University," Int. J. Sci. Res., vol. 8, no. 11, pp. 1277-1282, 2018.
  29. E. I. Al-Fairouz and M. A. Al-Hagery, "The most efficient classifiers for the students' academic dataset," Int. J. Adv. Comput. Sci. Appl., vol. 11, no. 9, pp. 501-506, 2020.
  30. A. M. Misbah and I. F. Imam, "Mining opinions in Arabic text using an improved 'semantic orientation using pointwise mutual information' algorithm," 2012 8th Int. Conf. Informatics Syst. INFOS 2012, pp. 61-69, 2012.
  31. N. Al-Twairesh, H. Al-Khalifa, and A. Al-Salman, "AraSenTi: Large-scale twitter-specific Arabic sentiment lexicons," 54th Annu. Meet. Assoc. Comput. Linguist. ACL 2016 - Long Pap., vol. 2, pp. 697-705, 2016.
  32. C. Stephanidis, "Foreword," Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 10280 LNCS, no. January 2021, p. VII, 2017.
  33. M. AbdullahAl-Hagery, M. AbdullahAl-Assaf, and F. MohammadAl-Kharboush, "Exploration of the best performance method of emotions classification for arabic tweets," Indones. J. Electr. Eng. Comput. Sci., vol. 19, no. 2, pp. 1010-1020, 2020. https://doi.org/10.11591/ijeecs.v19.i2.pp1010-1020
  34. T. Joachims, "Text categorization with Support Vector Machines: Learning with many relevant features," pp. 137-142, 1998.