DOI QR코드

DOI QR Code

Academic Registration Text Classification Using Machine Learning

  • Received : 2021.12.05
  • Published : 2022.01.30

Abstract

Natural language processing (NLP) is utilized to understand a natural text. Text analysis systems use natural language algorithms to find the meaning of large amounts of text. Text classification represents a basic task of NLP with a wide range of applications such as topic labeling, sentiment analysis, spam detection, and intent detection. The algorithm can transform user's unstructured thoughts into more structured data. In this work, a text classifier has been developed that uses academic admission and registration texts as input, analyzes its content, and then automatically assigns relevant tags such as admission, graduate school, and registration. In this work, the well-known algorithms support vector machine SVM and K-nearest neighbor (kNN) algorithms are used to develop the above-mentioned classifier. The obtained results showed that the SVM classifier outperformed the kNN classifier with an overall accuracy of 98.9%. in addition, the mean absolute error of SVM was 0.0064 while it was 0.0098 for kNN classifier. Based on the obtained results, the SVM is used to implement the academic text classification in this work.

Keywords

Acknowledgement

I want to thank my university, "University of Ha'il" for providing us with all the needed facilities to complete this master's degree. A special thanks to Gharbi Alshammari for his continuous guide and support. I want to acknowledge and thank my department for allowing me to conduct my research and providing any assistance requested. Finally, I would like to thank all my friends and colleagues who have helped me on this project. Their enthusiasm and willingness to provide feedback made completing this study an enjoyable experience.

References

  1. Allahyari, M., Pouriyeh, S., Assefi, M., Safaei, S., Trippe, E. D., Gutierrez, J. B., & Kochut, K. (2017). A brief survey of text mining: Classification, clustering and extraction techniques. ArXiv Preprint ArXiv:1707.02919.
  2. Kadhim, A. I. (2019). Survey on supervised machine learning techniques for automatic text classification. Artificial Intelligence Review, 52(1), 273-292. https://doi.org/10.1007/s10462-018-09677-1
  3. Qawqzeh Y, Alharbi MT, Jaradat A, Abdul Sattar KN. A review of swarm intelligence algorithms deployment for scheduling and optimization in cloud computing environments. PeerJ Comput Sci. 2021 Aug 25;7:e696. doi: 10.7717/peerj-cs.696. PMID: 34541313; PMCID: PMC8409329.
  4. Cunningham-Nelson, S., Baktashmotlagh, M., & Boles, W. (2017). From review to rating: Exploring dependency measures for text classification. ArXiv Preprint ArXiv:1709.00813.
  5. Mohammed M, Khan MB, Bashier Mohammed BE. Machine learning: algorithms and applications. CRC Press; 2016
  6. Mironczuk, M. M., & Protasiewicz, J. (2018). A recent overview of the state-of-the-art elements of text classification. Expert Systems with Applications, 106, 36-54. https://doi.org/10.1016/j.eswa.2018.03.058
  7. Li, Baoli & Yu, Shiwen & Lu, Qin. (2003). An Improved k-Nearest Neighbor Algorithm for Text Categorization.
  8. Abdulameer, A. S., Tiun, S., Sani, N. S., Ayob, M., & Taha, A. Y. (2020). Enhanced clustering models with wiki-based k-nearest neighbors-based representation for web search result clustering. Journal of King Saud University-Computer and Information Sciences.
  9. Sabbah, T., Ayyash, M., & Ashraf, M. (2018). Hybrid support vector machine based feature selection method for text classification. Int. Arab J. Inf. Technol., 15(3A), 599-609.
  10. Sarker IH. Ai-driven cybersecurity: an overview, security intelligence modeling and research directions. SN Comput Sci. 2021
  11. Srivastava, Durgesh & Bhambhu, L. (2010). Data classification using support vector machine. Journal of Theoretical and Applied Information Technology. 12. 1-7.
  12. Slusarczyk B. Industry 4.0: Are we ready? Polish J Manag Stud. 17, 2018.
  13. Uysal, A. K. (2016). An improved global feature selection scheme for text classification. Expert Systems with Applications, 43, 82-92. https://doi.org/10.1016/j.eswa.2015.08.050
  14. Yao, L., Mao, C., & Luo, Y. (2019). Graph convolutional networks for text classification. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 7370-7377.
  15. Onan, A., Atik, E., & Yalcin, A. (2020). Machine learning approach for automatic categorization of service support requests on university information management system. International Conference on Intelligent and Fuzzy Systems, 1133-1139.
  16. Yu, P., Cui, V. Y., & Guan, J. (2021). Text Classification by using Natural Language Processing. Journal of Physics: Conference Series, 1802(4), 42010. https://doi.org/10.1088/1742-6596/1802/4/042010