[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.22937/IJCSNS.2021.21.5.14

MTReadable: Arabic Readability Corpus for Medical Tests Information

Alahmdi, Dimah (Faculty of Computer and Information Technology, King Abdulaziz University)
Alghamdi, Athir Saeed (Faculty of Computer and Information Technology, King Abdulaziz University)
Almuallim, Neda'a (Faculty of Computer and Information Technology, King Abdulaziz University)
Alarifi, Suaad (Faculty of Computer and Information Technology, King Abdulaziz University)

Publication Information

International Journal of Computer Science & Network Security / v.21, no.5, 2021 , pp. 84-89 More about this Journal

Abstract

Medical tests are very important part of the health monitoring process. It is performed for various reasons like diagnosing diseases, determining medications effectiveness, etc. Due to that, patients should be able to read and understand the available online tests and results in order to take proper decisions regarding their health condition. In fact, people are varying in their educational level and health backgrounds that make providing such information in an easily readable format by the majority of people considered as a challenge in the health domain since ever. This paper describes the MTReadable corpus which constructed for evaluating the readability of online medical tests. It covered 32 basic periodic check-up tests with over 36k words. These tests information are annotated and labelled based on three readability levels which are easy, neutral and difficult by three non-specialists native Arabic speakers. This paper contributes to enriching the Arabic health research community with an investigation of the level of readability of online medical tests and to be a baseline for further complex health online reports and information.

Keywords

Text mining; Arabic corpus; Readability corpus; Medical Test;

Citations & Related Records

Reference

1	Pope, C., S. Ziedland, and N. Mays. "Qualitative research in health care: Analysing qualitative data. 320." BMJ 8.320 (2000): p.7227.
2	Salloum, Said A., et al. "A survey of Arabic text mining." Intelligent Natural Language Processing: Trends and Applications. Springer, Cham, 2018. p.417-431.
3	Al Aqeel, Sinaa, et al. "Readability of written medicine information materials in Arabic language: expert and consumer evaluation." BMC health services research 18.1 (2018): p.1-7. DOI
4	Alotaibi, S., Alyahya, M., Al-Khalifa, H., Alageel, S., & Abanmy, N.. Readability of Arabic medicine information leaflets: a machine learning approach. Procedia Computer Science, 82 (2016), p. 122-126. DOI
5	Bustos, Aurelia, et al. "Padchest: A large chest x-ray image dataset with multi-label annotated reports." Medical image analysis 66 (2020):p. 101797. DOI
6	Health, S. M. o. (2019). Awareness. Retrieved from https://www.moh.gov.sa/Pages/Default.asx
7	Blackman, Nicole J - M., and John J. Koval. "Interval estimation for Cohen's kappa as a measure of agreement." Statistics in medicine 19.5 (2000): p.723-741. DOI
8	Daraz, Lubna, et al. "Can patients trust online health information? A meta-narrative systematic review addressing the quality of health information on the internet." Journal of general internal medicine 34.9 (2019): 1884-1891. DOI
9	Pinsonneault, Alain, et al. "Integrated health information technology and the quality of patient care: A natural experiment." Journal of Management Information Systems 34.2 (2017): p.457-486. DOI
10	Kher, Akhil, Sandra Johnson, and Robert Griffith. "Readability assessment of online patient education material on congestive heart failure." Advances in preventive medicine 2017 (2017).
11	Albukhitan, Saeed, Ahmed Alnazer, and Tarek Helmy. "Semantic annotation of arabic web documents using deep learning." Procedia computer science 130 (2018): p.589-596. DOI
12	Alalyani, Nada, and Souad Larabi Marie-Sainte. "NADA: New Arabic dataset for text classification." International Journal of Advanced Computer Science and Applications 9.9 (2018).
13	Zeroual, Imad, and Abdelhak Lakhouaja. "A new Quranic Corpus rich in morphosyntactical information." International Journal of Speech Technology 19.2 (2016): p.339-346. DOI
14	Saad, Motaz K., and Wesam M. Ashour. "Osac: Open source arabic corpora." 6th ArchEng Int. Symposiums, EEECS. Vol. 10. 2010.
15	Samy, Doaa, et al. "Medical Term Extraction in an Arabic Medical Corpus." LREC. 2012.
16	Bird, Steven, Ewan Klein, and Edward Loper. Natural language processing with Python: analyzing text with the natural language toolkit. " O'Reilly Media, Inc.", 2009.
17	Dukes, Kais, and Nizar Habash. "Morphological Annotation of Quranic Arabic." Lrec. 2010.
18	Charnock, Deborah. "The DISCERN handbook." Quality criteria for consumer health information on treatment choices. Radcliffe: University of Oxford and The British Library (1998).
19	Encyclopedia, King, A, (2019) https://kaahe.org/en-us/Pages/Home/Home.aspx
20	Laboratories, A. B. M. (2019). Lab Tests Website. Retrieved from https://www.alborg.sa/ar/
21	Sun, Wencheng, et al. "Data processing and text mining technologies on electronic medical records: a review." Journal of healthcare engineering 2018 (2018).