Real-Time Comprehensive Assistance for Visually Impaired Navigation

  • Received : 2024.05.05
  • Published : 2024.05.30


Individuals with visual impairments face numerous challenges in daily life, and navigating streets and public spaces is among the most daunting. Being unable to identify safe crossing locations or to judge whether crossing is feasible significantly restricts their mobility and independence. According to the World Health Organization, an estimated 285 million people worldwide live with visual impairment, of whom 39 million are blind and 246 million have low vision. In Saudi Arabia alone, unofficial statistics put the number of blind individuals at approximately 159,000. The impact of visual impairment on daily activities underscores the need for solutions that improve mobility and safety. This study addresses the problem by applying computer vision and deep learning techniques to object detection. Two models were trained: one detects obstacles relevant to street crossing, and the other supports searching for objects. For the first model, a dataset of 5,283 images of road obstacles and traffic signals was collected and annotated. Both YOLOv8 and YOLOv5 were trained on this dataset, with YOLOv5 achieving an accuracy of 84%. The second model was trained on the COCO dataset using YOLOv5 and achieved an accuracy of 94%. By improving object detection capabilities, this research seeks to enhance the mobility, independence, and overall quality of life of individuals with visual impairments.
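The abstract reports detection accuracy but does not state how it was computed. As a minimal sketch, assuming predictions and annotations are axis-aligned boxes given as (x1, y1, x2, y2) with a class label, one common ingredient of detection metrics is intersection over union (IoU) between a predicted box and an annotated box. The function names, the label strings, and the 0.5 threshold below are illustrative assumptions, not the authors' evaluation procedure.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap region, clamped at zero when boxes are disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def precision_at_iou(predictions, ground_truth, threshold=0.5):
    """Fraction of predictions matched to an unused same-label annotation.

    predictions and ground_truth are lists of (label, box) pairs.
    The 0.5 threshold is an assumed, commonly used default.
    """
    used = set()  # indices of annotations already matched
    hits = 0
    for label, box in predictions:
        best_idx, best_score = None, threshold
        for i, (gt_label, gt_box) in enumerate(ground_truth):
            if i in used or gt_label != label:
                continue
            score = iou(box, gt_box)
            if score >= best_score:
                best_idx, best_score = i, score
        if best_idx is not None:
            used.add(best_idx)
            hits += 1
    return hits / len(predictions) if predictions else 0.0


if __name__ == "__main__":
    # Hypothetical labels and coordinates, for illustration only.
    preds = [("traffic_light", (0, 0, 10, 10)), ("car", (20, 20, 30, 30))]
    truth = [("traffic_light", (1, 1, 10, 10)), ("car", (50, 50, 60, 60))]
    print(precision_at_iou(preds, truth))
```

In this toy input the traffic light prediction overlaps its annotation well enough to count as a match, while the car prediction does not overlap any annotation, so half of the predictions are counted as correct.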


