DOI QR코드

DOI QR Code

A Unicode based Deep Handwritten Character Recognition model for Telugu to English Language Translation

  • BV Subba Rao (Dept of Information Technology, PVP Siddhartha Institute of Technology) ;
  • J. Nageswara Rao (Department of Computer Science and Engineering, Lakireddy Bali Reddy College of Engineering) ;
  • Bandi Vamsi (Department of Artificial Intelligence & Data Science, Madanapalle Institute of Technology & Science) ;
  • Venkata Nagaraju Thatha (Department of Information Technology MLR INSTITUTE of technology ) ;
  • Katta Subba Rao (Department of Computer Science and Engineering, B V Raju Institute of Technology)
  • 투고 : 2024.02.05
  • 발행 : 2024.02.29

초록

Telugu language is considered as fourth most used language in India especially in the regions of Andhra Pradesh, Telangana, Karnataka etc. In international recognized countries also, Telugu is widely growing spoken language. This language comprises of different dependent and independent vowels, consonants and digits. In this aspect, the enhancement of Telugu Handwritten Character Recognition (HCR) has not been propagated. HCR is a neural network technique of converting a documented image to edited text one which can be used for many other applications. This reduces time and effort without starting over from the beginning every time. In this work, a Unicode based Handwritten Character Recognition(U-HCR) is developed for translating the handwritten Telugu characters into English language. With the use of Centre of Gravity (CG) in our model we can easily divide a compound character into individual character with the help of Unicode values. For training this model, we have used both online and offline Telugu character datasets. To extract the features in the scanned image we used convolutional neural network along with Machine Learning classifiers like Random Forest and Support Vector Machine. Stochastic Gradient Descent (SGD), Root Mean Square Propagation (RMS-P) and Adaptative Moment Estimation (ADAM)optimizers are used in this work to enhance the performance of U-HCR and to reduce the loss function value. This loss value reduction can be possible with optimizers by using CNN. In both online and offline datasets, proposed model showed promising results by maintaining the accuracies with 90.28% for SGD, 96.97% for RMS-P and 93.57% for ADAM respectively.

키워드

참고문헌

  1. Meena, B., Rao, K. V., &Chittineni, S. (2022). A Novel Method to Auto Configure Convolution Neural Network Model Using Soft Computing Technique to Recognize Telugu Hand-Written Character for Better Accuracy. Journal of Theoretical and Applied Information Technology, 100(18). 
  2. Das, M. S., Reddy, C. R. K., Rahul, K., & Govardhan, A. (2011). Multilingual Optical Character Recognition System for Printed English and Telugu Base Characters. International Journal of Science and Advanced Technology (ISSN 2221-8386), 1(4), 106-111. 
  3. Guptha, N. S., Balamurugan, V., Megharaj, G., Sattar, K. N. A., & Rose, J. D. (2022). Cross lingual handwritten character recognition using long short term memory network with aid of elephant herding optimization algorithm. Pattern Recognition Letters, 159, 16-22. https://doi.org/10.1016/j.patrec.2022.04.038 
  4. Sonthi, V. K., Nagarajan, S., &Krishnaraj, N. (2022). An Intelligent Telugu Handwritten Character Recognition using Multi-Objective Mayfly Optimization with Deep Learning Based DenseNet Model. Transactions on Asian and Low-Resource Language Information Processing. https://doi.org/10.1145/3520439 
  5. Shekar, K. C., Cross, M. A., & Vasudevan, V. (2021). Optical Character Recognition and Neural Machine Translation Using Deep Learning Techniques. In Innovations in Computer Science and Engineering (pp. 277-283). Springer, Singapore. https://doi.org/10.1007/978-981-33-4543-0_30 
  6. Sethy, A., Patra, P. K., & Nayak, S. R. (2022). A Hybrid System for Handwritten Character Recognition with High Robustness. Traitement du Signal, 39(2). https://doi.org/10.18280/ts.390218 
  7. Sharma, R., & Kaushik, B. (2022). Handwritten Indic scripts recognition using neuro-evolutionary adaptive PSO based convolutional neural networks. Sadhana, 47(1), 1-19. https://doi.org/10.1007/s12046-021-01787-x 
  8. Sankara Babu, B., Nalajala, S., Sarada, K., Muniraju Naidu, V., Yamsani, N., &Saikumar, K. (2022). Machine Learning Based Online Handwritten Telugu Letters Recognition for Different Domains. In A Fusion of Artificial Intelligence and Internet of Things for Emerging Cyber Systems (pp. 227-241). Springer. 
  9. Ganji, T., Velpuru, M. S., &Dugyala, R. (2021). Multi variant handwritten telugu character recognition using transfer learning. In IOP Conference Series: Materials Science and Engineering (Vol. 1042, No. 1, p. 012026). IOP Publishing.