[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.9717/kmms.2020.24.5.651

Methods of Classification and Character Recognition for Table Items through Deep Learning

Lee, Dong-Seok (AI Grand ICT Research Center, Dong-eui University)
Kwon, Soon-Kak (Dept. of Computer Software Engineering, Dong-eui University)

Publication Information

Journal of Korea Multimedia Society / v.24, no.5, 2021 , pp. 651-658 More about this Journal

Abstract

In this paper, we propose methods for character recognition and classification for table items through deep learning. First, table areas are detected in a document image through CNN. After that, table areas are separated by separators such as vertical lines. The text in document is recognized through a neural network combined with CNN and RNN. To correct errors in the character recognition, multiple candidates for the recognized result are provided for a sentence which has low recognition accuracy.

Keywords

Optical character recognition; Image recognition; Deep learning; Intelligent document processing; Convolutional neural network;

Citations & Related Records

Reference

1	Korean Import and Export Logistics Process Vol 5(2016), https://www.nlic.go.kr/nlic/WhsBordPdfSch.action (accessed May 28, 2021).
2	S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: Towards Real-time Object Detection with Region Proposal Networks," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 39, No. 6, pp. 1137-1149, 2017. DOI
3	K. He, G. Gkioxari, P. Dollar, and R. Girshick, "Mask R-CNN," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 42, No. 2, pp. 386-397, 2020. DOI
4	B. Shi, M. Yang, X. Wang. P. Lyu, C. Yao, and X. Bai, "ASTER: An Attentional Scene Text Recognizer with Flexible Rectification," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 41, No. 9, pp. 2035-2048, 2019. DOI
5	Y. Lecun, L. Bottou, Y. Bengio, and P. Haffner, "Gradient-based Learning Applied to Document Recognition," Proceedings of the IEEE, Vol. 86, No. 11, pp. 2278-2324, 1998. DOI
6	A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet Classification with Deep Convolutional Neural Networks," Communications of the ACM, Vol. 60, No. 6, pp. 84-90, 2017. DOI
7	P. Soviany and R. T. Ionescu, "Optimizing the Trade-off between Single-Stage and TwoStage Deep Object Detectors using Image Difficulty Prediction," Proceeding of the International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, pp. 209-214, 2018.
8	R. Girshick, "Fast R-CNN," Proceeding of the IEEE International Conference on Computer Vision, pp. 1440-1448, 2015.
9	Z. Cheng, P. Bai, Y. Xu, G. Zheng, S. Pu, and S. Zhou, "Focusing Attention: Towards Accurate Text Recognition in Natural Images," Proceeding of the IEEE International Conference on Computer Vision, pp. 5076-5084, 2017.
10	R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation," Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 580-587, 2014.
11	J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You Only Look Once: Unified, RealTime Object Detection," Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, 2016.
12	J. Redmon and A. Farhadi, "YOLO9000: Better, Faster, Stronger," Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6517-6525, 2017.
13	W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Y. Fu, and A. C. Berg, "SSD: Single Shot MultiBox Detector," Proceeding of the European Conference on Computer Vision, pp. 21-37, 2016.
14	B. Shi, X. Bai, and C. Yao, "An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 39, No. 11, pp. 2298-2304, 2017. DOI
15	S. M. Gang and J. J. Lee, "Coreset Construction for Character Recognition of PCB Components Based on Deep Learning," Journal of Korea Multimedia Society, Vol. 24, No. 3, pp. 382-395, 2021. DOI
16	J. Wang and X. Hu, "Gated Recurrent Convolution Neural Network for OCR," Proceeding of the International Conference on Neural Information Processing Systems, pp. 334-343, 2017.
17	K. He, X. Zhang, S. Ren and J. Sun, "Deep Residual Learning for Image Recognition," Proceeding of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, 2016.
18	Vocabulary List for Learning Korean(2003), https://www.korean.go.kr/front/etcData/etcDataView.do?mn_id=46&etc_seq=71 (accessed May 28, 2021).
19	T. He, Z. Tian, W. Huang, C. Shen, Y. Qiao, and C. Sun, "An End-to-End TextSpotter with Explicit Alignment and Attention," Proceeding of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5020-5029, 2018.

KSCI

Methods of Classification and Character Recognition for Table Items through Deep Learning 딥러닝을 통한 문서 내 표 항목 분류 및 인식 방법

Methods of Classification and Character Recognition for Table Items through Deep Learning