Browse > Article
http://dx.doi.org/10.9708/jksci.2021.26.12.061

Structuring of Pulmonary Function Test Paper Using Deep Learning  

Jo, Sang-Hyun (Dept. of Computer Science and Engineering, Kangwon National University)
Kim, Dae-Hoon (Dept. of Computer Science and Engineering, Kangwon National University)
Kim, Yoon (Dept. of Computer Science and Engineering, Kangwon National University)
Kwon, Sung-Ok (Dept. of Medical Bigdata Convergence, Kangwon National University)
Kim, Woo-Jin (Dept. of Internal Medicine and Biomedical Informatics, Kangwon National University)
Lee, Sang-Ah (Dept. of Preventive Medicine, Kangwon National University School of Medicine)
Abstract
In this paper, we propose a method of extracting and recognizing related information for research from images of the unstructured pulmonary function test papers using character detection and recognition techniques. Also, we develop a post-processing method to reduce the character recognition error rate. The proposed structuring method uses a character detection model for the pulmonary function test paper images to detect all characters in the test paper and passes the detected character image through the character recognition model to obtain a string. The obtained string is reviewed for validity using string matching and structuring is completed. We confirm that our proposed structuring system is a more efficient and stable method than the structuring method through manual work of professionals because our system's error rate is within about 1% and the processing speed per pulmonary function test paper is within 2 seconds.
Keywords
Text Detection; Text Recognition; Image Classification; Data Structuring; Deep Learning;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Lee, Junyeop, et al. "On recognizing texts of arbitrary shapes with 2D self-attention." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 546-547 2020. DOI:10.1109/CVPRW50498.2020.00281
2 Wan, Zhaoyi, et al. "Textscanner: Reading characters in order for robust scene text recognition." AAAI Conference on Artificial Intelligence. 34,07 pp. 12120-12127 Jan. 2020. DOI:10.1609/AAAI.V34I07.6891
3 Lyu, Pengyuan, et al. "Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes" European Conference on Computer Vision (ECCV), pp. 71-88 Aug. 2018. DOI: https://doi.org/10.1007/978-3-030-01264-9_5
4 Liao, Minghui, et al. "Real-time Scene Text Detection with Differentiable Binarization" AAAI Conference on Artificial Intelligence, pp. 11474-11481 April. 2020. DOI: 10.1609/aaai.v34i07.6812
5 Liu, Jingchao, et al. "Pyramid mask text detector." arXiv preprint arXiv:1903.11800 Mar. 2019.
6 Loginov, Vladimir. "Why You Should Try the Real Data for the Scene Text Recognition." arXiv preprint arXiv:2107.13938 Jul. 2021.
7 Huang, Zhida, et al. "Mask R-CNN with pyramid attention network for scene text detection." 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 764-772 IEEE, 2019. DOI: 10.1109/WACV.2019.00086
8 Xing, Linjie, et al. "Convolutional Character Networks" IEEE/CVF International Conference on Computer Vision (ICCV), pp. 9126-9136 2019. DOI: 10.1109/ICCV.2019.00922
9 Liu, Yuliang, et al. "Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection" Int J Comput Vis 129, 1972-1992 2021. DOI: https://doi.org/10.1007/s11263-021-01459-7   DOI
10 Max Jaderberg, Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. "Reading Text in the Wild with Convolutional Neural Networks" Int J Comput Vis 116, 1-20 May. 2015. DOI: https://doi.org/10.1007/s11263-015-0823-z   DOI
11 Zhang, Xiangyu, et al. "Shufflenet: An extremely efficient convolutional neural network for mobile devices." IEEE conference on computer vision and pattern recognition (CVPR), pp. 6848-6856 June. 2018. DOI: 10.1109/CVPR.2018.00716
12 Baek, Youngmin, et al. "Character region awareness for text detection." IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9365-9374 Apr. 2019. DOI: 10.1109/CVPR.2019.00959
13 Cai, Hongxiang, et al. "Revisiting Classification Perspective on Scene Text Recognition." arXiv preprint arXiv:2102.10884 Jun. 2021.
14 Iandola, Forrest N., et al. "SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and< 0.5 MB model size." arXiv preprint arXiv:1602.07360 Dec. 2017.
15 Huang, Gao, et al. "Densely connected convolutional networks." IEEE conference on computer vision and pattern recognition (CVPR), pp. 4700-4708 Jan. 2018. DOI: 10.1109/CVPR.2017.243
16 Tan, Mingxing, and Quoc Le. "Efficientnet: Rethinking model scaling for convolutional neural networks." International Conference on Machine Learning (PMLR), pp. 97:6105-6114 May. 2019.
17 Cui, Mengmeng, et al. "Representation and Correlation Enhanced Encoder-Decoder Framework for Scene Text Recognition." arXiv preprint arXiv:2106.06960 Jun. 2021.
18 Ye, Jian, et al. "TextFuseNet: Scene Text Detection with Richer Fused Features." IJCAI, pp. 516-522 July. 2020. DOI: 10.24963/ijcai.2020/72   DOI
19 Ankush Gupta, Andrea Vedaldi, and Andrew Zisserman. "Synthetic Data for Text Localisation in Natual Images" IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2315-2324 Apr. 2016. DOI: 10.1109/CVPR.2016.254
20 Howard, Andrew, et al. "Searching for mobilenetv3." IEEE/CVF International Conference on Computer Vision, pp. 1314-1324 Nov. 2019. DOI: 10.1109/ICCV.2019.00140