Browse > Article
http://dx.doi.org/10.9717/kmms.2020.23.8.953

Automatic Extraction of Hangul Stroke Element Using Faster R-CNN for Font Similarity  

Jeon, Ja-Yeon (Dept. of IT Engineering, Graduate School, Sookmyung Women's University)
Park, Dong-Yeon (Dept. of IT Engineering, Sookmyung Women's University)
Lim, Seo-Young (Dept. of IT Engineering, Sookmyung Women's University)
Ji, Yeong-Seo (Dept. of IT Engineering, Sookmyung Women's University)
Lim, Soon-Bum (Dept. of IT Engineering and Research Institute of ICT Convergence, Sookmyung Women's University)
Publication Information
Abstract
Ever since media contents took over the world, the importance of typography has increased, and the influence of fonts has be n recognized. Nevertheless, the current Hangul font system is very poor and is provided passively, so it is practically impossible to understand and utilize all the shape characteristics of more than six thousand Hangul fonts. In this paper, the characteristics of Hangul font shapes were selected based on the Hangul structure of similar fonts. The stroke element detection training was performed by fine tuning Faster R-CNN Inception v2, one of the deep learning object detection models. We also propose a system that automatically extracts the stroke element characteristics from characters by introducing an automatic extraction algorithm. In comparison to the previous research which showed poor accuracy while using SVM(Support Vector Machine) and Sliding Window Algorithm, the proposed system in this paper has shown the result of 10 % accuracy to properly detect and extract stroke elements from various fonts. In conclusion, if the stroke element characteristics based on the Hangul structural information extracted through the system are used for similar classification, problems such as copyright will be solved in an era when typography's competitiveness becomes stronger, and an automated process will be provided to users for more convenience.
Keywords
Characteristics of Hangul Shape; Object Detection of Stroke Element; Automatic Extraction of Object Deletion; Hangul Font Similarity;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 H. Kim and S. Lim, “Emotion-based Hangul Font Recommendation System Using Crowdsourcing,” Cognitive Systems Research, Vol. 47, No. 1, pp. 214-225, 2018.   DOI
2 S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: Towards Real-time Object Detection with Region Proposal Networks," arXiv Preprint arXiv:1506.0149, 2016.
3 C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, "Rethinking the Inception Architecture for Computer Vision," arXiv Preprint arXiv:1512.00567, 2015.
4 R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, arXiv Preprint arXiv:1311.2524, 2014.
5 R. Girshick, "Fast R-CNN," Proceeding of IEEE International Conference on Computer Vision, arXiv Preprint arXiv:1504.08083, 2015.
6 C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, et al., "Going Deeper with Convolutions," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-9, 2015.
7 C. Szegedy, S. Ioffe, V. Vanhoucke, and A. Alemi, "Inception-v4, Inception-resnet and the Impact of Residual Connections on Learning," Proceedings of the 31st American Association for Artificial Intelligence Conference on Artificial Intelligence, pp. 4278-4284, 2017.
8 Hewlett-Packard Company, PANOSE 2.0 White Paper, Hewlett-Packard Document EWC-92-0015h, 1993.
9 Korean Typeface Development Research Center, Korean Font Dictionary, King Sejong Foundation, Republic of Korea, 2000.
10 Y. Won, S. Seo, and M. Song, Typography Cheon Il Yahwa: The Concept and Practice of Hangul Typography, Angraphix, Ahn Graphics Publishers, Republic of Korea, 2019.
11 D. Park, J. Jeon, Y. Ji, and S. Lim, "Design of Structure Map for Modeling Hierarchical Structure Information of Hangul," Proceeding of the Fall Conference of the Korea Multimedia Society, pp. 578-579, 2019.
12 J. Jeon and S. Lim, "Research on Hangul Stroke Element Classification and Automatic Extraction Using SVM and Sliding Window Algorithm," Proceeding of the Fall Conference of the Korea Multimedia Society, pp. 732, 2019.
13 J. Jeon and S. Lim, "Machine Learning Based Automatic Extraction of Stroke Elements in Hangul Font," Proceeding of the 8th Korea-Japan J oint Workshop on Complex Communication Sciences, pp. 53-55, 2020.
14 J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You Only Look Once: Unified, Realtime Object Detection," Proceeding of IEEE Conference on Computer Vision and Pattern Recognition, arXiv Preprint arXiv:1506.02640, 2016.
15 W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Fu, et al., "SSD: Single Shot Multibox Detector," Proceeding of European Conference on Computer Vision, pp 21-37, 2016.
16 CV-tricks, Zero to Hero: Guide to Object Detection Using Deep Learning: Faster RCNN, YOLO, SSD(2017), https://cv-tricks.com/object-detection/faster-r-cnn-yolo-ssd/ (accessed July 10, 2020).
17 IabelImg(2017), https://github.com/tzutalin/labelImg (accessed July 10, 2020).
18 Korea National Hangul Museum, Survey and Prospect of Font Industry in 2015, Korea National Hangul Museum Research Report, 2015.
19 KISDI, In the Era of the 4th Industrial Revolution, Media Content Survival Strategy, KISDI Premium Report 17-14, 2017.
20 H. Kim and S. Lim, “Shape Property Study of Hangul Font for Font Classification,” Journal of Korea Multimedia Society, Vol. 20, No. 9, pp. 1584-1595, 2017.   DOI
21 T. Chen, Z. Wang, N. Xu, and H. Jin, "Large-Scale Tag-based Font Retrieval With Generative Feature Learning," Proceeding of IEEE/CVF International Conference on Computer Vision, arXiv Preprint arXiv:1909.02072, 2019.
22 Z. Wang, J. Yang, H. Jin, E. Shechtman, A. Agarwala, J. Brandt, et al., "Deep Font: Identify Your Font from an Image," arXiv Preprint arXiv:1507.03196, 2015.
23 S. Jiang, Z. Wang, A. Hertzmann, H. Jin, and Y. Fu, "Visual Font Pairing," arXiv Preprint arXiv:1811.08015, 2018.