Browse > Article
http://dx.doi.org/10.5909/JBE.2020.25.3.386

Development of Python-based Annotation Tool Program for Constructing Object Recognition Deep-Learning Model  

Lim, Song-Won (Seoul National University of Science And Technology)
Park, Goo-man (Seoul National University of Science And Technology)
Publication Information
Journal of Broadcast Engineering / v.25, no.3, 2020 , pp. 386-398 More about this Journal
Abstract
We developed an integrative annotation program that can perform data labeling process for deep learning models in object recognition. The program utilizes the basic GUI library of Python and configures crawler functions that allow data collection in real time. Retinanet was used to implement an automatic annotation function. In addition, different data labeling formats for Pascal-VOC, YOLO and Retinanet were generated. Through the experiment of the proposed method, a domestic vehicle image dataset was built, and it is applied to Retinanet and YOLO as the training and test set. The proposed system classified the vehicle model with the accuracy of about 94%.
Keywords
Annotation; GUI(Graphical User Interface); Tkinter; Crawling; Retinanet; YOLO;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Mark Everingham, Luc Van Gool, The PASCAL Visual Object Classes (VOC) Challenge, Int J Comput Vis (2010), Received: 30 July 2008 / Accepted: 16 July 2009 / Published online: 9 September 2009.
2 Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi "You Only Look Once: Unified, Real," Computer Vision and Pattern Recognition 2016, pp. 779-788, June 2016, arXiv:1506.02640.
3 Tsung-Yi Lin, Focal Loss for Dense Object Detection, Computer Vision and Pattern Recognition (cs.CV), Submitted on 7 Aug 2017 (v1), last revised 7 Feb.
4 Mr. Pankaj S. Parsania, A Comparative Analysis of Image Interpolation Algorithms, International Journal of Advanced Research in Computer and Communication Engineering Vol. 5, Issue 1, January 2016.
5 Karen Simonyan, VERY DEEP CONVOLUTIONAL NETWORK SFORLARGE-SCALE IMAGE RECOGNITION, Published as a conference paper at ICLR 2015.
6 Joseph Redmon, YOLO9000: Better, Faster, Stronger, Computer Vision and Pattern Recognition, Submitted on 25 Dec 2016, arXiv:1612.08242.
7 3D Object Representations for Fine-Grained Categorization, Kaggle, Stanford Cars Dataset. Jonathan Krause, Michael Stark, Jia Deng, Li Fei-Fei, 4th IEEE Workshop on 3D Representation and Recognition, at ICCV 2013 (3dRR-13). Sydney, Australia. Dec. 8, 2013.
8 J. Hu, L. Shen, and G. Sun. Squeeze-and-excitation networks. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 7132-7141, 2018.
9 E. D. Cubuk, B. Zoph, D. Mane, V. Vasudevan, and Q. V. Le. Autoaugment: Learning augmentation strategies from data. In Proceedings of the IEEE conference on computer vision and pattern recognition, 2019.
10 Shaoqing Ren, Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks, Computer Vision and Pattern Recognition, Submitted on 4 Jun 2015 (v1), last revised 6 Jan 2016.
11 Tsung-Yi Lin, Microsoft COCO: Common Objects in Context, Computer Vision and Pattern Recognition (cs.CV), Submitted on 1 May 2014 (v1) last revised 21 Feb 2015 (this version, v3), arXiv:1405.0312 [cs.CV]