DOI QR코드

DOI QR Code

A Study on Algorithm Selection and Comparison for Improving the Performance of an Artificial Intelligence Product Recognition Automatic Payment System

  • Kim, Heeyoung (Department of Immersive Content Convergence, General graduate school, Kwangwoon University) ;
  • Kim, Dongmin (JLK Inc.) ;
  • Ryu, Gihwan (Department of Tourism Industry, Graduate school of smart convergence, Kwangwoon University) ;
  • Hong, Hotak (AI R&D Center, JLK Inc.)
  • Received : 2022.01.20
  • Accepted : 2022.03.08
  • Published : 2022.03.31

Abstract

This study is to select an optimal object detection algorithm for designing a self-checkout counter to improve the inconvenience of payment systems for products without existing barcodes. To this end, a performance comparison analysis of YOLO v2, Tiny YOLO v2, and the latest YOLO v5 among deep learning-based object detection algorithms was performed to derive results. In this paper, performance comparison was conducted by forming learning data as an example of 'donut' in a bakery store, and the performance result of YOLO v5 was the highest at 96.9% of mAP. Therefore, YOLO v5 was selected as the artificial intelligence object detection algorithm to be applied in this paper. As a result of performance analysis, when the optimal threshold was set for each donut, the precision and reproduction rate of all donuts exceeded 0.85, and the majority of donuts showed excellent recognition performance of 0.90 or more. We expect that the results of this paper will be helpful as the fundamental data for the development of an automatic payment system using AI self-service technology that is highly usable in the non-face-to-face era.

Keywords

References

  1. J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection", In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 779-788, 2016.
  2. J. Redmon and A. Farhadi, "YOLO9000: better, faster, stronger", In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7263-7271, December 2017.
  3. D. Thuan, "Evolution of yolo algorithm and yolov5: the state-of- the-art object detection algorithm", Spring 2021.
  4. K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition", arXiv preprint arXiv:1409.1556, September 2014.
  5. K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition", In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770-778, 2016.
  6. S. Xie, R. Girshick, P. Dollar, Z, Tu, and K. He, "Aggregated residual transformations for deep neural networks", In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1492-1500, 2017.
  7. J. Redmon and A. Farhadi, "YOLOv3: An incremental improvement", arXiv preprint arXiv:1804.02767, April 2018.
  8. S. Ren, K. He, R. Girshick, and J. Sun, "Faster R-CNN: towards real-time object detection with region proposal networks", IEEE transactions on pattern analysis and machine intelligence, Vol. 39, No. 6, pp. 1137-1149, June 2016. DOI: 10.1109/TPAMI.2016.2577031
  9. J. Dai, Y. Li, K. He, and J. Sun, "R-FCN: Object detection via region-based fully convolutional networks", In Advances in Neural Information Processing Systems (NIPS), pp. 379-387, 2016.
  10. W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C. Fu, and A. C. Berg, "SSD: Single shot multibox detector", In European Conference on Computer Vision (ECCV), pp. 21-37, September 2016. DOI: 10.1007/978-3-319-46448-0_2
  11. T. Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollar, "Focal loss for dense object detection", In Proceedings of the IEEE International Conference on Computer Vision (ICCV), pp. 2980-2988, October 2017.
  12. Y. J. Kim, "Deep learning network architecture design for 3D sculpture identification", Ph.D. Thesis. University of Sangmyung, Seoul, 2020.
  13. J. Redmon, "Darknet: Open Source Neural Networks in c", 2013. [Online]. Available: https://pjreddie.com/darknet