DOI QR코드

DOI QR Code

Sub-Frame Analysis-based Object Detection for Real-Time Video Surveillance

  • Received : 2019.10.15
  • Accepted : 2019.10.30
  • Published : 2019.11.30

Abstract

We introduce a vision-based object detection method for real-time video surveillance system in low-end edge computing environments. Recently, the accuracy of object detection has been improved due to the performance of approaches based on deep learning algorithm such as Region Convolutional Neural Network(R-CNN) which has two stage for inferencing. On the other hand, one stage detection algorithms such as single-shot detection (SSD) and you only look once (YOLO) have been developed at the expense of some accuracy and can be used for real-time systems. However, high-performance hardware such as General-Purpose computing on Graphics Processing Unit(GPGPU) is required to still achieve excellent object detection performance and speed. To address hardware requirement that is burdensome to low-end edge computing environments, We propose sub-frame analysis method for the object detection. In specific, We divide a whole image frame into smaller ones then inference them on Convolutional Neural Network (CNN) based image detection network, which is much faster than conventional network designed forfull frame image. We reduced its computationalrequirementsignificantly without losing throughput and object detection accuracy with the proposed method.

Keywords

References

  1. J. Redmon and A. Farhadi, "Yolo9000: Better, faster, stronger. In Computer Vision and Pattern Recognition (CVPR)," 2017 IEEE Conference on, pp. 6517-6525. IEEE, 2017.
  2. J. Redmon and A. Farhadi, "Yolov3: An incremental improvement," arXiv preprint arXiv:1804.02767, April 2018.
  3. R. Girshick, "Fast r-cnn," In Proceedings of the IEEE international conference on computer vision, pages 1440-1448, 2015.
  4. S.B. Lee, H.G. Kim, H.K.Seok, J.H. Nang, "Comparison of Fine-Tuned Convolutional Neural Networks for Clipart Style Classification" International Journal of Internet, Broadcasting and Communication(IJIBC), Vol.10 No.4, pp.50-64, 2018 DOI: https://doi.org/10.7236/IJIIBC.2018.10.4.50
  5. Martin Danelljan, Gustav Hager, Fahad Shahbaz Khan and Michael Felsberg, "Accurate Scale Estimation for Robust Visual Tracking," In Proceedings British Machine Vision Conference, 2014. DOI: https://doi.org/10.5244/C.28.65
  6. Seyed Yahya Nikouei, Yu Li Chen, Sejun Song, Ronghua Xu, Baek-Young Choi, Timothy R. Faughnan, "Real-Time Human Detection as an Edge Service Enabled by a Lightweight CNN," IEEE International Conference on Edge Computing (EDGE), 2018.
  7. H.M. Kwon, V. Kumaran, and S. Gupta, “Real-time Tracking and Identification for Multi-Camera Surveillance System,” The Journal of the Institute of Internet, Broadcasting and Communication(JIIBC), Vol. 10, No. 1, pp. 16-22, Feb. 2018. DOI: https://doi.org/10.7236/IJIBC.2018.10.1.3.
  8. A. Bewley, Z. Ge, L. Ott, F. Ramos, and B. Upcroft, "Simple Online and Realtime Tracking," 2016 IEEE International Conference on Image Processing (ICIP), August 2016. DOI: https://doi.org/10.1109/ICIP.2016.7533003
  9. L. Leal-Taixe, C. Canton-Ferrer, and K. Schindler, "Learning by Tracking: Siamese CNN for Robust Target Association," In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 33 - 40, 2016.
  10. N. Wojke, A. Bewley and D. Paulus, "Simple online and realtime tracking with a deep association metric," Proceedings International Conference on Image Processing, ICIP, 2017 Sep 17 - 20, 2018. DOI: https://doi.org/10.1109/ICIP.2017.8296962
  11. Y. Xiang, A. Alahi, and S. Savarese, "Learning to track: Online multi-object tracking by decision making," In Proceedings of the IEEE International Conference on Computer Vision, pp. 4705 - 4713, 2015.
  12. N. Le, A. Heili, and J. M. Odobez, "Long-termtime-sensitive costs for CRF-based tracking by detection," In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016.
  13. Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," In CVPR, pp. 779 - 788, 2016.
  14. T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. L. Zitnick, "Microsoft COCO: Common objects in context," In ECCV, pp. 740 - 755, 2014. DOI: https://doi.org/10.1007/978-3-319-10602-1_48