Browse > Article
http://dx.doi.org/10.5909/JBE.2022.27.3.308

Comparison of Artificial Intelligence Multitask Performance using Object Detection and Foreground Image  

Jeong, Min Hyuk (Dept. of Computer Engineering, Myongji University)
Kim, Sang-Kyun (Dept. of Software Convergent, Myongji University)
Lee, Jin Young (ETRI)
Choo, Hyon-Gon (ETRI)
Lee, HeeKyung (ETRI)
Cheong, Won-Sik (ETRI)
Publication Information
Journal of Broadcast Engineering / v.27, no.3, 2022 , pp. 308-317 More about this Journal
Abstract
Researches are underway to efficiently reduce the size of video data transmitted and stored in the image analysis process using deep learning-based machine vision technology. MPEG (Moving Picture Expert Group) has newly established a standardization project called VCM (Video Coding for Machine) and is conducting research on video encoding for machines rather than video encoding for humans. We are researching a multitask that performs various tasks with one image input. The proposed pipeline does not perform all object detection of each task that should precede object detection, but precedes it only once and uses the result as an input for each task. In this paper, we propose a pipeline for efficient multitasking and perform comparative experiments on compression efficiency, execution time, and result accuracy of the input image to check the efficiency. As a result of the experiment, the capacity of the input image decreased by more than 97.5%, while the accuracy of the result decreased slightly, confirming the possibility of efficient multitasking.
Keywords
Video coding for machine; Object detection; Object tracking; Pose estimation; MPEG;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 Y. Jang, D. Chung, "Technology Trend for Image Analysis Based on Deep Learning," Current Industrial and Technological Trends in Aerospace, vol.17, No.1, pp.113-122, July 2019.
2 M. Jeong, S. Kim, H. Jin, H. Lee, H. Choo, H. Lim, and J. Seo, "Experiment on the Effect of Feature Map Encoding on CNN Performance Evaluation," JOURNAL OF BROADCAST ENGINEERING, vol.25, No.7, pp.1081-1094, December 2020. doi: https://doi.org/10.5909/JBE.2020.25.7.1081   DOI
3 W. Lin, K. Dong, R. Yang, T. Wang, A. Zhang and D. Liu, "[VCM] Anchor generation for HiEve(object tracking)," ISO/IEC JTC1/SC29/WG02 m55761, Online, December 2020.
4 Github - facebookresearch/detectron2, https://github.com/facebookresearch/detectron2 (accessed Apr. 10, 2022).
5 VTM-12.0 jvet/VVCSoftware, https://vcgit.hhi.fraunhofer.de/jvet/VVCSoftware_VTM/-/tree/VTM-12.0 (accessed Apr.10, 2022).
6 Github - leoxiaobin/deep-high-resolution-net.pytorch, https://github.com/leoxiaobin/deep-high-resolution-net.pytorch(acces sed Apr.10, 2022).
7 J. Redmon and A. Farhadi, "YOLO v3: An Incremental Improvement", Computer Vision and Pattern Recognition, 2018. (accessed Apr. 10, 2022). doi: https://doi.org/10.48550/arXiv.1804.02767   DOI
8 H. Jin, M. Jeong, D. Yoo, S. Kim, J. Lee, H. Lee, and W. Cheong, "Compression of CNN Inference Results Using MPEG-7 Descriptor Binarization," Proceedings of the Korean Society of Broadcast Engineers Conference, pp.36-38, June 2021.
9 H. Lee, J. Lee, H. Choo, W. Cheong, J. Seo, "[VCM] Object of interest based VCM for multi-task," ISO/IEC JTC1/SC29/WG02 m58846, Online, January 2022
10 S. Wenkel, K. Alhazmi, T. Liiv, S. Alrshoud, M. Simon, "Confidence Score: The Forgotten Dimension of Object Detection Performance Evaluation," Sensors, Vol. 21, No.13: 4350, 2021, (accessed May. 3, 2022). doi: https://doi.org/10.3390/s21134350.   DOI
11 Github - Zhongdao/Towards-Realtime-MOT, https://github.com/Zhongdao/Towards-Realtime-MOT (accessed Apr.10, 2022).
12 FFmpeg, https://ffmpeg.org/ (accessed Apr. 10, 2022).