[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.5909/JBE.2022.27.3.308

Comparison of Artificial Intelligence Multitask Performance using Object Detection and Foreground Image

Jeong, Min Hyuk (Dept. of Computer Engineering, Myongji University)
Kim, Sang-Kyun (Dept. of Software Convergent, Myongji University)
Lee, Jin Young (ETRI)
Choo, Hyon-Gon (ETRI)
Lee, HeeKyung (ETRI)
Cheong, Won-Sik (ETRI)

Publication Information

Journal of Broadcast Engineering / v.27, no.3, 2022 , pp. 308-317 More about this Journal

Abstract

Researches are underway to efficiently reduce the size of video data transmitted and stored in the image analysis process using deep learning-based machine vision technology. MPEG (Moving Picture Expert Group) has newly established a standardization project called VCM (Video Coding for Machine) and is conducting research on video encoding for machines rather than video encoding for humans. We are researching a multitask that performs various tasks with one image input. The proposed pipeline does not perform all object detection of each task that should precede object detection, but precedes it only once and uses the result as an input for each task. In this paper, we propose a pipeline for efficient multitasking and perform comparative experiments on compression efficiency, execution time, and result accuracy of the input image to check the efficiency. As a result of the experiment, the capacity of the input image decreased by more than 97.5%, while the accuracy of the result decreased slightly, confirming the possibility of efficient multitasking.

Keywords

Video coding for machine; Object detection; Object tracking; Pose estimation; MPEG;

Citations & Related Records

Times Cited By KSCI : 1 (Citation Analysis)

Reference
Cited By KSCI

1	Y. Jang, D. Chung, "Technology Trend for Image Analysis Based on Deep Learning," Current Industrial and Technological Trends in Aerospace, vol.17, No.1, pp.113-122, July 2019.
2	M. Jeong, S. Kim, H. Jin, H. Lee, H. Choo, H. Lim, and J. Seo, "Experiment on the Effect of Feature Map Encoding on CNN Performance Evaluation," JOURNAL OF BROADCAST ENGINEERING, vol.25, No.7, pp.1081-1094, December 2020. doi: https://doi.org/10.5909/JBE.2020.25.7.1081 DOI
3	W. Lin, K. Dong, R. Yang, T. Wang, A. Zhang and D. Liu, "[VCM] Anchor generation for HiEve(object tracking)," ISO/IEC JTC1/SC29/WG02 m55761, Online, December 2020.
4	Github - facebookresearch/detectron2, https://github.com/facebookresearch/detectron2 (accessed Apr. 10, 2022).
5	VTM-12.0 jvet/VVCSoftware, https://vcgit.hhi.fraunhofer.de/jvet/VVCSoftware_VTM/-/tree/VTM-12.0 (accessed Apr.10, 2022).
6	Github - leoxiaobin/deep-high-resolution-net.pytorch, https://github.com/leoxiaobin/deep-high-resolution-net.pytorch(acces sed Apr.10, 2022).
7	J. Redmon and A. Farhadi, "YOLO v3: An Incremental Improvement", Computer Vision and Pattern Recognition, 2018. (accessed Apr. 10, 2022). doi: https://doi.org/10.48550/arXiv.1804.02767 DOI
8	H. Jin, M. Jeong, D. Yoo, S. Kim, J. Lee, H. Lee, and W. Cheong, "Compression of CNN Inference Results Using MPEG-7 Descriptor Binarization," Proceedings of the Korean Society of Broadcast Engineers Conference, pp.36-38, June 2021.
9	H. Lee, J. Lee, H. Choo, W. Cheong, J. Seo, "[VCM] Object of interest based VCM for multi-task," ISO/IEC JTC1/SC29/WG02 m58846, Online, January 2022
10	S. Wenkel, K. Alhazmi, T. Liiv, S. Alrshoud, M. Simon, "Confidence Score: The Forgotten Dimension of Object Detection Performance Evaluation," Sensors, Vol. 21, No.13: 4350, 2021, (accessed May. 3, 2022). doi: https://doi.org/10.3390/s21134350. DOI
11	Github - Zhongdao/Towards-Realtime-MOT, https://github.com/Zhongdao/Towards-Realtime-MOT (accessed Apr.10, 2022).
12	FFmpeg, https://ffmpeg.org/ (accessed Apr. 10, 2022).

KSCI

Comparison of Artificial Intelligence Multitask Performance using Object Detection and Foreground Image 물체탐색과 전경영상을 이용한 인공지능 멀티태스크 성능 비교

Comparison of Artificial Intelligence Multitask Performance using Object Detection and Foreground Image