Browse > Article
http://dx.doi.org/10.5909/JBE.2022.27.1.56

A Feature Map Compression Method for Multi-resolution Feature Map with PCA-based Transformation  

Park, Seungjin (Department of Computer Engineering, Kwangwoon University)
Lee, Minhun (Department of Computer Engineering, Kwangwoon University)
Choi, Hansol (Department of Computer Engineering, Kwangwoon University)
Kim, Minsub (Department of Computer Engineering, Kwangwoon University)
Oh, Seoung-Jun (Department of Electronic Engineering, Kwangwoon University)
Kim, Younhee (Electronics and Telecommunications Research Institute)
Do, Jihoon (Electronics and Telecommunications Research Institute)
Jeong, Se Yoon (Electronics and Telecommunications Research Institute)
Sim, Donggyu (Department of Computer Engineering, Kwangwoon University)
Publication Information
Journal of Broadcast Engineering / v.27, no.1, 2022 , pp. 56-68 More about this Journal
Abstract
In this paper, we propose a compression method for multi-resolution feature maps for VCM. The proposed compression method removes the redundancy between the channels and resolution levels of the multi-resolution feature map through PCA-based transformation. According to each characteristic, the basis vectors and mean vector used for transformation, and the transformation coefficient obtained through the transformation are compressed using a VVC-based coder and DeepCABAC. In order to evaluate performance of the proposed method, the object detection performance was measured for the OpenImageV6 and COCO 2017 validation set, and the BD-rate of MPEG-VCM anchor and feature map compression anchor proposed in this paper was compared using bpp and mAP. As a result of the experiment, the proposed method shows a 25.71% BD-rate performance improvement compared to feature map compression anchor in OpenImageV6. Furthermore, for large objects of the COCO 2017 validation set, the BD-rate performance is improved by up to 43.72% compared to the MPEG-VCM anchor.
Keywords
Video coding for machine; Principal component analysis; Feature map compression; Moving picture expert group;
Citations & Related Records
연도 인용수 순위
  • Reference
1 VTM12.0, https://vcgit.hhi.fraunhofer.de/jvet/VVCSoftware_VTM/-/tree/VTM-12.0 (accessed Nov. 26, 2021).
2 M. Rafie, Y. Zhang, and S. Liu, "[VCM] Evaluation Framework for Video Coding for Machines," ISO/IEC JTC 1/SC 29/WG 2, m58385, Online, Oct. 2021.
3 Y. LeCun, Y. Bengio, and G. E. Hinton, "Deep learning," Nature, vol. 512, pp. 436-444, 2015.   DOI
4 G. Sullivan, J. Ohm, W. Han, and T. Wiegand, "Overview of the high efficiency video coding (HEVC) standard," IEEE Transactions on Circuits and Systems for Video Technology, Vol. 22, No. 12, pp. 1649-1668, Dec. 2012.   DOI
5 S. Wang, C. Lin, C. Lin, T., and Y. Nie, "[VCM] Enable IBC in VTM8.2 for VCM," ISO/IEC JTC 1/SC 29/WG 2, m56792, Online, Apr. 2021.
6 S. Wang, C. Lin, and C. Lin (ITRI), "[VCM] A study on impact of coding tools on machine vision performance and visual quality," ISO/IEC JTC 1/SC 29/WG 2, m56867, Online, Apr. 2021.
7 Y. Lee, S., K. Yoon, H. Lim, H. Choo, W. Cheong, and J. Seo, "[VCM] Updated FLIR Anchor results for object detection," ISO/IEC JTC 1/SC 29/WG 2, m57375, Online, Jul. 2021.
8 S. Wang, Z. Wang, Y. Ye, and S. Wang, "[VCM] End-to-end image compression towards machine vision for object detection," ISO/IEC JTC 1/SC 29/WG 2, m57500, Online, Jul. 2021.
9 J. Do, J. Lee, Y. Kim, S. Yoon J., and J. Choi, "[VCM] Experimental Results of Feature Compression using CompressAI," ISO/IEC JTC 1/SC 29/WG 2, m56716, Online, Apr. 2021.
10 M. F. Mahmood, N. Hussin, "Information in Conversion Era: Impact and Influence from 4th Industrial Revolution," International Journal of Academic Research in Business and Social Sciences, Vol. 8, No. 9, pp. 320-328, 2018
11 B. Bross, Y. K. Wang, Y. Ye, S. Liu, and J. Chen, Overview of the versatile video coding (VVC) standard and its applications," IEEE Transactions on Circuits and Systems for Video Technology, Vol 31, No 10, pp. 3736-3764, 2021.   DOI
12 W. Gao, X. Xu, and S. Liu, "[VCM] Response to CfE: Investigation of VVC Codec for Video Coding for Machine," ISO/IEC JTC 1/SC 29/WG 2, m56681, Online, Apr. 2021.
13 S. Kim, M. Jeong, H. Jin H. Lee, H. Choo, H. Lim, and J. Seo, "[VCM] A report on intermediate feature coding for object detection and segmentation," ISO/IEC JTC 1/SC 29/WG 2, m55243, Online, Oct. 2020.
14 B. Zhu, L. Yu, and D. Li, "[VCM] Deep learning-based compression for machine vision," ISO/IEC JTC 1/SC 29/WG 2, m57335, Online, Jul. 2021.
15 H. Han, H. Choi, S. Kwak, J. Yun, W. Cheong, and J. Seo, "[VCM] Investigation on feature map channel reordering and compression for object detection," ISO/IEC JTC 1/SC 29/WG 2, m56653, Online, Apr. 2021.
16 COCO2017 validation set, https://cocodataset.org/#download (accessed Nov. 26, 2021).
17 M. Rafie, Y. Zhang, and S. Liu, "[VCM] Call for Evidence for Video Coding for Machines," ISO/IEC JTC 1/SC 29/WG 2, m56995, Online, Apr. 2021.
18 G. Bjontegaard, "Calculation of average PSNR differences between RDcurves," Tech. Rep. VCEGM33, Video Coding Experts Group (VCEG), 2001.
19 S Wang, Z. Wang, Y. Ye, and S Wang, "[VCM] Image or video format of feature map compression for object detection," ISO/IEC JTC 1/SC 29/WG 2, m55786, Online, Jan. 2021.
20 OpenImageV6, https://storage.googleapis.com/openimages/web/download.html (accessed Nov. 26, 2021)
21 L. Jinhua, T. Zhang, and G. Feng. "Channel Compression: Rethinking Information Redundancy Among Channels in CNN Architecture," IEEE Access, Vol. 8, pp. 147265-147274, 2020.   DOI
22 S. Wang, Z. Wang, Y. Ye, and S. Wang, "[VCM] Investigation on feature map layer selection for object detection and compression," ISO/IEC JTC 1/SC 29/WG 2, m55787, Online, Dec. 2020.
23 S. Xie, R. Girshick, P. Dollar, Z. Tu, and K. He, "Aggregated Residual Transformations for Deep Neural Networks," arXiv, 2017.
24 S. Wiedemann et al., "DeepCABAC: A universal compression algorithm for deep neural networks," IEEE J. Sel. Topics Signal Process., Vol. 14, No. 4, pp. 700-714, May 2020.   DOI