A Feature Map Compression Method for Multi-resolution Feature Map with PCA-based Transformation |
Park, Seungjin
(Department of Computer Engineering, Kwangwoon University)
Lee, Minhun (Department of Computer Engineering, Kwangwoon University) Choi, Hansol (Department of Computer Engineering, Kwangwoon University) Kim, Minsub (Department of Computer Engineering, Kwangwoon University) Oh, Seoung-Jun (Department of Electronic Engineering, Kwangwoon University) Kim, Younhee (Electronics and Telecommunications Research Institute) Do, Jihoon (Electronics and Telecommunications Research Institute) Jeong, Se Yoon (Electronics and Telecommunications Research Institute) Sim, Donggyu (Department of Computer Engineering, Kwangwoon University) |
1 | VTM12.0, https://vcgit.hhi.fraunhofer.de/jvet/VVCSoftware_VTM/-/tree/VTM-12.0 (accessed Nov. 26, 2021). |
2 | M. Rafie, Y. Zhang, and S. Liu, "[VCM] Evaluation Framework for Video Coding for Machines," ISO/IEC JTC 1/SC 29/WG 2, m58385, Online, Oct. 2021. |
3 | Y. LeCun, Y. Bengio, and G. E. Hinton, "Deep learning," Nature, vol. 512, pp. 436-444, 2015. DOI |
4 | G. Sullivan, J. Ohm, W. Han, and T. Wiegand, "Overview of the high efficiency video coding (HEVC) standard," IEEE Transactions on Circuits and Systems for Video Technology, Vol. 22, No. 12, pp. 1649-1668, Dec. 2012. DOI |
5 | S. Wang, C. Lin, C. Lin, T., and Y. Nie, "[VCM] Enable IBC in VTM8.2 for VCM," ISO/IEC JTC 1/SC 29/WG 2, m56792, Online, Apr. 2021. |
6 | S. Wang, C. Lin, and C. Lin (ITRI), "[VCM] A study on impact of coding tools on machine vision performance and visual quality," ISO/IEC JTC 1/SC 29/WG 2, m56867, Online, Apr. 2021. |
7 | Y. Lee, S., K. Yoon, H. Lim, H. Choo, W. Cheong, and J. Seo, "[VCM] Updated FLIR Anchor results for object detection," ISO/IEC JTC 1/SC 29/WG 2, m57375, Online, Jul. 2021. |
8 | S. Wang, Z. Wang, Y. Ye, and S. Wang, "[VCM] End-to-end image compression towards machine vision for object detection," ISO/IEC JTC 1/SC 29/WG 2, m57500, Online, Jul. 2021. |
9 | J. Do, J. Lee, Y. Kim, S. Yoon J., and J. Choi, "[VCM] Experimental Results of Feature Compression using CompressAI," ISO/IEC JTC 1/SC 29/WG 2, m56716, Online, Apr. 2021. |
10 | M. F. Mahmood, N. Hussin, "Information in Conversion Era: Impact and Influence from 4th Industrial Revolution," International Journal of Academic Research in Business and Social Sciences, Vol. 8, No. 9, pp. 320-328, 2018 |
11 | B. Bross, Y. K. Wang, Y. Ye, S. Liu, and J. Chen, Overview of the versatile video coding (VVC) standard and its applications," IEEE Transactions on Circuits and Systems for Video Technology, Vol 31, No 10, pp. 3736-3764, 2021. DOI |
12 | W. Gao, X. Xu, and S. Liu, "[VCM] Response to CfE: Investigation of VVC Codec for Video Coding for Machine," ISO/IEC JTC 1/SC 29/WG 2, m56681, Online, Apr. 2021. |
13 | S. Kim, M. Jeong, H. Jin H. Lee, H. Choo, H. Lim, and J. Seo, "[VCM] A report on intermediate feature coding for object detection and segmentation," ISO/IEC JTC 1/SC 29/WG 2, m55243, Online, Oct. 2020. |
14 | B. Zhu, L. Yu, and D. Li, "[VCM] Deep learning-based compression for machine vision," ISO/IEC JTC 1/SC 29/WG 2, m57335, Online, Jul. 2021. |
15 | H. Han, H. Choi, S. Kwak, J. Yun, W. Cheong, and J. Seo, "[VCM] Investigation on feature map channel reordering and compression for object detection," ISO/IEC JTC 1/SC 29/WG 2, m56653, Online, Apr. 2021. |
16 | COCO2017 validation set, https://cocodataset.org/#download (accessed Nov. 26, 2021). |
17 | M. Rafie, Y. Zhang, and S. Liu, "[VCM] Call for Evidence for Video Coding for Machines," ISO/IEC JTC 1/SC 29/WG 2, m56995, Online, Apr. 2021. |
18 | G. Bjontegaard, "Calculation of average PSNR differences between RDcurves," Tech. Rep. VCEGM33, Video Coding Experts Group (VCEG), 2001. |
19 | S Wang, Z. Wang, Y. Ye, and S Wang, "[VCM] Image or video format of feature map compression for object detection," ISO/IEC JTC 1/SC 29/WG 2, m55786, Online, Jan. 2021. |
20 | OpenImageV6, https://storage.googleapis.com/openimages/web/download.html (accessed Nov. 26, 2021) |
21 | L. Jinhua, T. Zhang, and G. Feng. "Channel Compression: Rethinking Information Redundancy Among Channels in CNN Architecture," IEEE Access, Vol. 8, pp. 147265-147274, 2020. DOI |
22 | S. Wang, Z. Wang, Y. Ye, and S. Wang, "[VCM] Investigation on feature map layer selection for object detection and compression," ISO/IEC JTC 1/SC 29/WG 2, m55787, Online, Dec. 2020. |
23 | S. Xie, R. Girshick, P. Dollar, Z. Tu, and K. He, "Aggregated Residual Transformations for Deep Neural Networks," arXiv, 2017. |
24 | S. Wiedemann et al., "DeepCABAC: A universal compression algorithm for deep neural networks," IEEE J. Sel. Topics Signal Process., Vol. 14, No. 4, pp. 700-714, May 2020. DOI |