DOI QR코드

DOI QR Code

Linear Sub-band Decomposition based Pre-processing Algorithm for Perceptual Video Coding

지각적 동영상 부호화를 위한 선형 부 대역 분해 기반 전처리 기법

  • Received : 2016.05.12
  • Accepted : 2016.12.12
  • Published : 2017.01.25

Abstract

This paper proposes a pre-processing algorithm to improve perceptual video coding efficiency which decomposes an input frame via a sub-band decomposition, and suppresses only high frequency band(s) having low visual sensitivity. First, we decompose the input frame into several frequency subbands by a linear sub-band decomposition. Next, high frequency subband(s) which is rarely recognized by human visual system (HVS) is suppressed by applying relatively small gain(s). Finally, the high frequency suppressed frame is compressed by a specific video encoder. We can find from the experimental results that if comparing before-use and after-use of the proposed pre-processing prior to the encoder, no visual difference is shown. Also, the proposed algorithm achieves bit-saving of 13.12% on average in a H.264 video encoder.

본 논문에서는 선형 부 대역 분해를 이용하여 입력 영상을 분해하고 시각적 민감도가 낮은 고주파 영역만을 효과적으로 억제하여 지각적 동영상 부호화의 효율을 향상시킬 수 있는 전처리 기법을 제안한다. 먼저 소정의 선형 부 대역 분해로 각 입력영상을 여러 주파수 대역들로 나눈다. 그런 다음 인간의 시각적 구조에서 거의 인지가 되지 않는 고주파 대역들에만 1보다 작은 이득 값들을 적용하여 해당 주파수 대역 정보를 억제시킨다. 이와 같이 고주파가 억제된 영상들을 소정의 비디오 인코더로 압축한다. 모의 실험을 통해 제안 기법의 적용 전후 압축 결과를 비교하여 시각적 차이 없음을 확인하였다. 또한, 제안 기법을 H.264 인코더의 전처리로 적용하였을 때 적용 전 대비 평균 13.12%의 데이터 감소 효과를 얻었다.

Keywords

References

  1. T. Wiegand, G. J. Sullivan, G. Bjntegaard, A. Luthra, "Overview of the H.264/AVC Video Coding Standard," IEEE Trans. Circuits Syst. Video Technol., vol. 13, no. 7, pp. 560-576. July 2003. https://doi.org/10.1109/TCSVT.2003.815165
  2. G. J. Sullivan, J.-R. Ohm, W.-J. Han, T. Wiegand, "Overview of the high efficiency video coding (HEVC) standard," IEEE Trans. Circuits Syst. Video Technol., vol. 22, no. 12, pp. 1649-1668, Dec. 2012. https://doi.org/10.1109/TCSVT.2012.2221191
  3. J.-R. Ohm, G. J. Sullivan, H. Schwarz, T. K. Tan, T. Wiegand, "Comparison of the coding efficiency of video coding standards-including high efficiency video coding (HEVC)," IEEE Trans. Circuits Syst. Video Technol., vol. 22, no. 12, pp. 1669-1684, Dec. 2012. https://doi.org/10.1109/TCSVT.2012.2221192
  4. H. R. Wu and K. R. Rao, Digital video image quality and perceptual coding, Boca Raton, FL: CRC Press, Nov. 2005.
  5. C.-W. Tang, C.-H. Chen, Y.-H. Yu, and C.-J. Tsai, "Visual sensitivity guided it allocation for video coding," IEEE Trans. Multimedia, vol. 8, no. 1, pp. 11-18, Feb. 2006. https://doi.org/10.1109/TMM.2005.861295
  6. Z. Li, S. Qin, and L. Itti, "Visual attention guided bit allocation in video compression," Image Vis. Comput., vol. 29, no. 1, pp. 1-14, Jan. 2011. https://doi.org/10.1016/j.imavis.2010.07.001
  7. J. Kim, S. H. Bae, and M. C. Kim, "An HEVC-compliant perceptual video coding scheme based on JND models for variable block-sized transform kernels" IEEE Trans. Circuits Syst. Video Technol., vol. 25, no. 11, pp. 1786-1800, Nov. 2015. https://doi.org/10.1109/TCSVT.2015.2389491
  8. M. Naccari and F. Pereira, "Advanced H.264/AVC-based perceptual video coding: architecture, tools, and assessment," IEEE Trans. Circuits Syst. Video Technol., vol. 21, no. 6, pp. 766-782, Jun. 2011. https://doi.org/10.1109/TCSVT.2011.2130430
  9. R. Vanam, L. J. Kerofsky, and Y. A. Reznik, "Perceptual pre-processing filter for adaptive video on demand content delivery," Proc. IEEE Int. Conf. on Image Processing, pp. 2537-2541, Paris, France, Oct. 2014.
  10. H. Oh and W. Kim, "Video processing for human perceptual visual quality-oriented video coding," IEEE Trans. Image Process., vol. 22, no. 4, pp. 1526-1535, Apr. 2013. https://doi.org/10.1109/TIP.2012.2233485
  11. Z. Wei and K. N. Ngan, "Spatio-temporal just noticeable distortion profile for gray scale image/video in DCT domain," IEEE Trans. Circuits Syst. Video Technol., vol. 19, no. 3, pp. 337-346, Mar. 2009. https://doi.org/10.1109/TCSVT.2009.2013518
  12. J. H. Jang, B. Choi, S. D. Kim, and J. B. Ra, "Sub-band decomposed multiscale retinex with space varying gain," Proc. IEEE Int. Conf. on Image Processing, pp. 3168-3171, San Diego, USA, Oct. 2008.
  13. Joint Video Team of ITU-T VCEG and ISO/IEC MPEG, Joint Model Reference Software, version 9.0.
  14. HM Reference Software 16.7. [Online]. Available: https://hevc.hhi.fraunhofer.de/trac/hevc/browser/tags/HM-16.7, accessed Feb. 11, 2016.
  15. ITU, "Methodology for the subjective assessment of the quality of television pictures," Geneva, Switzerland, ITU-R BT.500-11, 2002.
  16. Z. Wang, E. P. Simoncelli and A. C. Bovik, "Multi-scale structural similarity for image quality assessment," Proc. IEEE Asilomar Conf. Signals, Systems, Comput., vol. 2, pp. 1398-1402, California, USA, Nov. 2003.