DOI QR코드

DOI QR Code

Adaptive Block-based Depth-map Coding Method

적응적 블록기반 깊이정보 맵 부호화 방법

  • Published : 2009.09.30

Abstract

This paper proposes an efficient depth-map coding method for generating virtual-view images in 3D-Video. Virtual-view images can be generated by the view-interpolation based on the depth-map of the image. A conventional video coding method such as H.264 has been used. However, a conventional video coding method does not consider the image characteristics of the depth-map. Therefore, this paper proposes an adaptive depth-map coding method that can select between the H.264/AVC coding scheme and the proposed gray-coded bit plane-based coding scheme in a unit of block. This improves the coding efficiency of the depth-map data. Simulation results show that the proposed method, in comparison with the H.264/AVC coding scheme, improves the average BD-rate savings by 7.43% and the average BD-PSNR gains by 0.5dB. It also improves the subjective picture quality of synthesized virtual-view images using decoded depth-maps.

본 논문에서는 3D-Video에서 가상시점 영상을 생성하는데 필요한 깊이정보 맵의 효율적인 부호화 방법을 제안한다. 가상시점 영상은 실사 영상의 깊이정보 맵을 이용한 시점보간(View Interpolation) 방법으로 생성된다. 일반적으로 깊이정보 맵의 부호화에는 H.264 등의 자연영상에 대한 동영상 부호화 방법을 그대로 적용하고 있는데, 이러한 부호화 방법은 깊이정보 맵의 특성을 고려하지 않은 방법이다. 따라서 본 논문에서는 깊이정보 맵의 영상특성을 고려하여, 맵 정보를 그레이 코드로 변환한 후 비트평면 단위로 부호화하는 방법과, H.264 부호화 방법을 블록단위로 적응적으로 선택하여 부호화하는 방법을 제안하였다. 실험결과로서 제안하는 방법의 성능은 H.264 부호화 방법에 비하여, BD-PSNR이 평균 0.5dB 향상되고, BD-rate는 평균 7.43% 감소되어 부호화효율이 우수함을 확인할 수 있었다. 또한 복원된 깊이정보 맵을 이용하여 생성된 가상시점 영상 간의 비교에서 제안하는 방법이 H.264 부호화 방법에 비해 주관적 화질이 크게 향상된 것을 확인할 수 있었다.

Keywords

References

  1. ISO/IEC JTC1/SC29/WG11, 'Text of 1SO/IEC 14496-10:200X/FDAM 1 Multi-view Video Coding,' N9978, Hannovcr, Germany, July 2008
  2. ISO/JEC JTC1/SC29/WG11, 'Applications and Requirements of 3D Video Coding, N10358, Lausanne, Switzerland, February 2009
  3. M. Tanimoto, 'Overview of Free Viewpoint Television,' Signal Processing: Image Communication, vol.21, no.6, pp.454-461, July 2006 https://doi.org/10.1016/j.image.2006.03.009
  4. ISO/JEC JTC1/SC29/WG11, 'Vision on 3D Video,' N10357, Lausanne, Switzerland, February 2009
  5. A. Smolic, K. Mueller, N. Stefanoski, J. Ostermann, A. Gotchev, G.B. Akar, G.A. Triantafyllidis and A.Koz: 'Coding Algorithms for 3DTV - A Survey,' IEEE Trans. on Circuits and Systems for Video Technology, Vol 7, Issue 11, pp. 1606-1621, November 2007 https://doi.org/10.1109/TCSVT.2007.909972
  6. ISO/IEC JTC1/SC29/WG11, 'Text of ISO/IEC FDJS 23002-3 Representation of Auxiliary Video and Supplemental Information,' N8768, Marrakech, Morocco, January 2007
  7. ITU-T Recommendation H.264 and ISO/IEC 14496-10 (MPEG-4 Part 10 AVC), 'Advanced Video Coding for Generic Audiovisual Services,' Version 1: March 2003, Version 2: May 2004, Version 3: March 2005, Version 4: September 2005, Version 5 and Version 6: Junc 2006, Version 7: April 2007, Version 8: July 2007
  8. T. Wiegand, H. Schwarz, A. Joch, F. Kossentini, and G.J. Sullivan, 'Rate-Constrained Coder Control and Comparison of Video Coding Standards,' IEEE Transactions on Circuits and Systems for Video Technology, Vol. 13, No 7, pp. 688-703, July 2003 https://doi.org/10.1109/TCSVT.2003.815168
  9. ISO/IEC 14496-2 (MPEG-4 Visual), 'Coding of Audio-Visual Objects - Part 2: Visual,' Version 1: April 1999. Version 2: February 2000, Version 3: May 2004
  10. N. Brady, and F. Bossen, 'Shape compression of moving objects using context-based arithmetic encoding.' Signal Processing: Image Communication, Volume 15, pp. 601-617(17), Number 7, May 2000 https://doi.org/10.1016/S0923-5965(99)00047-8
  11. N. Brady, F. Bossen, and N. Murphy, 'Context-based arithmetic encoding of 2D shape sequences,' Special session on shape coding (ICIP '97), vol.1 pp. 29-32, 1997 https://doi.org/10.1109/ICIP.1997.647376
  12. F. Heinrich-Hertz-Institut, 'H.254 Reference Software Version JM13.2,' http://iphome.hhi.de/suehring/tml, May 2008
  13. G. Bjontegaard, 'Calculation of average PSNR differences between RD-curves,' ITU-T SG16 Q.6, VCEG-M33, Texas, USA, April 2001
  14. 호요성, 이천, '3차원 비디오 부호화를 위한 국제 표준화 기술,' 방송공학회지, 제14권 제2호, pp.31 - 44, June, 2009
  15. Y. Mori, N. Fukushima, T. Yendo, T. Fujii, M. Tanimoto, 'View generation with 3D warping using depth information for FTV,' Image Communication, v.24, n.1-2, p.65-72, January, 2009 https://doi.org/10.1016/j.image.2008.10.013
  16. M. Tanimoto, T. Fujii, 'View synthesis algorithm in view synthesis reference software 2.0 (VSRS 2.0),' ISO/IEC JTC 1/SC29/WG11 M16090. Lausanne, Switzerland, Feb. 2009