Adaptive Multiview Video Coding Scheme Based on Spatiotemporal Correlation Analyses

  • Zhang, Yun (Institute of Computing Technology, Chinese Academy of Sciences and Graduate School of Chinese Academy of Sciences) ;
  • Jiang, Gang-Yi (Faculty of Information Science and Engineering, Ningbo University) ;
  • Yu, Mei (Faculty of Information Science and Engineering, Ningbo University) ;
  • Ho, Yo-Sung (Department of Information and Communications, Gwangju Institute of Science and Technology)
  • 투고 : 2008.06.19
  • 심사 : 2009.02.10
  • 발행 : 2009.04.30

초록

In this paper, we propose an adaptive multiview video coding scheme based on spatiotemporal correlation analyses using hierarchical B picture (AMVC-HBP) for the integrative encoding performances, including high compression efficiency, low complexity, fast random access, and view scalability, by integrating multiple prediction structures. We also propose an in-coding mode-switching algorithm that enables AMVC-HBP to adaptively select a better prediction structure in the encoding process without any additional complexity. Experimental results show that AMVC-HBP outperforms the previous multiview video coding scheme based on H.264/MPEG-4 AVC using the hierarchical B picture (MVC-HBP) on low complexity for 21.5%, on fast random access for about 20%, and on view scalability for 11% to 15% on average. In addition, distinct coding gain can be achieved by AMVC-HBP for dense and fast-moving sequences compared with MVC-HBP.

키워드

참고문헌

  1. M. Tanimoto et al., “Proposal on Requirements for FTV,” ISO/IEC MPEG & ITU-T VCEG, JVT-W127, California, USA, Apr. 2007.
  2. Survey of Algorithms Used for MVC, ISO/IEC JTC1/SC29/WG11, N6909, Hong Kong, China, Jan. 2005.
  3. Y.S. Ho and K.J. Oh, “Overview of Multi-view Video Coding,” 14th Int'l Workshop Syst., Signals and Image Processing (IWSSIP), Maribo, Slovenia, June 2007, pp. 5-12.
  4. R. Koenen, “Overview of the MPEG-4 Standard,” ISO/IEC JTC1/SC29/WG11, N4030, Singapore, Mar. 2001.
  5. J.-W. Kang et al., “Graph Theoretical Optimization of Prediction Structure in Multiview Video Coding,” Proc. of IEEE Int'l Conf. Image Proc (ICIP), vol. 6, San Antonio, Sept. 2007, pp. 429-432.
  6. S. Oka, T. Endo, and T. Fujii, “Dynamic Ray-Space Coding Using Multi-directional Picture,” IEICE Technical Report, Dec. 2004, pp. 15-20.
  7. K. Yamamoto et al., “Multiview Video Coding Using View Interpolation and Color Correction,” IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 11, Nov. 2007. pp. 1436-1449.
  8. M. Kitahara et al., “Multi-view Video Coding Using View Interpolation and Reference Picture Selection,” Proc. IEEE Int'l Conf. Multimedia & Expo, Toronto, Canada, July 2006, pp. 97-100.
  9. P. Merkle et al., “Efficient Prediction Structures for Multiview Video Coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 17, no. 11, Nov. 2007. pp. 1461-1473.
  10. Description of Core Experiments in MVC, ISO/IEC JTC1/SC29/WG11, w8019, Montreux, Switzerland, Apr. 2006.
  11. H. Schwarz, M. Wien, and J. Vieron, JSVM Software Manual, Joint Video Team, ISO/IEC MPEG & ITU-T VCEG, JVT-S070, Geneva, Switzerland, Apr. 2006.
  12. Requirements on Multi-view Video Coding v.8, ISO/IEC JTC1/SC29/WG11, N9163, Lausanne, Switzerland, Jul. 2007.
  13. M. Yu et al., “Bandwidth Distortion Model for MVC in Interactive System,” ISO/IEC MPEG & ITU-T VCEG, JVTY027, Shenzhen, China, Oct. 2007.
  14. S. Shimizu et al., “View Scalable Multiview Video Coding Using 3-D Warping with Depth Map,” IEEE Trans. Circuits and Syst. for Video Technol., vol. 17, no. 11, Nov. 2007, pp. 1485-1495. https://doi.org/10.1109/TCSVT.2007.903773
  15. Y. Liu et al., “Low-Delay View Random Access for Multi-view Video Coding,” Proc. IEEE Int'l Symp. on Circuits and Syst. (ISCAS), New Orleans, USA, May 2007, pp. 997-1000.
  16. R. Kawada, “KDDI Multiview Video Sequences for MPEG 3DAV,” ISO/IEC JTC1/SC29/WG11, M10533, Munich, Germany, Mar. 2004.
  17. A. Vetro et al., “Multiview Video Test Sequences from MERL for the MPEG Multiview Working Group,” ISO/IEC JTC1/SC29/WG11, M12077, Busan, Korea, Apr. 2005.
  18. Y. Zhang et al., “An Approach to Multi-modal Multi-view Video Coding,” Proc. of Int'l Conf. Signal Processing (ICSP), vol. 2, Guilin China, Nov. 2006. pp. 1405-1408.
  19. U. Fecker and A. Kaup, “Statistical Analysis of Multi-Reference Block Matching for Dynamic Light Field Coding,” Proc. Vision, Modeling and Visual. (VMV), Erlangen, Germany, Nov. 2005, pp. 445-452.
  20. K. Sohn et al., “Results on CE1 for Multi-view Video Coding,” ISO/IEC MPEG & ITU-T VCEG, JVT-T102, Klagenfurt, Austria, July 2006.
  21. H. Schwarz, D. Marpe, and T. Wiegand, “Hierarchical B Pictures,” ISO/IEC MPEG & ITU-T VCEG, JVT-P014, Poznan, Poland, July 2005.
  22. Y. Zhang, M. Yu, and G.Y. Jiang, “Evaluation of Typical Prediction Structures for Multi-view Video Coding,” ISAST Trans. Electronics and Signal Processing, vol. 2, no. 1, 2008, pp. 7-15.