DOI QR코드

DOI QR Code

Improved Prediction Structure and Motion Estimation Method for Multi-view Video Coding

다시점 비디오 부호화를 위한 개선된 예측 구조와 움직임 추정 기법

  • 윤효순 (전남대학교 전자컴퓨터공학부) ;
  • 김미영 (전남도립대학교 보건의료학과)
  • Received : 2014.04.15
  • Accepted : 2014.09.17
  • Published : 2014.11.15

Abstract

Multi-view video is obtained by capturing one three-dimensional scene with many cameras at different positions. The computational complexity of multi view video coding increases in proportion to the number of cameras. To reduce computational complexity and maintain the image quality, improved prediction structure and motion estimation method is proposed in this paper. The proposed prediction structure exploits an average distance between the current picture and its reference pictures. The proposed prediction structure divides every GOP into several groups to decide the maximum index of hierarchical B layer and the number of pictures of each B layer. And the proposed motion estimation method uses a hierarchical search strategy. This strategy method consists of modified diamond search pattern, progressive diamond search pattern and modified raster search pattern. Experiment results show that the complexity reduction of the proposed prediction structure and motion estimation method over JMVC (Joint Multiview Video Coding) reference model using hierarchical B pictures of Fraunhofer-HHI and TZ search method can be up to 40~70% while maintaining similar video quality and bit rates.

다시점 비디오는 하나의 3차원 장면을 여러 시점에서 다수의 카메라로 촬영 영상으로 다시점 비디오 부호화의 계산량은 카메라 수에 비례하여 증가한다. 본 논문에서는 다시점 비디오 부호화의 계산량을 줄이면서 영상 화질을 유지하는 예측 구조와 움직임 추정 기법을 제안한다. 제안한 개선된 예측 구조는 B계층 최대 인덱스 그리고 각 Bi계층의 화면수를 고려하였다. 제안한 움직임 추정 기법은 계층적인 탐색 기법으로 수정된 다이아몬드 탐색 패턴, 점진적인 다이아몬드 탐색 패턴 그리고 수정된 래스터 탐색 패턴으로 구성된다. 제안한 예측 구조와 움직임 추정 기법의 성능을 Fraunhofer-HHI의 계층적 B화면 구조와 TZ 움직임 추정 기법을 사용한 JMVC 참조 모델의 성능과 비교한 경우, 영상 화질과 발생 비트량은 비슷하지만 다시점 비디오 부호화의 계산량을 40~70% 줄인다.

Keywords

Acknowledgement

Supported by : 한국연구재단

References

  1. A. Smolic, K. Mueller, P. Merkle, C. Fehn, P. kauff, P. Eisert, and T. Wiegand, "3D Video and Free Viewpoint Video - Technologies, Applications and MPEG Standards," Proc. of IEEE International Conference on Multimedia and Exposition, Jul. 2006.
  2. ITU-T Rec. & ISO/IEC 14496-10 AVC, "Advanced Video Coding for Generic Audio Visual services," 2005.
  3. ISO/IEC JTC1/SC29/WG11 N10357, "Vision on 3D Video," Feb. 2009.
  4. A. Vetro, P. Pandic, H. Kimata, A. Smolic, and Y. Wang, "Joint Multi-view Video Model (JMVM) 8.0," Joint Video Team of ISO/IEC JTC21/SC29/WG11 and ITU-T SG16 Q.6, JVT-AA207, Switzerland, Apr. 2008.
  5. P. Merkle, K. Muller, A. Smolic, and T. Wiegand, "Efficient compression of multi-view video exploiting inter-view dependencies based on H.264/MPEG4-AVC," Proc. of IEEE International Conference on Multimedia and Exposition, Jul. 2006.
  6. P. Merkle, A. Smolic, K. Muller, and T. Wiegand, "Efficient prediction structures for multiview video coding," IEEE Trans. Circuits and Systems for Video Technology, Vol. 17, No. 11, pp. 1461-1473, Nov. 2007. https://doi.org/10.1109/TCSVT.2007.903665
  7. ISO/IEC JTC1/SC29/WG11, "Joint Multiview Video Model (JMVM) 8," Doc. N9762, May, 2008.
  8. B. Zatt, M. Shafique, S. Bampi, and J. Henkel, "A Multi-Level Dynamic Complexity Reduction Scheme for Multiview Video Coding," IEEE International Conference on Image Processing(ICIP), pp. 761-764, 2011.
  9. P. K. Park, K. J. Oh, and Y. S. Ho, "Efficient viewtemporal prediction structures for multi-view video coding," Electronics Letters, Vol. 44, No. 2, pp. 102-103, 2008. https://doi.org/10.1049/el:20082082
  10. R. L. He, "A Multiview Video Coding Method with Adaptive Selection of Reference Frame Modes," Journal of Computer-aided Design And Computer Graphics, Vol. 30, No. 12, pp. 2205-2211, Dec. 2007.
  11. F. Lu, P. An, Z. Zhang, and L. Shen, "Multi-view Video Coding Based on Sequence Correlation," Audio Language and Image Processing (ICALIP), pp. 1227-1232, 2010.
  12. TANG Xiu-lil, OAT Sheng-kui, and CAT Can-hui, "An Analysis of TZSearch Algorithm in JMVC," pp. 516-519, 2010.
  13. N. Purnachand, Luis Nero Alves, and Antonio Navarro, "Improvements to TZ search motion estimation algorithm for multiview video coding," IWSSIP 2012, pp. 388-391, 2012.
  14. H. S. Yoon and G. S. Lee, "Adaptive Motion Estimation method using temporal correlation," The KIPS Transactions: Part B, Vol. 9, No. 2, pp. 199-204, 2004.
  15. 'Description of core experiments in MVC', ISO/IEC JTC1/SC29/WG11MPEG2006/W8019, Montreux, Apr. 2006.
  16. K. Muller, P. Merkle, H. Schwarz, T. Hinz, A. Smolic, T. Oelbaum, and T. Wiegand, "Multi-viewVideo Coding Based on H.264/AVC Using Hierarchical B-frames," Proc. of the Picture Coding Symposium, pp. 385-390, Apr. 2006.
  17. K. Muller, P. Merkle, A. Smolic, and T. Wiegand, ISO/IEC JTC1/SC29/WG11, Multiview Coding using AVC, JVT Doc. m12945, Bangkok, Thailand, Jan. 2006.
  18. ISO/IEC JTC1/SC29/WG11, Requirements on Multiview Video Coding v.4, Doc. N7282, Poznan, Poland, Jul. 2005.
  19. ISO/IEC JTC1/SC29/WG11, Call for Proposals on Multi-view Video Coding, Doc. N7327, Poznan, Poland, Jul. 2005.
  20. J. Daase, U. Goelz, P. Kauff, K. Mueller, O. Schreer, A. Smolic, R. Tanger, and T. Wiegand, Fraunhofer HHI Test Data Sets for MVC, ISO/IEC JTC1/SG29/WG11, Document MPEG2005/M11894, Busan, Korea, 2005.
  21. A. Vetro, M. McGuire, W. Matusik, A. Behrens, J. Lee and H. Pfister, Multiview Video Test Sequences from MERL, ISO/IEC JTC1/SG29/WG11, Document MPEG 200/M12077, Busan, Korea 2005.
  22. G. Bjontegaard, "Calculation of Average PSNR Differences Between RD-curves," document VCEG-M33, ITU-T Video Coding Experts Group (VCEG) Meeting, Austin, TX, Apr. 2001.