DOI QR코드

DOI QR Code

A New Hardware Architecture of High-Speed Motion Estimator for H.264 Video CODEC

H.264 비디오 코덱을 위한 고속 움직임 예측기의 하드웨어 구조

  • Received : 2010.10.11
  • Accepted : 2011.03.10
  • Published : 2011.03.30

Abstract

In this paper, we proposed a new hardware architecture for motion estimation (ME) which is the most time-consuming unit among H.264 algorithms and designed to the type of intellectual property (IP). The proposed ME hardware consists of buffer, processing unit (PU) array, SAD (sum of absolute difference) selector, and motion vector (MVgenerator). PU array is composed of 16 PUs and each PU consists of 16 processing elements (PUs). The main characteristics of the proposed hardware are that current and reference frames are re-used to reduce the number of access to the external memory and that there is no clock loss during SAD operation. The implemented ME hardware occupies 3% hardware resources of StatixIII EP3SE80F1152C2 which is a FPGA of Altera Inc. and can operate at up to 446.43MHz. Therefore it can process up to 50 frames of 1080p in a second.

본 논문에서는 H.264/AVC 인코더에서 가장 많은 연산 시간이 소요되는 움직임 추정(motion estimation, ME) 동작을 위한 하드웨어의 구조를 제안하고 IP(intellectual property) 형태로 구현하였다. 고속 움직임 추정기의 구조는 버퍼(buffer), PU 어레이(processing unit array), SAD 선택기(SAD selector), MV 생성기(motion vector generator) 등으로 구성되어 있다. PU 어레이는 16개의 PU로 구성되어 있고, 각각의 PU는 16개의 PE(processing element)로 이루어져 있다. 제안한 하드웨어의 동작적인 특징은 외부메모리 접근량을 줄이기 위해 현재와 참조프레임의 데이터를 재사용한다는 것과 SAD연산을 수행할 때 클록의 손실 없이 계산을 할 수 있다는 것이다. 구현한 고속 움직임 추정기는 Altera 사의 FPGA인 StatixIII EP3SE80F1152C2에서 3%의 자원을 사용하였고, 최대 동작주파수는 446.43MHz이었다. 따라서 구현한 하드웨어는 1080p 영상을 최대 50fps로 처리할 수 있다.

Keywords

References

  1. J.-C. Tuan, T.-S. Chang, and C.-W. Jen, "On the data reuse and memory bandwidth analysis for full-search block-matching VLSI architecture," IEEE Trans. Circuits Syst. Video Technol., vol. 12, no. 1, pp. 61-72, Jan. 2002. https://doi.org/10.1109/76.981846
  2. Y. Su and M.-T. Sun, ""Fast multiple reference frame motion estimation for H.264/AVC,"" IEEE Trans. Circuits Syst. Video Technol., vol. 16, no. 3, pp. 447-452, Mar. 2006. https://doi.org/10.1109/TCSVT.2006.869970
  3. Y. Su and M.-T. Sun, ""Fast multiple reference frame motion estimation for H.264/AVC,"" IEEE Trans. Circuits Syst. Video Technol., pp. 447-452, Mar. 2006. https://doi.org/10.1109/TCSVT.2006.869970
  4. Z. Liu, L. Li, Y. Song, T. Ikenaga, and S. Goto, ""VLSI oriented fast multiple reference frame motion estimation algorithm for H.264/AVC,"" in Proc. IEEE Int. Conf. Multimedia Expo, Beijing, China, pp. 1902-1905, Jul. 2007. https://doi.org/10.1109/ICME.2007.4285047
  5. S.Y. Yap and J.V. McCanny, "A VLSI Architecture for Variable Block Size, Video Motion Estimation", IEEE Trans. Circuits and Systems, vol.51, pp.384-389, 2004. https://doi.org/10.1109/TCSII.2004.829555
  6. S.Lopez, F.Tobajas, A.Villar, V. de Armas, J.F.Lopez, and R.Sarmiento,"Low cost efficient architecture for H.264 motion estimation", IEEE International Symposium on Circuits and Systems (ISCAS) Kobe, Japan, pp.412-415, May 23-26, 2005 https://doi.org/10.1109/ISCAS.2005.1464612
  7. Cao Wei, Mao Zhi Gang, "A novel VLSI Architecture for VBSME in MPEG-4 AVC/H.264"IEEE International Symposium on Circuits and Systems(ISCAS) Lobe, Japan, pp.1794-1797, May 23-26, 2005 https://doi.org/10.1109/ISCAS.2005.1464957
  8. C.-Y.Chen, S.-Y.Chien, Y.-W.Huang, T.-C.Chen, T.-C.Wang, L.-G.Chen,"Analysis and Architecture Design of Variable Block-Size Motion Estimation for H.264/AVC", IEEE Transactions on Circuits and Systems I: Regular Papers, 16 vol.53, pp.578-593, 2006 https://doi.org/10.1109/TCSI.2005.858488
  9. Chien-Min Ou, Chian-Feng Le, Wen-Jyi Hwang, "An efficient VLSI architecture for H.264 variable block size motion estimation", IEEE Transactions on Consumer Electronics, vol.51, pp.1291-1299, 2005 https://doi.org/10.1109/TCE.2005.1561858
  10. J.-C. Tuan, T.-S. Chang, and C.-W. Jen, "On the data reuse and memory bandwidth analysis for full-search block-matching VLSI architecture", IEEE Trans. Circuits Syst. Video Technol., Vol.12, pp. 61-72, 2002. https://doi.org/10.1109/76.981846
  11. Iain E. G. Richardson "H. 264 and MPEG-4 Video Compression" 2003
  12. T. Wiegand, G. Sullivan, G. Bjontegaard, and A. Luthra, "Overview of the H.264/AVC video coding standard", IEEE Trans. Circuits and Systems for Video Technology, vol.13, no.7, pp. 560-576, Jul. 2003. https://doi.org/10.1109/TCSVT.2003.815165
  13. "Draft ITU-T recommendation and final draft international standard of joint video specification (ITU-T Rec. H.264/ISO/IEC 14 496-10 AVC,"" in Joint Video Team (JVT) of ISO/IEC MPEG and ITU-T VCEG, JVTG050, 2003.
  14. Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification, ITU-T Rec. H.264 and ISO/IEC 14496-10 AVC, May 2003.
  15. J. Ostermann, T. Wedi, et al., "Video coding with H.264/AVC: tools, performance, and complexity," IEEE Circuits and Systems Magazine, vol. 4, pp.7-28, 2004. https://doi.org/10.1109/MCAS.2004.1286980
  16. Heejun Shim,Chong-Min Kyung, "Selective Search Area Reuse Algorithm for Low External Memory Access Motion Estimation", IEEE Transactions On Circuits And Systems For Video Technology, VOL. 19, NO. 7, July 2009
  17. Kthiri, M.; Kadionik, P.; Levi, H.; Loukil, H.; Ben Atitallah, A.; Masmoudi, N., "Hardware implementation of fast block matching algorithm in FPGA for H.264/AVC ", Systems, Signals and Devices, 2009. SSD '09. 6th International Multi-Conference, 2009
  18. Kthiri, M.; Kadionik, P.; Levi, H.; Loukil, H.; Ben Atitallah, A.; Masmoudi, N., "An FPGA implementation of motion estimation algorithm for H.264/AVC", I/V Communications and Mobile Network (ISVC), 2010 5th International Symposium on, 2010