Fast Intra Mode Decision for H.264/AVC by Using the Approximation of DCT Coefficient

H.264/AVC에서 DCT 계수의 근사화를 이용한 고속 인트라 모드 결정 기법

  • La, Byeong-Du (Dept., Electrical & Electronics Eng. Yonsei University) ;
  • Eom, Min-Young (Dept., Electrical & Electronics Eng. Yonsei University) ;
  • Choe, Yoon-Sik (Dept., Electrical & Electronics Eng. Yonsei University)
  • 라병두 (연세대학교 전기전자공학과) ;
  • 엄민영 (연세대학교 전기전자공학과) ;
  • 최윤식 (연세대학교 전기전자공학과)
  • Published : 2007.05.25

Abstract

The H.264/AVC video coding standard uses rate distortion optimization (RDO) method to improve the compression performance in the intra prediction. The complexity and computational load are increased more than previous standard by using this method, even though this standard selects the best coding mode for the current macroblock. This paper proposes a fast intra mode decision algorithm for H.264/AVC encoder based on dominant edge direction (DED). To apply the idea, this algorithm uses the approximation of discrete cosine transform (DCT) coefficient. By detecting the DED, 3 modes instead of 9 modes are chosen for RDO calculation to decide the best mode in the $4{\times}4$ luma block. As for the $16{\times}16$ luma and $8{\times}8$ chroma block, instead of 4 modes, only 2 modes are searched. Experimental results show that the computation time of the proposed algorithm is decreased to about 72% of the full search method with negligible quality loss.

H.264 영상 부호화 표준은 인트라 예측에서 압축 효율을 향상시키기 위해 율-왜곡 최적화(RDO : Rate Distortion Optimization) 방법을 사용한다. 이러한 방법을 사용함으로써 현재 블록에 대한 최적의 부호화 모드의 선택이 가능해졌지만 복잡도와 연산은 이전대비 더욱 증가하였다. 본 논문은 우세한 에지 방향(DED : Dominant Edge Direction)의 예측을 통한 고속인트라 모드 결정 알고리즘을 제안한다. 이를 위해 이 알고리즘은 이산 코사인 변환(DCT : Discrete Cosine Transform) 계수를 근사화하여 이용한다. DED를 예측함으로써 $4{\times}4$ 휘도 블록의 경우 최적 모드 결정을 위한 율-왜곡 최적화 계산에 9개 모드 중 3개 모드가 선택된다. $16{\times}16$ 휘도 블록과 $8{\times}8$ 색상 블록의 경우 4개 모드 대신에 2개 모드가 최적 모드 결정을 위해 율-왜곡 최적화 계산을 수행한다. 이러한 방법을 이용한 실험 결과 인트라 전체 검색 방법대비 약 72%의 연산시간이 감소하는 결과를 보여준다.

Keywords

References

  1. Thomas Wiegand, Gary J. Sullivan, Gisle Bjontegaard, and Ajay Luthra, 'Overview of the H.264/AVC Video Coding Standard,' IEEE Transaction on Circuits and System for Video Technology, Vol. 13, No.7, pp.560-576, July 2003 https://doi.org/10.1109/TCSVT.2003.815165
  2. Iain E.G. Richardson, 'H.264 and MPEG-4 Video Compression,' The Robert Gordon University, Aderdeen, 2004
  3. Lidong Xu, Xinggang Lin, 'Fast mode decision for inter frames in H.264/AVC,' IEEE International Symposium on Communications and Information Technology, Volume1, pp.433-436 Oct. 2005 https://doi.org/10.1109/ISCIT.2005.1566886
  4. Shen Gao, Tiejun Lu, 'An improved fast mode decision algorithm in H.264 for video communications,' International Symposium on Systems and Control in Aerospace and Astronautics, pp.57-60, Jan 2006 https://doi.org/10.1109/ISSCAA.2006.1627702
  5. Jeyun Lee, Byeungwoo Jeon, 'Fast mode decision for H.264,' IEEE International Conference on Multimedia and Expo, Volume 2, pp.1131-1134, June 2004
  6. Chao-Hsuing Tseng, Hung-Ming Wang, and Jar-Ferr Yang, 'Enhanced Intra-4x4 Mode Decision for H.264/AVC Coders,' Circuits and Systems for Video Technology, IEEE Transactions on Volume 16, Issue 8, pp. 1027-1032, Aug. 2006 https://doi.org/10.1109/TCSVT.2006.878146
  7. Meng, B, Au, Oscar. C, 'Fast intra-prediction mode selection for 4x4 blocks in H.264,' IEEE International Conference on Acoustics, Speech, and Signal Processing, Volume 3, pp.389-392 vol.3, April 2003 https://doi.org/10.1109/ICASSP.2003.1199493
  8. Rui Su, Guizhong Liu, Tongyu Zhang, 'Fast Mode Decision Algorithm for Intra Prediction in H.264/AVC,' ICASSP 2006 Proceedings, IEEE International Conference on Acoustics, Speech and Signal Processing, Volume 2, pp.921-924, May 2006 https://doi.org/10.1109/ICASSP.2006.1660494
  9. Liao Ning, Quan Zi Yi, Men Ai Dong, 'Enhanced fast mode decision based on edge map and motion detail analysis for H.264/JVT,' IEEE International Workshop on VLSI Design and Video Technology, pp.187-190, May 2005 https://doi.org/10.1109/IWVDVT.2005.1504582
  10. Zhang Yong-dong, Dai Feng, Lin Shou-xun, 'Fast 4x4 intra-prediction mode selection for H.264,' IEEE International Conference on Multimedia and Expo, Volume 2, pp.1151-1154, June 2004
  11. Chao-Chung Cheng, Tian-Sheuan Chang, 'Fast three step intra prediction algorithm for 4x4 blocks in H.264,' IEEE International Symposium on Circuits and Systems, pp.1509-1512 Vol. 2. May 2005 https://doi.org/10.1109/ISCAS.2005.1464886
  12. Changsung Kim, H.-H. Shih, C.-C. J Kuo, 'Feature-based intra-prediction mode decision for H.264,' International Conference on Image Processing Volume 2, pp.769-772 Vol.2, Oct 2004 https://doi.org/10.1109/ICIP.2004.1419411
  13. Yu-Wen Huang, Bing-Yu Hsieh, Tung-Chien Chen, Liang-Gee Chen, 'Analysis, fast algorithm, and VLSI architecture design for H.264/AVC intra frame coder,' IEEE Transactions on Circuits and Systems for Video Technology, Volume 15, Issue 3, pp.378-401, March 2005 https://doi.org/10.1109/TCSVT.2004.842620
  14. Jamil-Ur-Rehman, Ye Zhang, 'Fast intra prediction mode decision using parallel processing,' Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Volume8, pp.5094-5098, Aug 2005 https://doi.org/10.1109/ICMLC.2005.1527841
  15. F. Pan, X. Lin, S. Rahardja, K. P. Lim, Z. G. Li, D. Wu, and S. Wu, 'Fast mode decision algorithm for intra prediction in H.264/A VC video coding,' IEEE Transactions on Circuits and Systems for Video Technology, vol. 15, no. 7, pp.813-822, July 2005 https://doi.org/10.1109/TCSVT.2005.848356
  16. Jhing-Fa Wang, Jia-Ching Wang, Jang-Ting Chen, An-Chao Tsai, and Anand Paul, 'A Novel Fast Algorithm for Intra Mode Decision in H.264/AVG Encoders,' ISCAS2006, pp.3498-3501, July 2006 https://doi.org/10.1109/ISCAS.2006.1693380
  17. Seong-Whan Lee, Young-Min Kim, and Sung Woo Choi, 'Fast Scene Change Detection using Direct Feature Extraction from MPEG Compressed Videos,' IEEE Transactions On Multimedia, Vol. 2, No. 4, pp.240-254, Dec 2000 https://doi.org/10.1109/6046.890059
  18. Minyoung Eom, Yoonsik Choe, 'Fast Extraction of Edge Histogram in DCT Domain based on MPEG7,' Transactions on Engineering, Computing and Technology V9, pp.209-212, Nov 2005