Browse > Article
http://dx.doi.org/10.5573/IEIESPC.2014.3.1.1

Low-latency SAO Architecture and its SIMD Optimization for HEVC Decoder  

Kim, Yong-Hwan (Smart Media Research Center, Korea Electronics Technology Institute)
Kim, Dong-Hyeok (Multimedia IP Research Center, Korea Electronics Technology Institute)
Yi, Joo-Young (Multimedia IP Research Center, Korea Electronics Technology Institute)
Kim, Je-Woo (Multimedia IP Research Center, Korea Electronics Technology Institute)
Publication Information
IEIE Transactions on Smart Processing and Computing / v.3, no.1, 2014 , pp. 1-9 More about this Journal
Abstract
This paper proposes a low-latency Sample Adaptive Offset filter (SAO) architecture and its Single Instruction Multiple Data (SIMD) optimization scheme to achieve fast High Efficiency Video Coding (HEVC) decoding in a multi-core environment. According to the HEVC standard and its Test Model (HM), SAO operation is performed only at the picture level. Most realtime decoders, however, execute their sub-modules on a Coding Tree Unit (CTU) basis to reduce the latency and memory bandwidth. The proposed low-latency SAO architecture has the following advantages over picture-based SAO: 1) significantly less memory requirements, and 2) low-latency property enabling efficient pipelined multi-core decoding. In addition, SIMD optimization of SAO filtering can reduce the SAO filtering time significantly. The simulation results showed that the proposed low-latency SAO architecture with significantly less memory usage, produces a similar decoding time as a picture-based SAO in single-core decoding. Furthermore, the SIMD optimization scheme reduces the SAO filtering time by approximately 509% and increases the total decoding speed by approximately 7% compared to the existing look-up table approach of HM.
Keywords
HEVC; SAO; Low-latency; Multi-core; SIMD;
Citations & Related Records
연도 인용수 순위
  • Reference
1 J.-Y. Yi, Y.-H. Kim, J. Park, and J.-W. Kim, "Implementation of HEVC decoder S/W using framebased multi-threading method," Proc. ITC-CSCC, Sapporo, Japan, July 2012.
2 Parveen.G.B and R. Adireddy, "Analysis and approximation of SAO estimation for CTU-level HEVC encoder," Proc. Int. Conf. VCIP, Nov. 2013. Article (CrossRefLink)
3 P. N. Subramanya, R. Adireddy, and D. Anand, "SAO in CTU decoding loop for HEVC video decoder," Proc. Int. Conf. Signal Processing and Communication, December 2013. Article (CrossRefLink)
4 Intel, Intel 64 and IA-32 Architectures Software Developer's Manual, Volume 2, June 2013. Article (CrossRefLink)
5 T. Suzuki, G. Sullivan, and W. Wan, HEVC conformance draft 5, JCTVC-O1004, 15th meeting, Geneva, CH, October 2013. Article (CrossRefLink)
6 C.-M. Fu, et al, "Sample adaptive offset in the HEVC standard," IEEE Trans. CSVT, Vol. 22, No. 12, pp. 1755-1764, December 2012. Article (CrossRefLink)
7 ITU-T Rec. H.265, High Efficiency Video Coding, ITU-T, March 2013. Article (CrossRefLink)
8 G. J. Sullivan, J.-R. Ohm, W.-J. Han, and T. Wiegand, "Overview of the High Efficiency Video Coding (HEVC) standard," IEEE Trans. CSVT, Vol. 22, No. 12, pp. 1649-1668, December 2012. Article (CrossRefLink)
9 J.-R. Ohm, G. J. Sullivan, H. Schwarz, T. K. Tan, and T. Wiegand, "Comparison of the coding efficiency of video coding standards-including High Efficiency Video Coding (HEVC)," IEEE Trans. CSVT, Vol. 22, No. 12, pp. 1669-1684, December 2012. Article (CrossRefLink)
10 JCT-VC, HEVC Test Model (HM) reference software 12.1. Article (CrossRefLink)