Performance Comparison of DCT Algorithm Implementations Based on Hardware Architecture

Lee Jae-Seong;Pack Young-Cheol;Youn Dae-Hee;

The Journal of Korean Institute of Communications and Information Sciences (한국통신학회논문지)

Volume 31 Issue 6C
/
Pages.637-644
/
2006
/
1226-4717(pISSN)
/
2287-3880(eISSN)

The Korean Institute of Commucations and Information Sciences (한국통신학회)

Performance Comparison of DCT Algorithm Implementations Based on Hardware Architecture

프로세서 구조에 따른 DCT 알고리즘의 구현 성능 비교

이재성 (연세대학교 전자공학과 디지털신호처리 연구실) ;
박영철 (연세대학교 컴퓨터정보통신공학부) ;
윤대희 (연세대학교 전자공학과 디지털신호처리 연구실)

Published : 2006.06.01

PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

This paper presents performance and implementation comparisons of standard and fast DCT algorithms that are commonly used for subband filter bank in MPEG audio coders. The comparison is made according to the architectural difference of the implementation hardware. Fast DCT algorithms are known to have much less computational complexity than the standard method that involves computing a vector dot product of cosine coefficient. But, due to structural irregularity, fast DCT algorithms require extra cycles to generate the addresses for operands and to realign interim data. When algorithms are implemented using DSP processors that provide special operations such as single-cycle MAC (multiply-accumulate), zero-overhead nested loop, the standard algorithm is more advantageous than the fast algorithms. Also, in case of the finite-precision processing, the error performance of the standard method is far superior to that of the fast algorithms. In this paper, truncation errors and algorithmic suitability are analyzed and implementation results are provided to support the analysis.

본 논문에서는 MPEG 오디오 부호화 과정 중 서브밴드 필터뱅크를 구현하기 위해 사용되는 DCT(Discrete Cosine Transform) 과정에 대해 구현 시스템의 구조에 따른 DCT 알고리즘의 구현 결과와 성능 차이를 분석한다. 고속 DCT 알고리즘은 코사인 계수의 내적을 통해 구하는 직접 구현 방법보다 연산량이 현저하게 적은 것으로 알려져 있지만, 피연산자의 어드레스가 불규칙적이고 출력 데이터를 재정렬하는 과정이 필요하기 때문에 규칙성이 결여되며, 재정렬만을 위한 추가적인 연산이 필요한 경우도 있다. 따라서 DSP와 같이 반복적인 연산을 고속으로 수행하기 위해 최적화된 구조의 하드웨어에서는 알고리즘의 규칙성이 높은 직접 구현 방법에 비해 고속 알고리즘이 불리한 측면이 있으며, 더욱이 유효 자리수를 제한하는 경우, 직접 구현 방법에 비해 더 많은 프로세싱 단계를 거쳐야 하므로 누적 오차가 커진다. 본 논문에서는 알고리즘의 규칙성과 각 프로세서의 연산 방법간의 관계와 유효 자리수에 따른 누적 오차를 분석하고 프로세서의 구조에 따른 고속 알고리즘의 선택 기준을 제시하였다.

Keywords

References

ISO/IEC JTCl/SC29/WG11 MPEG, International Standard ISO/IEC IS 13818-3 'Information technology Generic Coding of Moving Pictures and Associated Audio:', Part 3: Audio, 1994
Yingbiao Yao, and et al., 'Embedded Software Optimization for MP3 Decoder Implemented on RISC Core,' IEEE Trans. on Consumer Electronics, Vol. 50, No. 4, Nov. 2004
K. Konstantinides, 'Fast Subband Filtering in MPEG AUDIO Coding.', IEEE Signal Processing Letters, vol. I, no.2, pp. 26-28, Feb., 1994 https://doi.org/10.1109/97.300309
P. Duhamel, Y. Mahieux, and J. P. Petit, 'A fast algorithm for the implementation of filter banks based on time domain aliasing cancellation,' in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing '91,Toronto, ON, Canada, May 1991, pp. 2209-2212
V. Britanak and K. R. Rao, 'An efficient implementation of the forward and inverse MDCT in MPEG audio coding,' IEEE Signal Processing Lett, vol. 8, pp. 4851, Feb. 2001 https://doi.org/10.1109/97.895372
Vijay K. Madisetti, 'VLSI Digital Signal Processors- An Introduction to Rapid Prototyping and Design Synthesis', IEEE Press, 1995
Berkeley Design Technology, Inc., 'Choosing a DSP Processor', http://www.BDTI.com
N. Ahmed, T. Natarajan, and K. R. Rao, 'Discrete cosine transform,' IEEE Trans. Comput., vol. C-23, pp.90-94., Jan. 1974 https://doi.org/10.1109/T-C.1974.223784
B. D. Tseng and W. C. Miller, 'On computing the discrete cosine transform,' IEEE Trans. Comput., Vol. C-27, pp.966-968, Oct., 1978 https://doi.org/10.1109/TC.1978.1674977
M. J. Narasimha and A. M. Peterson, 'On the computation of the discrete cosine transform,' IEEE Trans. Commun., vol. COM-26, pp. 934-946, June 1978
B. G. Lee, 'A NewAlgorithm to Compute the Discrete Cosine Transform', IEEE Trans, on Acoust., Speech, Signal Processing, pp.1243-1245, Dec. 1984
김준석 'DSP 코어 기반 오디오 프로세서의 설계', 연세 대학교 박사 학위 논문, 2000,12
http://www.zaram.com/product1.htm
http://arm.com/products/CPUs/families/ARM9Family.html
Alan V. Oppenheim, 'Discrete time Signal Processing', Prentice hall, 1999
C. Loeffler, A. Ligtenberg, and G.S. Moschytz, 'Practical Fast 1D DCT Algorithms with 11 Multiplications,' Proc. Int'l Conf. Acoustics, Speech and Signal Processing, Vol. 2, IEEE, New York, 1989, pp. 988-991

The Journal of Korean Institute of Communications and Information Sciences (한국통신학회논문지)

Performance Comparison of DCT Algorithm Implementations Based on Hardware Architecture

프로세서 구조에 따른 DCT 알고리즘의 구현 성능 비교

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)