http://dx.doi.org/10.5909/JBE.2009.14.5.543

A New Overlap Save Algorithm for Fast Convolution  

Kuk, Jung-Gap (Department of Electrical Engineering and Computer Science, Seoul National University, Institute of New Media & Communications)
Cho, Nam-Ik (Department of Electrical Engineering and Computer Science, Seoul National University, Institute of New Media & Communications)
Publication Information
Journal of Broadcast Engineering / v.14, no.5, 2009, pp. 543-550
Abstract
The most widely used block convolution method is the overlap save algorithm (OSA), in which a block of M data samples to be convolved with a filter is concatenated with the previous block, and a 2M-point FFT and the associated multiplications are performed on this overlapped block. By discarding half of the results, linear convolution results are obtained from the circular convolution. This paper proposes a new transform that reduces the block size to only M for block convolution. The proposed transform can be implemented as M multiplications followed by an M-point FFT. Hence, existing efficient FFT libraries and hardware can be exploited to implement the proposed method. Since the required transform size is half that of the conventional method, the overall computational complexity is reduced. The reduced transform size also lowers the data access time and the cache miss rate, and thus the overall CPU time is reduced. Experiments show that the proposed method requires less computation time than the conventional OSA.
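The conventional OSA summarized above can be illustrated with a minimal sketch. It covers only the baseline 2M-point scheme, not the proposed M-point transform, whose details are not given in the abstract. The function name `overlap_save`, the use of NumPy, real-valued input, and the restriction of the filter length to at most M + 1 taps are assumptions made for illustration.

```python
import numpy as np

def overlap_save(x, h, M):
    """Baseline overlap-save filtering with 2M-point FFTs (illustrative sketch).

    Assumes real-valued x and h, and len(h) <= M + 1 so that the last M
    samples of each 2M-point circular convolution are free of wrap-around.
    """
    x = np.asarray(x, dtype=float)
    h = np.asarray(h, dtype=float)
    if len(h) > M + 1:
        raise ValueError("filter must have at most M + 1 taps")

    N = 2 * M                          # transform size of the conventional OSA
    H = np.fft.rfft(h, N)              # filter spectrum, zero-padded to 2M points
    n_blocks = -(-len(x) // M)         # ceil(len(x) / M)

    # Prepend M zeros as the "previous block" of the first block,
    # and pad the tail so the input splits into whole blocks of M samples.
    x_pad = np.concatenate([np.zeros(M), x, np.zeros(n_blocks * M - len(x))])

    y = np.empty(n_blocks * M)
    for b in range(n_blocks):
        block = x_pad[b * M : b * M + N]            # previous block + current block
        circ = np.fft.irfft(np.fft.rfft(block) * H, N)
        y[b * M : (b + 1) * M] = circ[M:]           # discard the first half
    return y[: len(x)]
```

Under these assumptions, the result should agree with the first `len(x)` samples of direct linear convolution, for example `np.convolve(x, h)[:len(x)]`, up to floating-point rounding.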
Keywords
FFT; Overlap save algorithm; QDFT; DFT; Block convolution;