Browse > Article

Fast Harmonic Synthesis Method for Sinusoidal Speech-Audio Model  

Kim, Gyu-Jin (Department of Radio Engineering, Chungbuk National University)
Kim, Jong-Hark (Department of Radio Engineering, Chungbuk National University)
Jung, Gyu-Hyeok (Department of Radio Engineering, Chungbuk National University)
Lee, In-Sung (Department of Radio Engineering, Chungbuk National University)
Publication Information
Abstract
Most harmonic synthesis methods using phase information employ a quadratic or cubic phase interpolation. The methods are computationally expensive to implement because every component sinewave must be synthesized on a per sample basis. In this paper, we propose a fast harmonic synthesis method for sinusoidal speech/audio coding based on the quadratic and cubic phase function to overcome the complexity problem. To derive the fast harmonic synthesis method, we define the over-sampling function and phase modulation function by constraining the parameter of phase function to be independent for harmonic index and derive the fast synthesis method using IFFT. Experimental results show that the proposed method significantly reduce the complexity of conventional cosine synthesis method while maintaining the performance.
Keywords
cubic phase; interpolation; inverse fast fourier transform(IFFT);
Citations & Related Records
연도 인용수 순위
  • Reference
1 David L. Thomson, 'Parametric Models of the Magnitude/Phase Spectrum for Harmonic Speech Coding,' ICASSP 1988, pp.378-381, 198
2 R. J. McAulay and T. F. Quatieri, 'Speech analysis/synthesis based on a sinusoidal representation,' IEEE Trans. on ASSP, vol. 34, no. 4, pp. 744-754, Aug. 1986   DOI
3 D.W.Griffm, 'Multiband Excitation Vocoder', Ph.D. dissertation, M.I.T., Cambridge, MA, 1987
4 A. McCree, K. Truong, E. B. George, T. P. Barnwell 111, and V. Viswanathan, 'A 2.4 kbit/s MELP Coder Candidate for the New U.S. Federal Standard,' in Proc. IEEE Int. Con$ ASSP, (Atlanta), pp. 200-203, May 1996
5 ISO/IEC 14496-3, 'Information Technology - Coding of Audio Visual Object, Part 3 : Audio, Subpart 2 : Parametric Coding', ISO/IEC International Standard, 2000
6 A. V. McCree and T. P. Barnwell 111, 'Mixed Excitation LPC Vocoder Model for Low Bit Rate Speech Coding,' IEEE Trans. on Speech and Audio Processing, vol. 3, pp. 242-250, July 1995   DOI   ScienceOn
7 D.W.Griffm and J.S.Lim, 'Multiband Excitation Vocoder', IEEE Trans. on Acoustics, Speech, and Signal Processing, pp1223-1235, 1988   DOI   ScienceOn
8 E. B. George and M. J. T. Smith, 'Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model,' IEEE Trans. Speech Audio Processing, vol. 5, no. 5, pp. 389-406, 1997   DOI   ScienceOn
9 Masayuki Nishiguchi, 'Harmonic vector excitation coding of speech', Acoustical Science and Technology, Vol. 27, No.6 pp.375-383, 2006   DOI   ScienceOn
10 W. B. Kleijn and K.K Paliwal, 'Speech coding and synthesis', ELSEVIER, chapter 4, 1995
11 T. F. Quatieri and R. J. McAulay, 'Phase modelling and its application to sinusoidal transform coding,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '86, vol. 3, pp. 1713-1715, Apr. 1986
12 X. Serra and J. Smith, 'Spectral modeling synthesis: A sound analysis/synthesis system based on a deterministic plus stochastic decomposition,' Computer Music journal, vol. 14, pp. 12-24, Dec. 1990