[KSCI] Korea Science Citation Index Service

Fast Harmonic Synthesis Method for Sinusoidal Speech-Audio Model

Kim, Gyu-Jin (Department of Radio Engineering, Chungbuk National University)
Kim, Jong-Hark (Department of Radio Engineering, Chungbuk National University)
Jung, Gyu-Hyeok (Department of Radio Engineering, Chungbuk National University)
Lee, In-Sung (Department of Radio Engineering, Chungbuk National University)

Publication Information

Journal of the Institute of Electronics Engineers of Korea SP / v.44, no.4, 2007 , pp. 109-116 More about this Journal

Abstract

Most harmonic synthesis methods using phase information employ a quadratic or cubic phase interpolation. The methods are computationally expensive to implement because every component sinewave must be synthesized on a per sample basis. In this paper, we propose a fast harmonic synthesis method for sinusoidal speech/audio coding based on the quadratic and cubic phase function to overcome the complexity problem. To derive the fast harmonic synthesis method, we define the over-sampling function and phase modulation function by constraining the parameter of phase function to be independent for harmonic index and derive the fast synthesis method using IFFT. Experimental results show that the proposed method significantly reduce the complexity of conventional cosine synthesis method while maintaining the performance.

Keywords

cubic phase; interpolation; inverse fast fourier transform(IFFT);

Citations & Related Records

Reference

1	David L. Thomson, 'Parametric Models of the Magnitude/Phase Spectrum for Harmonic Speech Coding,' ICASSP 1988, pp.378-381, 198
2	R. J. McAulay and T. F. Quatieri, 'Speech analysis/synthesis based on a sinusoidal representation,' IEEE Trans. on ASSP, vol. 34, no. 4, pp. 744-754, Aug. 1986 DOI
3	D.W.Griffm, 'Multiband Excitation Vocoder', Ph.D. dissertation, M.I.T., Cambridge, MA, 1987
4	A. V. McCree and T. P. Barnwell 111, 'Mixed Excitation LPC Vocoder Model for Low Bit Rate Speech Coding,' IEEE Trans. on Speech and Audio Processing, vol. 3, pp. 242-250, July 1995 DOI ScienceOn
5	A. McCree, K. Truong, E. B. George, T. P. Barnwell 111, and V. Viswanathan, 'A 2.4 kbit/s MELP Coder Candidate for the New U.S. Federal Standard,' in Proc. IEEE Int. Con$ ASSP, (Atlanta), pp. 200-203, May 1996
6	ISO/IEC 14496-3, 'Information Technology - Coding of Audio Visual Object, Part 3 : Audio, Subpart 2 : Parametric Coding', ISO/IEC International Standard, 2000
7	D.W.Griffm and J.S.Lim, 'Multiband Excitation Vocoder', IEEE Trans. on Acoustics, Speech, and Signal Processing, pp1223-1235, 1988 DOI ScienceOn
8	E. B. George and M. J. T. Smith, 'Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model,' IEEE Trans. Speech Audio Processing, vol. 5, no. 5, pp. 389-406, 1997 DOI ScienceOn
9	Masayuki Nishiguchi, 'Harmonic vector excitation coding of speech', Acoustical Science and Technology, Vol. 27, No.6 pp.375-383, 2006 DOI ScienceOn
10	W. B. Kleijn and K.K Paliwal, 'Speech coding and synthesis', ELSEVIER, chapter 4, 1995
11	T. F. Quatieri and R. J. McAulay, 'Phase modelling and its application to sinusoidal transform coding,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '86, vol. 3, pp. 1713-1715, Apr. 1986
12	X. Serra and J. Smith, 'Spectral modeling synthesis: A sound analysis/synthesis system based on a deterministic plus stochastic decomposition,' Computer Music journal, vol. 14, pp. 12-24, Dec. 1990

KSCI

Fast Harmonic Synthesis Method for Sinusoidal Speech-Audio Model 정현파 음성-오디오 모델의 빠른 하모닉 합성 방법

Fast Harmonic Synthesis Method for Sinusoidal Speech-Audio Model