Time-Domain Quantization and Interpolation of Pitch Cycle Waveform

  • Kim, Moo-Young (Dept. of Information and Communications Eng., Sejong University)
  • Published: 2008.03.31

Abstract

In this paper, a pitch cycle waveform (PCW) is extracted, quantized, and interpolated in the time domain to synthesize high-quality speech at low bit rates. A pre-alignment technique is proposed for accurate and efficient PCW extraction; it predicts the current PCW position from the previous PCW position, assuming that the pitch period evolves slowly. Because the pitch period varies from frame to frame, the original PCW is converted into a fixed-dimension PCW using a dimension-conversion method and is subsequently quantized by code-excited linear predictive (CELP) coding. The excitation signal for the linear predictive coding (LPC) synthesis filter is generated by time-domain interpolation and interlinking of the quantized PCWs. The coder operates at 4.2 kbit/s or 3.2 kbit/s, depending on the pitch period. An informal listening test demonstrates the effectiveness of the proposed coding scheme.
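
The abstract does not give the details of the dimension-conversion or interpolation steps, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes plain linear resampling to map a variable-length pitch cycle onto a fixed dimension, and a simple linear blend between two equal-length quantized PCWs. The function names `convert_dimension` and `interpolate_pcw` and the weight `alpha` are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def convert_dimension(pcw: Sequence[float], target_len: int) -> List[float]:
    """Resample one pitch cycle to target_len samples.

    Assumption: plain linear interpolation between neighbouring samples.
    The paper's actual dimension-conversion method may differ.
    """
    n = len(pcw)
    if n == 0 or target_len <= 0:
        raise ValueError("need a non-empty cycle and a positive target length")
    if n == 1 or target_len == 1:
        # Degenerate cases: repeat (or keep) the first sample.
        return [float(pcw[0])] * target_len

    step = (n - 1) / (target_len - 1)  # spacing in input-sample units
    out = []
    for k in range(target_len):
        pos = k * step
        i = min(int(pos), n - 2)       # left neighbour, clamped at the end
        frac = pos - i
        out.append((1.0 - frac) * pcw[i] + frac * pcw[i + 1])
    return out


def interpolate_pcw(prev: Sequence[float], curr: Sequence[float],
                    alpha: float) -> List[float]:
    """Blend two equal-length PCWs; alpha=0 gives prev, alpha=1 gives curr.

    Assumption: a sample-by-sample linear blend stands in for the
    time-domain interpolation mentioned in the abstract.
    """
    if len(prev) != len(curr):
        raise ValueError("PCWs must share the same fixed dimension")
    return [(1.0 - alpha) * p + alpha * c for p, c in zip(prev, curr)]


if __name__ == "__main__":
    short_cycle = [0.0, 1.0, 2.0]                 # 3-sample toy cycle
    fixed = convert_dimension(short_cycle, 5)     # stretch to 5 samples
    print(fixed)
    print(interpolate_pcw([0.0, 2.0], [2.0, 4.0], 0.5))
```

Converting every cycle to a common length is what allows a single fixed-size codebook to quantize PCWs whose pitch periods differ; the blend then produces intermediate cycles between two quantized ones. A real coder would also need to treat the cycle as periodic at its boundaries, which this sketch ignores.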
