Low Rate Speech Coding Using the Harmonic Coding Combined with CELP Coding

하모닉 코딩과 CELP방법을 이용한 저 전송률 음성 부호화 방법

  • Published : 2000.04.01

Abstract

In this paper, we propose a 4kbps speech coder that combines the harmonic vector excitation coding with time-separated transition coding. The harmonic vector excitation coding uses the harmonic excitation coding in the voiced frame and uses the vector excitation coding with the structure of analysis-by-synthesis in the unvoiced frame, respectively. But two mode coding method is not effective for transition frame mixed in voiced and unvoiced signal and a new method beyond using unvoiced/voiced mode coding is needed. Thus, we designed a time-separated transition coding method for transition frame in which a voiced/unvoiced decision algorithm separates unvoiced and voiced duration in a frame, and harmonic-harmonic excitation coding and vector-harmonic excitation coding method is selectively used depending on the previous frame U/V decision. In the decoder, the voiced excitation signals are generated efficiently through the inverse FFT of harmonic magnitudes and the unvoiced excitation signals are made by the inverse vector quantization. The reconstructed speech signal are synthesized by the Overlap/Add method.

본 논문에서는 선형예측 잔여신호에 대한 하모닉 벡터 여기 코딩에, 시간 대역 분리 혼합 코딩을 결합한 4kbps 음성코더를 제안한다. 하모닉 벡터 여기 코딩은 유성음 구간에서 하모닉 여기 코딩을 사용하며, 무성음 구간에 대해서는 분석-합성 구조의 벡터 여기 코딩을 사용한다. 그러나, 이러한 양단 모드 코딩 방법은 유성음과 무성음이 혼재하는 전이 구간에서는 비효과적이므로, 유/무성음 모드 코딩 이외의 새로운 방법이 요구된다. 이에, 전이 구간을 위한 시간 분리 전이 코딩을 설계하였으며, 여기서, 유/무성음 결정 알고리즘은 단위 구간 내의 유성음과 무성음의 존속기간을 결정하고, 이전 구간의 유/무성음 결정에 따라 하모닉-하모닉 코딩과 벡터-하모닉 코딩을 선택적으로 사용한다. 복호화기에서는 하모닉 크기값들의 IFFT 과정을 통해 유성음 여기신호가 효과적으로 합성되며, 무성음 여기신호는 역 벡터 양자화를 통해 만들어진다. 재 복원된 음성 신호는 중첩합산 방법에 의해 합성된다.

Keywords

References

  1. Speech Coding and Synthesis Speech Coding Standards R. V. Cox;W. B. Kleijn(ed.);K. K. Paliwell(ed.)
  2. IEEE Signal Pocessing Magazine The Past, Present, and Future of Speech Processing D. Childer;R. V. Cox;R. DeMori.(etc.);B. H. Juang(ed.)
  3. Digital Speech Coding Strategies and Standards A. M. Kondoz
  4. Proc. ICASSP-96 v.1 A 2.4kbit/s MELP Coder Candidate for the New U.S. Federal Standard A. V. McCree;K. Trung;E, B, George;T. P. Banwell;V. Viswanathan
  5. Proc. IEEE Workshop on Speech Coding For Telecommunications Harmonic vector excitation coding of speech at 2.0 kbps L. Nishiguchi;K. Iijima;J. Matsumoto
  6. Speech Coding and Synthesis Sinusoidal coding R. J. McAulay;T. F. Quartieri;W. B. Kleijn(ed.);K. K. Paliwell(ed.)
  7. IEEE Transaction on Acoustic, Speech, and Signal Processing v.1 no.ASSP-34 Speech ananlysis/synthesis based on a sinusoidal representation R. J. McAulay;T. F. Quatieri
  8. Proc. IEEE ICASSP-88 Phase compensation in all-pole speech analysis P. Hedelin
  9. Proc. ICASSP-98 Combined Harmonic and Waveform Coding of Speech at Low Bit Rate E. Shlomot;Vladimir Cuperman;A. Gersho
  10. Proc. ICASSP-95 Harmonic and Noise Coding of LPC Residuals with Classified Vector Quantization Masayuki;Nishiguchi;J. Matsumotor
  11. Digital Speech Code-Excited Linear Predictive Coding A. M. Kondoz
  12. IEEE Transaction on Acoustic, Speech, and Signal Processing v.6 no.2 Design and Descrition of CS-ACELP: A Toll Quality 8 kb/s Speech Coder R. Salami;C. Laflamme;J. Adoul.(etc.)
  13. Proc. IEEE Workshop on speech Coding for Telecommunications QCELP: North American CDMA digital cellular variable rate speech coding standard P. J. A. DeJaco;W. Gardner;C. Lee
  14. Proc. Military Speech Tech. The new 4800 bps voice coding standard J. P. Campbell;V. C. Welch;T. E. Tremain