
Wavelet-based Pitch Detector for 2.4 kbps Harmonic-CELP Coder  

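방상운 (Department of Radio Engineering, Chungbuk National University)
이인성 (Department of Radio Engineering, Chungbuk National University)
권오주 (Agency for Defense Development)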
Abstract
This paper presents a wavelet-based pitch detector for a 2.4 kbps Harmonic-CELP coder, together with a method for more effective waveform interpolation that selects the window shape in transition regions. A waveform interpolation coder encodes one pitch-period-sized segment of speech (a prototype segment) per frame and, for voiced frames, generates a smooth waveform by interpolating between prototype segments. However, when pitch halving or doubling occurs, harmonic synthesis from the prototype waveforms of the previous and current frames produces both waveform errors and discontinuities at the frame boundary. In addition, because the coder synthesizes the excitation in transition regions by overlap-add with a triangular window, Harmonic-CELP cannot model speech whose amplitude rises abruptly: the synthesized waveform increases only linearly. To detect the pitch period precisely, we use a hybrid first-stage pitch detector and refine its estimate with a second-stage ACF (autocorrelation function) pitch detector. To adapt the excitation window, we detect the onset and offset within a frame using the GCI (glottal closure instant). As a result, pitch doubling is eliminated, and the pitch error rate decreases by 5.4% compared with the ACF detector and by 2.66% compared with the wavelet detector. The MOS (mean opinion score) improves by 0.13 in transition regions.
Keywords
Prototype waveform interpolation coder; Wavelet; Pitch detection;
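The abstract only names the second-stage ACF refinement, so the following is a minimal sketch of what such a stage might look like, not the authors' implementation. It assumes 8 kHz sampling, a lag range of 20 to 147 samples, a search of ±3 samples around the first-stage estimate, and a simple check against pitch doubling. The function names `normalized_acf` and `refine_pitch` and every threshold are hypothetical choices made for illustration.

```python
import math


def normalized_acf(frame, lag):
    """Normalized autocorrelation of `frame` at a positive integer `lag`."""
    n = len(frame) - lag
    if n <= 0:
        return 0.0
    num = sum(frame[i] * frame[i + lag] for i in range(n))
    e0 = sum(frame[i] ** 2 for i in range(n))
    e1 = sum(frame[i + lag] ** 2 for i in range(n))
    den = math.sqrt(e0 * e1)
    return num / den if den > 0.0 else 0.0


def refine_pitch(frame, coarse_lag, search=3, min_lag=20, max_lag=147,
                 halving_ratio=0.9):
    """Refine a first-stage pitch lag (in samples) with the ACF.

    1. Pick the lag with the highest normalized ACF within +/- `search`
       samples of `coarse_lag`.
    2. Guard against pitch doubling: if half that lag correlates almost
       as well (>= halving_ratio * best score), prefer the shorter lag.
    All defaults are illustrative assumptions, not values from the paper.
    """
    def best_near(center):
        # Clamp the center so the search range is never empty.
        center = min(max(center, min_lag), max_lag)
        lo = max(min_lag, center - search)
        hi = min(max_lag, center + search)
        return max(range(lo, hi + 1),
                   key=lambda lag: normalized_acf(frame, lag))

    lag = best_near(coarse_lag)
    score = normalized_acf(frame, lag)
    half = lag // 2
    if half >= min_lag:
        half_lag = best_near(half)
        if normalized_acf(frame, half_lag) >= halving_ratio * score:
            lag = half_lag
    return lag


if __name__ == "__main__":
    # Synthetic voiced frame with a true pitch period of 50 samples.
    period = 50
    frame = [math.sin(2 * math.pi * n / period)
             + 0.5 * math.sin(4 * math.pi * n / period)
             for n in range(320)]
    # Simulate a first stage that doubled the pitch and is 2 samples off.
    print(refine_pitch(frame, coarse_lag=102))
```

In the usage example the first stage is assumed to report a doubled lag of 102 samples. The local search first snaps it to 100, and the half-lag check then pulls it back to the true period. On real speech a fixed `halving_ratio` can itself cause pitch halving errors, so it would need tuning against labeled data.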