Browse > Article

Multi Mode Harmonic Transform Coding for Speech and Music  

Kim, Jonghark (Dept. of Radio Engineering, Chungbuk National University)
Shin, Jae-Hyun (Dept. of Radio Engineering, Chungbuk National University)
Lee, Insung (Dept. of Radio Engineering, Chungbuk National University)
Abstract
A multi-mode harmonic transform coding (MMHTC) for speech and music signals is proposed. Its structure is organized as a linear prediction model with an input of harmonic and transform-based excitation. The proposed coder also utilizes harmonic prediction and an improved quantizer of excitation signal. To efficiently quantize the excitation of music signals, the modulated lapped transform(MLT) is introduced. In other words, the coder combines both the time domain (linear prediction) and the frequency domain technique to achieve the best perceptual quality. The proposed coder showed better speech quality than that of the 8 kbps QCELP coder at a bit-rate of 4 kbps.
Keywords
Speech coding; Harmonic coding; CELP; Audio coding;
Citations & Related Records
연도 인용수 순위
  • Reference
1 A. V. McCree, K. Trung, E, B, George, T. P. Barnwell and V. Viswanathan, 'A 2.4 kbil/s MELP coder candidate lor the new U. S. federal standard,' Proc IEEE Int. Cont. Acoust., Speech, Signal Processing, 1, 200-203, May 1996
2 R. V. Cox, 'Speech coding standards,' Speech Coding and Synthesis, 2, W. B. Kleijn, and K. K. Paliwell Eds., Elsevier, 1995
3 R. Lefebvre, R. Salami, C. Laflamme, and J. P. Adoul, 'High quality coding of wideband audio signals using Transform Coded Excitation (TCX),' Proc, ICASSP-94, 1, 193-196, 1994
4 B. Yegnanarayana, Christophe d'Alessandro and Vassilis Darsinos, 'An iterative algorithm for decomposition of speech signals into periodic and aperiodic components'" IEEE Transaction on speech and audio processing, 6 (1), 1-11,1998   DOI   ScienceOn
5 P. Lupini, and V. Cuperman, 'Nonsquare transform vector quantization,' IEEE Signal Precessing letters, 3 (1), January 1996
6 R. Y. Qiao, 'Mixed wideband speech and music coding using a speech/music discriminator,' IEEE TENCON, 605-608, 1997
7 H. Malvar 'Fast algorithms for orthogonal and biothogonal modulated lapped transforms,' Proc IEEE Symposium, Advances in Digital Filtering and Signal Processing, 159-163, 1998
8 S. A. Ramprashad, 'A two stage hybrid embedded speech/ audio coding structure,' Proc. IEEE Int. Cont. Acount., Speech, Signal Processing, 337-340, 1998
9 R. J. McAulay, and T. F. Ouartieri, 'Sinusoidal coding,' Speech Coding and Synthesis, 4, W. B. Kleijn, and K. K. Paliwell Eds., Elsevier, 1995
10 A. M. Kondoz, 'Coding strategies and standards,' Digital Speech,5, John Wiley, 1994
11 P. J. A. OeJaco, W. Gardner and C. Lee, 'QCELP: north american COMA digital cellular variable rate speech coding standard,' Proc. IEEE Workshop on speech Coding for Telecommunications, (sainte-Adele. Quebec), 5-6, 1993
12 O. R. Ladd, and J. Terken, 'Modelling intra- and inter-speaker pitch range variation,' Proceedings at the 13th International Congress at Phonetic Sciences Stockholm (eds. EJenius, K. & Branderud, P,), 2, 386-389, 1995
13 T. Moriya, N. Iwakami, A. Jin, K. Ikeda, and S. Miki, 'A design of transform coder for both speech and audio signals at 1 bit/samples,' Proc. IEEE Int. Coni. Acount., Speech, Signal Processing, 1371-1374, 1997
14 ISO/IEC JTC1/SC29/wG11, 'Information technology-coding of audiovisual objects part 3: audio sub part2: parametric coding.' N1903PAR, 1997