[KSCI] Korea Science Citation Index Service

Multi Mode Harmonic Transform Coding for Speech and Music

Kim, Jonghark (Dept. of Radio Engineering, Chungbuk National University)
Shin, Jae-Hyun (Dept. of Radio Engineering, Chungbuk National University)
Lee, Insung (Dept. of Radio Engineering, Chungbuk National University)

Publication Information

The Journal of the Acoustical Society of Korea / v.22, no.3E, 2003 , pp. 101-109 More about this Journal

Abstract

A multi-mode harmonic transform coding (MMHTC) for speech and music signals is proposed. Its structure is organized as a linear prediction model with an input of harmonic and transform-based excitation. The proposed coder also utilizes harmonic prediction and an improved quantizer of excitation signal. To efficiently quantize the excitation of music signals, the modulated lapped transform(MLT) is introduced. In other words, the coder combines both the time domain (linear prediction) and the frequency domain technique to achieve the best perceptual quality. The proposed coder showed better speech quality than that of the 8 kbps QCELP coder at a bit-rate of 4 kbps.

Keywords

Speech coding; Harmonic coding; CELP; Audio coding;

Citations & Related Records

Reference

1	A. V. McCree, K. Trung, E, B, George, T. P. Barnwell and V. Viswanathan, 'A 2.4 kbil/s MELP coder candidate lor the new U. S. federal standard,' Proc IEEE Int. Cont. Acoust., Speech, Signal Processing, 1, 200-203, May 1996
2	R. V. Cox, 'Speech coding standards,' Speech Coding and Synthesis, 2, W. B. Kleijn, and K. K. Paliwell Eds., Elsevier, 1995
3	R. Lefebvre, R. Salami, C. Laflamme, and J. P. Adoul, 'High quality coding of wideband audio signals using Transform Coded Excitation (TCX),' Proc, ICASSP-94, 1, 193-196, 1994
4	B. Yegnanarayana, Christophe d'Alessandro and Vassilis Darsinos, 'An iterative algorithm for decomposition of speech signals into periodic and aperiodic components'" IEEE Transaction on speech and audio processing, 6 (1), 1-11,1998 DOI ScienceOn
5	P. Lupini, and V. Cuperman, 'Nonsquare transform vector quantization,' IEEE Signal Precessing letters, 3 (1), January 1996
6	R. Y. Qiao, 'Mixed wideband speech and music coding using a speech/music discriminator,' IEEE TENCON, 605-608, 1997
7	H. Malvar 'Fast algorithms for orthogonal and biothogonal modulated lapped transforms,' Proc IEEE Symposium, Advances in Digital Filtering and Signal Processing, 159-163, 1998
8	S. A. Ramprashad, 'A two stage hybrid embedded speech/ audio coding structure,' Proc. IEEE Int. Cont. Acount., Speech, Signal Processing, 337-340, 1998
9	R. J. McAulay, and T. F. Ouartieri, 'Sinusoidal coding,' Speech Coding and Synthesis, 4, W. B. Kleijn, and K. K. Paliwell Eds., Elsevier, 1995
10	A. M. Kondoz, 'Coding strategies and standards,' Digital Speech,5, John Wiley, 1994
11	P. J. A. OeJaco, W. Gardner and C. Lee, 'QCELP: north american COMA digital cellular variable rate speech coding standard,' Proc. IEEE Workshop on speech Coding for Telecommunications, (sainte-Adele. Quebec), 5-6, 1993
12	O. R. Ladd, and J. Terken, 'Modelling intra- and inter-speaker pitch range variation,' Proceedings at the 13th International Congress at Phonetic Sciences Stockholm (eds. EJenius, K. & Branderud, P,), 2, 386-389, 1995
13	T. Moriya, N. Iwakami, A. Jin, K. Ikeda, and S. Miki, 'A design of transform coder for both speech and audio signals at 1 bit/samples,' Proc. IEEE Int. Coni. Acount., Speech, Signal Processing, 1371-1374, 1997
14	ISO/IEC JTC1/SC29/wG11, 'Information technology-coding of audiovisual objects part 3: audio sub part2: parametric coding.' N1903PAR, 1997