Matching Pursuit Estimation and Quantizer Design for Sinusoidal Model-based Coder  

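Ahn Yeong-Uk (Core Logic Inc.)
Jeong Gyu-Hyeok (Department of Radio Engineering, Chungbuk National University)
Kim Jong-Hak (Department of Radio Engineering, Chungbuk National University)
Yang Yong-Ho (Department of Radio Engineering, Chungbuk National University)
Lee In-Sung (Department of Radio Engineering, Chungbuk National University)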
Abstract
In this paper, we propose a coding method that applies a matching pursuit (MP) algorithm to strongly periodic highband signals. We also propose an efficient quantizer for the estimated parameters: spectral magnitude and phase. Because it is based on the error concealment principle and the sinusoidal model, the MP algorithm requires high-precision pitch period estimation. To estimate the pitch period more accurately, the refined pitch obtained from the lowband speech is used, which increases the efficiency of bit allocation. The spectral magnitude parameters are quantized by a method that combines the MDCT (Modified Discrete Cosine Transform) with a multi-stage structure. The spectral phase quantizer exploits the $2\pi$ modular characteristic of phases and a weighting function based on the spectral magnitudes. To evaluate the efficiency of the proposed method, we applied it to an analysis-by-synthesis system. Furthermore, we suggest the possibility of scalable wideband speech codecs based on a band-split structure.
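The abstract does not give the estimation or quantization equations, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a dictionary restricted to harmonics of a known pitch frequency `w0` and a frame that spans a whole number of pitch periods, so that the simple amplitude estimate `2|c|/N` is exact. The function names, the frame length, and the squared-error form of the magnitude-weighted, $2\pi$-wrapped phase distortion are all illustrative assumptions.

```python
import cmath
import math


def harmonic_matching_pursuit(x, w0, num_harmonics, num_atoms):
    """Greedy sinusoid extraction restricted to harmonics of w0 (rad/sample).

    Hypothetical helper: returns a list of (harmonic index, amplitude, phase)
    and the final residual.
    """
    n_samples = len(x)
    residual = list(x)
    picked = []
    for _ in range(num_atoms):
        best_k, best_c = None, 0j
        for k in range(1, num_harmonics + 1):
            w = k * w0
            # Correlate the residual with a complex exponential at k * w0.
            c = sum(r * cmath.exp(-1j * w * n) for n, r in enumerate(residual))
            if best_k is None or abs(c) > abs(best_c):
                best_k, best_c = k, c
        # Amplitude and phase of the best-matching harmonic (assumes the
        # frame covers an integer number of periods of that harmonic).
        amp = 2.0 * abs(best_c) / n_samples
        phase = cmath.phase(best_c)
        # Subtract the selected sinusoid before searching for the next one.
        residual = [
            r - amp * math.cos(best_k * w0 * n + phase)
            for n, r in enumerate(residual)
        ]
        picked.append((best_k, amp, phase))
    return picked, residual


def wrapped_phase_error(phases, quantized, magnitudes):
    """Magnitude-weighted squared phase error with differences wrapped to [-pi, pi)."""
    total = 0.0
    for p, q, m in zip(phases, quantized, magnitudes):
        d = (p - q + math.pi) % (2.0 * math.pi) - math.pi
        total += m * d * d
    return total


# Synthetic frame: two harmonics of a pitch with a 16-sample period.
N = 64
w0 = 2.0 * math.pi * 4 / N
x = [
    1.0 * math.cos(w0 * n + 0.3) + 0.5 * math.cos(2 * w0 * n - 1.0)
    for n in range(N)
]
atoms, residual = harmonic_matching_pursuit(x, w0, num_harmonics=7, num_atoms=2)
residual_energy = sum(r * r for r in residual)
```

In this toy setting the two atoms absorb essentially all of the frame energy. With a real highband signal the pitch estimate is imperfect, which is why the abstract stresses a refined pitch taken from the lowband. The wrapped error treats phases near $+\pi$ and $-\pi$ as close, and weighting by magnitude makes errors on strong harmonics cost more.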
Keywords
Matching Pursuit