Browse > Article

Matching Pursuit Sinusoidal Modeling with Damping Factor  

Jeong, Gyu-Hyeok (Dept. of Radio Science & Engineering, Chungbuk National University)
Kim, Jong-Hark (Dept. of Radio Science & Engineering, Chungbuk National University)
Lim, Joung-Woo (Dept. of Radio Science & Engineering, Chungbuk National University)
Joo, Gi-Ho (Dept. of Informations and Communications Engineering, PaiChai University)
Lee, In-Sung (Dept. of Radio Science & Engineering, Chungbuk National University)
Publication Information
Abstract
In this paper, we propose the matching pursuit with damping factors, a new sinusoidal model improving the matching pursuit, for the codecs based on sinusoidal model. The proposed model defines damping factors by using a correlativity of parameters between the current and adjacent frame, and estimates sinusoidal parameters more accurately in analysis frame by using the matching pursuit according to damping factor, and synthesizes the final signal. Then it is possible to model efficiently without interpolation schemes. The proposed sinusoidal model shows a better speech quality without an additional delay than the conventional sinusoidal model with interpolation methods. Through the SNR(signal to noise ratio), the MOS(Mean Opinion Score), LR(Itakura-Saito likelihood ratio), and CD(cepstral distance), we compare the performance of our model with that of matching pursuit using interpolation methods.
Keywords
matching pursuit;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 R. J. McAulay and T. F. Quatieri, 'Magnitude-only reconstruction using a sinusoidal speech model,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '84, vol. 2, pp. 27.6.1-27.6.4, Mar. 1984
2 R. J. McAulay and T. F. Quatieri, 'Computationally ecient sine-wave synthesis and its application to sinusoidal transform coding,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '88, vol. 1, pp. 370-373, Apr. 1988   DOI
3 J. Nieuwenhuijse, R. Heusdens, and E. F. Deprettere, 'Robust exponential modeling of audio signals,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '98, vol. 2, pp. 3581-3584, Mar. 1998   DOI
4 X. Serra and J. Smith, 'Spectral modeling synthesis: A sound analysis/synthesis system based on a deterministic plus stochastic decomposition,' Computer Music journal, vol. 14, pp. 12-24, Dec. 1990
5 T. S Verma and T. H. Y. Meng,' Sinusoidal Modeling Using Frame-Based Perceptually Weighted Matching Pursuit,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '99, vol. 2, pp. 981-984, 1999   DOI
6 Y. Ding and X. Qian, 'Estimating sinusoidal parameters of musical tones based on global waveform fitting,' Multimedia Signal Processing, pp. 95 - 100, Jun. 1997   DOI
7 I. Atkinson, S. Yeldner and A. Kondoz, 'High quality split band LPC vocoder operating at low bit rates,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '97, vol. 2, pp. 1559 - 1562, Apr. 1997   DOI
8 K. Skretting, K. Engan and J.H. Husoy, 'ECG compression using signal dependent frames and matching pursuit,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05, vol. 4, pp. 585-588, Mar. 2005   DOI
9 P. Vera-Candeas and N. Ruiz-Reyes, 'New matching pursuit based sinusoidal modelling method for audio coding,' IEE Proceedings on Vision, Image and Signal Processing, vol. 151, pp. 21-28, Feb. 2004   DOI   ScienceOn
10 T. Painter and A. Spanias, 'Perceptual segmentation and component selection in compact sinusoidal representations of audio,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '01, vol. 5, pp. 3289 - 3292, May 2001   DOI
11 Yuan Yuan and D. M. Monro, 'Improved Matching Pursuits Image Coding,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05, vol. 2, pp. 201-204, Mar. 2005
12 J. Nieuwenhuijse, R. Heusdens, and E.F. Deprettere, 'Robust exponential modeling of audio signals,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '98, Seattle, Washington, USA, vol. 6, pp. 3581?3584, May 1998   DOI
13 T. S. Verma and T. H. Y. Meng, 'Sinusoidal modeling using frame-based perceptually weighted matching pursuits,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '99, Phoenix, Arizona, USA, vol. 2, pp. 981?984, May 1999   DOI
14 T. F. Quatieri and R. J. McAulay, 'Speech transformations based on a sinusoidalrepresentation,' IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 1449-1464, 1986   DOI
15 K. Hermus, W. Verhelst, and P. Wambacq, 'Psycho-acoustic modeling of audio with exponentially damped sinusoids,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '02, pp. 1821-1824, 2002
16 W. B. Kleijin and K. K. Paliwal, Speech coding and synthesis, Elevier Science Publishers, Amsterdam, 1995
17 L. Girin, S. Marchand, J. di Martino, A. Robel and G. Peeters, 'Comparing the order of a polynomial phase model for the synthesis of quasi-harmonic audio signals,' Applications of Signal Processing to Audio and Acoustics, 2003 IEEE Workshop on. 19-22 pp. 193-196, Oct. 2003
18 ITU-T Recommendation P.862, 'Perceptual evaluation of speech quality (PESQ) : An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codec', Feb. 2001
19 S. Wang, A. Sekey and A. Gersho, 'An objective measure for predicting subjective quality of speech coders,' Selected Areas in Communications, IEEE Journal on vol. 10, pp. 819 - 823, Jun. 1992   DOI   ScienceOn
20 J. Jensen, S. H. Jensen, and E. Hansen, 'Exponential sinusoidal modeling of transitional speech segments,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '99, pp. 473-476, 1999   DOI
21 안 영욱, 정 규혁, 김 종학, 양 용호, 이 인성, '정현파 모델 부호화기를 위한 MP(MatchingPursuit)알고리즘과 파라미터 양자화기,' 음향학회지 제 24권 제 7호, pp. 402-409, 2005   과학기술학회마을
22 T. F. Quatieri and R. J. McAulay, 'Phase modelling and its application to sinusoidal transform coding,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '86, vol. 3, pp. 1713-1715, Apr. 1986
23 R. J. McAulay and T. F. Quatieri, 'Speech analysis/synthesis based on a sinusoidal representation,' IEEE Trans. on ASSP, vol. 34, no. 4, pp. 744?754, Aug. 1986
24 E. B. George and M. J. T. Smith, 'Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model,' IEEE Trans. Speech Audio Processing, vol. 5, no. 5, pp. 389-406, 1997   DOI   ScienceOn
25 Y. Stylianou, 'Applying the harmonic plus noise model in concatenative speech synthesis,' IEEE Trans. Speech Audio Processing, vol. 9, pp. 232-239, Mar. 2001   DOI   ScienceOn
26 J. Jensen and J. H. L. Hansen, 'Speech enhancement using a constrained iterative sinusoidal model,' IEEE Trans. Speech Audio Processing, vol. 9, pp. 731-740, Oct. 2001   DOI   ScienceOn