Browse > Article

Trade-off between Model Complexity and Performance in Intra-frame Predictive Vector Quantization of Wideband Speech  

Song, Geun-Bae (ITRC, Soongsil Univ.)
Hahn, Hern-Soo (School of Electronics Engineering, Soongsil Univ.)
Publication Information
The Journal of Korea Robotics Society / v.5, no.1, 2010 , pp. 70-76 More about this Journal
Abstract
This paper addresses a design issue of "model complexity and performance trade-off" in the application of bandwidth extension (BWE) methods to the intra-frame predictivevector quantization problem of wideband speech. It discusses model-based linear and non-linear prediction methods and presents a comparative study of them in terms of prediction gain. Through experimentation, the general trend of saturation in performance (with the increase in model complexity) is observed. However, specifically, it is also observed that there is no significant difference between HMM and GMM-based BWE functions.
Keywords
Bandwidth Extension; Wideband Speech; Gaussian Mixture Model; Hidden Markov Model;
Citations & Related Records
연도 인용수 순위
  • Reference
1 M. Nilsson, H. Gustafsson, S. Andersen, and W. Kleijn, "Gaussian mixture model based mutual information between frequency bands in speech," ICASSP, Vol.1, pp.525-528, May 2002.
2 Y. Agiomyrgiannakis and Y. Stylianou, "Conditional vector quantization for speech coding," IEEE Trans. Audio, Speech, Lang. Process., Vol.15, No.2, pp.377-386, Feb. 2007.   DOI
3 B. Geiser and P. Vary, "Backwards compatible wideband telephony in mobile networks: CELP watermarking and bandwidth extension," ICASSP, Vol.4, pp.533-536, April 2007.
4 P. Jax, "Bandwidth extension for speech," in Audio Bandwidth Extension, E. Larsen and R. M. Aarts (Ed.), NY:John Wiley & Sons, Nov. 2004, Chap. 6, pp.171-235.
5 K. -Y. Park, H.S. Kim, "Narrowband to wideband conversion of speech using GMM based transformation," ICASSP, Vol.3, pp.1843-1846, June 2000.
6 G. -B. Song and P. Martynovich, "A Study of HMM-based bandwidth extension of speech signals," Signal Processing, Vol.89, No.10, pp.2036-2044, Oct. 2009.   DOI   ScienceOn
7 Linde Y, Buzo A, and Gray RM, "An algorithm for vector quantizer design," IEEE Trans. Comm. Vol.28, No.1, pp.84-95, 1980.   DOI
8 P. Jax, and P. Vary, "Artificial bandwidth extension of speech signals using MMSE estimation based on a hidden Markov model," ICASSP, Vol.1, pp.680-683, April 2003.
9 J. S. Garofolo, L. F. Fisher, J. G. Fiscus, D. S. Pallett, and N. L. Dahlgren, "DARPA TIMIT Acoustic- Phonetic Continuous Speech Corpus CD-ROM," NIST, 1990.