Browse > Article

HMM-Based Bandwidth Extension Using Baum-Welch Re-Estimation Algorithm  

Song, Geun-Bae (삼성전자)
Kim, Austin (삼성전자)
Abstract
This paper contributes to an improvement of the statistical bandwidth extension(BWE) system based on Hidden Markov Model(HMM). First, the existing HMM training method for BWE, which is suggested originally by Jax, is analyzed in comparison with the general Baum-Welch training method. Next, based on this analysis, a new HMM-based BWE method is suggested which adopts the Baum-Welch re-estimation algorithm instead of the Jax's to train HMM model. Conclusionally speaking, the Baum-Welch re-estimation algorithm is a generalized form of the Jax's training method. It is flexible and adaptive in modeling the statistical characteristic of training data. Therefore, it generates a better model to the training data, which results in an enhanced BWE system. According to experimental results, the new method performs much better than the Jax's BWE systemin all cases. Under the given test conditions, the RMS log spectral distortion(LSD) scores were improved ranged from 0.31dB to 0.8dB, and 0.52dB in average.
Keywords
Bandwidth extension; Hidden markov model; Gaussian mixture model; Baum-welch Re-estimation;
Citations & Related Records
연도 인용수 순위
  • Reference
1 N. Enbom and W. B. Kleijn, 'Bandwidth expansion of speech based on vector quantization of the Mel frequency cepstral coefficients,' IEEE Workshop on Speech Coding, 171-173, June 1999
2 S. Jaisirrha and I. Y. Soon, 'Bandwidth Extension of Narrow Band Speech Using Cepstral Linear Prediction,' Joint Conference of the Fourth International Conference on Multimedia 3, 1404-1407, Dec. 2003
3 M. Nilsson, S. V. Andersen, and W. B. Kleijn, 'On the mutual information between frequency bands in speech,' ICASSP 3, 1327-1330, June 2000
4 S. Chennoukh, A. Gerrits, and R. Sluijter. 'Speech Enhancement via Frequency Bandwidth Extension Using Line Spectral Frequencies,' ICASSP 1, 665-668, May 2001
5 M. Nilsson, H. Gustafsson, S, V, Andersen, and W. B, Kleijn, 'Gaussian mixture model based mutual information estimation between frequency bands in speech,' ICASSP 1, 525-528, June 2002
6 P. Jax and P. Vary, 'Artificial Bandwidth Extension of Speech Signals Using MMSE Estimation Based on a Hidden Markov Model,' ICASSP 1, 680-683, April 2003
7 T. K. Moon, 'The expectation-maximization algorithm,' IEEE Signal Process. Mag 13 (6), 47-60, Nov. 1996   DOI   ScienceOn
8 Y. Agiomyriannakis and Y. Stylianou, 'Combined estimation/ coding of highband spectral envelopes for speech spectrum expansion,' ICASSP 1, 496-472, May 2004
9 Wei-shou Hsu, Robust bandwidth Extension of narrowband speech, M.A. thesis, McGill Univ., Dept. of Electrical & Computer Engineering, 26-29, Nov. 2004
10 J. S. Garofolo, L. F. Fisher, J. G. Fiscus, D. S. Pallett, and N. L. Dahlgren, 'DARPA-TIMlT: Acoustic-Phonetic Continuous Speech Corpus,' 1990
11 P. Jax and P. Vary, 'Wideband extension of telephone speech using a hidden Markov model,' IEEE Workshop on Speech Coding, 133-135, Sept. 2000
12 K. -Y Park and H. S. Kim, 'Narrowband to Wideband Conversion of Speech Using GMM Based Transformation,' ICASSP 3, 1843-1846, June 2000
13 L. R. Rabiner, 'A tutorial on Hidden Markov Models and Selected Applications in Speech Recognition,' Proceedings of the IEEE 77 (2), 257-286, Feb. 1989
14 P. Jax and P. Vary, 'On artificial bandwidth extension of telephone speech,' Signal Processing 83 (8), 1707-1719, Aug. 2003
15 Y. Linde, A. Buzo, R. M. Gray, 'An algorithm for vector quantizer design,' IEEE Trans. Commun. 28 (1), 84-95, 1980   DOI