http://dx.doi.org/10.7776/ASK.2008.27.6.309

A Study of BWE-Prediction-Based Split-Band Coding Scheme  

Song, Geun-Bae (Telecommunication R&D Center, Samsung Electronics)
Kim, Austin (Telecommunication R&D Center, Samsung Electronics)
Abstract
In this paper, we discuss a method for efficiently coding the high-band signal in split-band coding, where an input signal is divided into two bands and each band may then be encoded separately. It is well known, especially from research on artificial bandwidth extension (BWE), that the two bands are correlated to some degree, so some coding gain can be achieved by exploiting this correlation. In the BWE-prediction-based coding approach, a simple linear BWE function may not yield optimal results because the correlation is non-linear. We investigate this coding scheme in more detail. A few representative BWE functions, both linear and non-linear, are compared to find one suitable for coding. We also discuss whether additional gain can be obtained by combining the BWE coder with a predictive vector quantizer, which exploits temporal correlation.
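The idea of BWE-prediction-based coding can be illustrated with a toy sketch: predict the high-band feature from the low-band feature, then code only the prediction residual. The example below is not the authors' method. It assumes scalar features, synthetic data with a deliberately non-linear relation, and two hypothetical predictors (`fit_linear` and `fit_binned`) that stand in for a linear and a non-linear BWE function.

```python
import random
import statistics


def fit_linear(xs, ys):
    """Least-squares fit of y ~ a*x + b (stand-in for a linear BWE function)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    a = sxy / sxx
    b = my - a * mx
    return lambda x: a * x + b


def fit_binned(xs, ys, n_bins=16):
    """Mean of y per low-band bin (crude stand-in for a non-linear BWE function)."""
    lo, hi = min(xs), max(xs)
    width = (hi - lo) / n_bins

    def index(x):
        # Clamp so that unseen values outside the training range still map to a bin.
        return min(n_bins - 1, max(0, int((x - lo) / width)))

    sums = [0.0] * n_bins
    counts = [0] * n_bins
    for x, y in zip(xs, ys):
        i = index(x)
        sums[i] += y
        counts[i] += 1
    overall = statistics.fmean(ys)
    means = [s / c if c else overall for s, c in zip(sums, counts)]
    return lambda x: means[index(x)]


def prediction_gain(predict, xs, ys):
    """High-band variance divided by mean squared residual; > 1 means the residual has less energy."""
    mse = statistics.fmean((y - predict(x)) ** 2 for x, y in zip(xs, ys))
    return statistics.pvariance(ys) / mse


# Synthetic data (assumption): the high band depends non-linearly on the low band.
rng = random.Random(0)
low = [rng.uniform(-1.0, 1.0) for _ in range(4000)]
high = [0.5 * x + x * x + rng.gauss(0.0, 0.05) for x in low]

train_x, test_x = low[:3000], low[3000:]
train_y, test_y = high[:3000], high[3000:]

linear_gain = prediction_gain(fit_linear(train_x, train_y), test_x, test_y)
binned_gain = prediction_gain(fit_binned(train_x, train_y), test_x, test_y)
print(f"linear gain: {linear_gain:.2f}, binned gain: {binned_gain:.2f}")
```

On this synthetic data the non-linear predictor leaves a smaller residual than the linear one, which mirrors the abstract's point that a linear BWE function can be suboptimal when the inter-band correlation is non-linear. Real systems would operate on spectral-envelope vectors rather than scalars.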
Keywords
Split-band Coding; Predictive Coding Scheme; Bandwidth Extension; Predictive Vector Quantizer