[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7776/ASK.2012.31.2.093

Quantization of LPC Coefficients Using a Multi-frame AR-model

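Jung, Won-Jin (Department of Information and Communication Engineering, Sejong University)
Kim, Moo-Young (Department of Information and Communication Engineering, Sejong University)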

Publication Information

The Journal of the Acoustical Society of Korea / v.31, no.2, 2012, pp. 93-99

Abstract

In speech coding, the vocal tract is modeled using Linear Predictive Coding (LPC) coefficients. The LPC coefficients are typically converted to Line Spectral Frequency (LSF) parameters, which are well suited to linear interpolation and quantization. Quantizing the multidimensional LSF vector directly with Vector Quantization (VQ) fully exploits intra-frame correlation and yields high rate-distortion performance. In practice, however, direct VQ is infeasible because of its high computational complexity and memory requirements, so Split VQ (SVQ) is used instead: the multidimensional vector is split into multiple sub-vectors that are quantized separately. LSF parameters also exhibit high inter-frame correlation, which Predictive SVQ (PSVQ) exploits to achieve better rate-distortion performance than SVQ. In this paper, to implement the optimal predictors in PSVQ for voice storage devices, we propose Multi-Frame AR-model based SVQ (MF-AR-SVQ), which considers the inter-frame correlations with multiple previous frames. Compared with conventional PSVQ, the proposed MF-AR-SVQ provides a 1-bit gain in terms of spectral distortion without a significant increase in complexity or memory requirements.
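
The general idea of predicting an LSF vector from several previous frames and split-quantizing the prediction residual can be sketched as follows. This is only an illustrative sketch, not the authors' implementation: the abstract does not give the predictor form, the prediction order, the split dimensions, or the codebooks, so the scalar per-frame coefficients in `coeffs`, the long-term `mean`, the toy codebooks, and all function names below are assumptions made for the example.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def nearest_codeword(x: Sequence[float], codebook: Sequence[Sequence[float]]) -> int:
    """Return the index of the codeword with the smallest squared error to x."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(x, codebook[i])),
    )


def split_vq(x: Sequence[float],
             codebooks: Sequence[Sequence[Sequence[float]]]) -> Tuple[List[int], Vector]:
    """Split x into consecutive sub-vectors and quantize each with its own codebook."""
    indices: List[int] = []
    recon: Vector = []
    start = 0
    for cb in codebooks:
        dim = len(cb[0])  # sub-vector dimension is taken from the codebook
        idx = nearest_codeword(x[start:start + dim], cb)
        indices.append(idx)
        recon.extend(cb[idx])
        start += dim
    return indices, recon


def predict(history: Sequence[Sequence[float]],
            coeffs: Sequence[float],
            mean: Sequence[float]) -> Vector:
    """Multi-frame AR prediction (assumed form): mean + sum_k a_k * (prev_k - mean).

    history[0] is the most recent previously reconstructed frame.
    """
    pred = list(mean)
    for a_k, prev in zip(coeffs, history):
        for i in range(len(pred)):
            pred[i] += a_k * (prev[i] - mean[i])
    return pred


def encode_frame(lsf: Sequence[float],
                 history: Sequence[Sequence[float]],
                 coeffs: Sequence[float],
                 mean: Sequence[float],
                 codebooks: Sequence[Sequence[Sequence[float]]]) -> Tuple[List[int], Vector]:
    """Quantize the prediction residual with split VQ and return (indices, reconstruction)."""
    pred = predict(history, coeffs, mean)
    residual = [x - p for x, p in zip(lsf, pred)]
    indices, q_residual = split_vq(residual, codebooks)
    recon = [p + r for p, r in zip(pred, q_residual)]
    return indices, recon


# Toy 4-dimensional example with a 2+2 split and two previous frames (all values hypothetical).
mean = [0.2, 0.4, 0.6, 0.8]
history = [[0.3, 0.5, 0.7, 0.9], [0.2, 0.4, 0.6, 0.8]]
coeffs = [0.5, 0.25]
codebooks = [
    [[0.0, 0.0], [0.05, 0.0], [-0.05, 0.0]],
    [[0.0, 0.0], [0.0, 0.05], [0.0, -0.05]],
]
indices, recon = encode_frame([0.30, 0.45, 0.65, 0.80], history, coeffs, mean, codebooks)
print(indices, [round(v, 3) for v in recon])
```

In this sketch the predictor uses previously reconstructed frames rather than the original ones, so a decoder holding the same history, coefficients, and codebooks can rebuild the frame from the transmitted indices alone.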

Keywords

LSF; LPC; Quantization; VQ; AR model

