Browse > Article

Design of a Quantization Algorithm of the Speech Feature Parameters for the Distributed Speech Recognition  

Lee Joonseok (한양대학교 전자컴퓨터공학부)
Yoon Byungsik (한양대학교 전자전기제어계측공학과 대학원)
Kang Sangwon (한양대학교 전자컴퓨터공학부)
Abstract
In this paper, we propose a predictive block constrained trellis coded quantization (BC-TCQ) to quantize cepstral coefficients for the distributed speech recognition. For Prediction of the cepstral coefficients. the 1st order auto-regressive (AR) predictor is used. To quantize the prediction error signal effectively. we use a BC-TCQ. The performance is compared to the split vector quantizers used in the ETSI standard, demonstrating reduction in the cepstral distance and computational complexity.
Keywords
Mel-cepstrum; Distributed speech recognition; BC-TCQ; Vector quantization;
Citations & Related Records
연도 인용수 순위
  • Reference
1 M. W. Marcellin, and T. R. Fischer, 'Trellis coded quantization of memoryless and Gauss-Markov sources,' IEEE Trans. Communications, 38, issue 1, 82-93, Jan. 1990   DOI   ScienceOn
2 S. Kang, Y. Shin, and T.R. Fischer, 'Low-complexity predictive trellis coded quantization of speech line spectral frequencies,' IEEE Trans. Signal Processing, 52 (7), 2070-2079, July 2004   DOI   ScienceOn
3 G. D. Forney Jr., 'The Viterbi algorithm,' Proc. IEEE, 61, 268-278, Mar. 1973
4 R. F. Kubichk, 'Mel-Cepstral Distance measure for objective speech quality assessment,' Communications, Computers and Signal Processing, IEEE Pacific Rim Conf, 1, 125 - 128, May 1993
5 Speech Processing, Transmission and Quality aspects (STQ); Distributed speech recognition; Front-end feature extraction algorithm; Compression algorithms, ETSI ES 201 108 (V1.1.2), April 2000
6 S. Nikneshan and A. K. Khandani, 'Soft Decision Decoding of Fixed Rate Entropy Constrained Quantizer over a Noisy Channel,' 20th Biennial Symposium on Communications, 116-118, Kingston, ON, May 28-May 31, 2000
7 N. S. Jayant, 'Digital Coding of Waveforms; Principles and Applications to Speech and Video,' Prentice Hall Signal Processing Series, Academic Press, 524-532, 1984