http://dx.doi.org/10.7840/kics.2016.41.10.1183

Intelligibility Improvement of Low Bit-Rate Speech Coder Using Stochastic Spectral Equalizer  

Lee, Jeong Hun (Department of Electronic Engineering, Seoul National University of Science and Technology)
Yun, Deokgyu (Department of Electronic Engineering, Seoul National University of Science and Technology)
Choi, Seung Ho (Department of Electronic and IT Media Engineering, Seoul National University of Science and Technology)
Abstract
Low bit-rate speech coders in digital speech communications synthesize speech using vocal tract model parameters. Because the number of bits allocated to these parameters is severely limited, the spectra of the synthesized speech can be heavily distorted, which degrades speech intelligibility. In this paper, we propose a speech intelligibility improvement method using a stochastic spectral equalizer. For each speech coder, the method stochastically estimates a weight vector from the spectral ratios between original and synthesized speech, and then applies this weight vector to the synthesized speech. In objective speech intelligibility tests, the proposed method performed better than the conventional method.
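The weighting step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the weight vector is estimated, so this example assumes a geometric mean of per-bin magnitude ratios over a set of training frames; the function names `estimate_weights` and `equalize`, the `eps` guard, and the toy spectra are hypothetical and are not taken from the paper.

```python
import math


def estimate_weights(original_frames, synthesized_frames, eps=1e-12):
    """Estimate one weight per frequency bin from paired magnitude spectra.

    Assumption (not from the paper): the weight is the geometric mean,
    over all training frames, of the ratio original / synthesized.
    """
    if not original_frames or len(original_frames) != len(synthesized_frames):
        raise ValueError("need the same non-zero number of frames in both sets")
    n_bins = len(original_frames[0])
    log_sum = [0.0] * n_bins
    for orig, synth in zip(original_frames, synthesized_frames):
        for k in range(n_bins):
            # eps guards against division by zero and log(0) in silent bins
            log_sum[k] += math.log((orig[k] + eps) / (synth[k] + eps))
    n_frames = len(original_frames)
    return [math.exp(s / n_frames) for s in log_sum]


def equalize(synth_frame, weights):
    """Apply the per-bin weights to one synthesized magnitude spectrum."""
    return [w * s for w, s in zip(weights, synth_frame)]


# Toy magnitude spectra (2 frames, 3 bins); purely illustrative values.
original = [[1.0, 2.0, 4.0], [2.0, 4.0, 8.0]]
synthesized = [[1.0, 1.0, 1.0], [2.0, 2.0, 2.0]]

weights = estimate_weights(original, synthesized)
equalized = equalize(synthesized[0], weights)
```

In this toy case the synthesized spectra are flat while the original spectra rise with frequency, so the estimated weights boost the higher bins and the equalized frame approximates the original one. A real system would obtain the magnitude spectra from a short-time Fourier transform and estimate a separate weight vector for each coder.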
Keywords
Stochastic spectral equalizer; Low bit-rate speech coder; Speech intelligibility