http://dx.doi.org/10.6109/jkiice.2012.16.1.059

Blind Classification of Speech Compression Methods using Structural Analysis of Bitstreams  

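Yoo, Hoon (Division of Digital Media, Sangmyung University)
Park, Cheol-Sun (Agency for Defense Development)
Park, Young-Mi (Agency for Defense Development)
Kim, Jong-Ho (Department of Multimedia Engineering, Sunchon National University)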
Abstract
This paper presents an algorithm for blind estimation and classification of speech compression methods based on structural analysis of the compressed bitstream. Many speech compression methods, including vocoders, have been developed to transmit or store speech signals at very low bit rates. A key feature of vocoders is that their output inevitably has a block structure. To classify the compression method, we use the Measure of Inter-Block Correlation (MIBC) to check whether a bitstream has a block structure and, if so, to estimate the block length. For compression methods that share the same block length, the proposed algorithm correctly identifies the method by exploiting the fact that each method has different correlation characteristics at each bit position. Experimental results indicate that the proposed algorithm classifies speech compression methods robustly for various types and lengths of speech signals in noisy environments.
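The block-length search described in the abstract can be illustrated with a minimal sketch. The abstract does not give the MIBC formula, so the score below is an assumed stand-in: for each candidate block length, it measures how far the agreement rate between the same bit position in consecutive blocks deviates from chance. All function names, the detection ratio of 3.0, and the synthetic test stream are hypothetical and chosen only for illustration.

```python
import random
import statistics


def position_profile(bits, block_len):
    """Per bit position: deviation from chance of the agreement rate between
    consecutive candidate blocks (0 = chance level, 1 = fully determined)."""
    n_blocks = len(bits) // block_len
    if n_blocks < 2:
        return [0.0] * block_len
    profile = []
    for j in range(block_len):
        agree = sum(
            bits[i * block_len + j] == bits[(i + 1) * block_len + j]
            for i in range(n_blocks - 1)
        )
        profile.append(abs(2.0 * agree / (n_blocks - 1) - 1.0))
    return profile


def inter_block_score(bits, block_len):
    """Assumed stand-in for MIBC: mean of the per-position profile."""
    return sum(position_profile(bits, block_len)) / block_len


def estimate_block_length(bits, min_len, max_len):
    """Score every candidate length and return the best one with all scores."""
    scores = {n: inter_block_score(bits, n) for n in range(min_len, max_len + 1)}
    best = max(scores, key=scores.get)
    return best, scores


def looks_block_structured(scores, ratio=3.0):
    """Crude detector (assumption): the peak must stand well above the median."""
    return max(scores.values()) > ratio * statistics.median(scores.values())


def synthetic_stream(block_len, n_blocks, sticky, keep_prob, rng):
    """Toy bitstream: 'sticky' positions tend to repeat the previous block's
    bit, and all other positions are uniformly random."""
    prev = [rng.randint(0, 1) for _ in range(block_len)]
    bits = []
    for _ in range(n_blocks):
        cur = []
        for j in range(block_len):
            if j in sticky:
                keep = rng.random() < keep_prob
                cur.append(prev[j] if keep else 1 - prev[j])
            else:
                cur.append(rng.randint(0, 1))
        bits.extend(cur)
        prev = cur
    return bits


if __name__ == "__main__":
    rng = random.Random(0)
    # Arbitrary toy parameters: 54-bit blocks, every third position sticky.
    stream = synthetic_stream(54, 300, set(range(0, 54, 3)), 0.9, rng)
    best, scores = estimate_block_length(stream, 8, 80)
    print("estimated block length:", best)
    print("block structure detected:", looks_block_structured(scores))
```

The per-position profile is also the kind of feature the second stage relies on, since methods with the same block length are said to differ in their correlation at each bit position. How the paper compares such profiles is not stated in the abstract, and a blind setting would additionally need to resolve the unknown offset of the block boundary.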
Keywords
Speech compression; MIBC; correlation analysis; structural bitstream analysis