DOI QR코드

DOI QR Code

A Low Bit Rate Speech Coder Based on the Inflection Point Detection

  • Iem, Byeong-Gwan (Department of Electronic Engineering, Gangneung-Wonju National University)
  • Received : 2015.12.09
  • Accepted : 2015.12.25
  • Published : 2015.12.25

Abstract

A low bit rate speech coder based on the non-uniform sampling technique is proposed. The non-uniform sampling technique is based on the detection of inflection points (IP). A speech block is processed by the IP detector, and the detected IP pattern is compared with entries of the IP database. The address of the closest member of the database is transmitted with the energy of the speech block. In the receiver, the decoder reconstructs the speech block using the received address and the energy information of the block. As results, the coder shows fixed data rate contrary to the existing speech coders based on the non-uniform sampling. Through computer simulation, the usefulness of the proposed technique is shown. The SNR performance of the proposed method is approximately 5.27 dB with the data rate of 1.5 kbps.

Keywords

References

  1. L. R. Rabiner and R. W. Schafer, Digital Processing of Speech Signals. Englewood Cliffs, NJ: Prentice-Hall, 1978.
  2. A. M. Kondoz, Digital Speech: Coding For Low Bit Rate Communication Systems. Chichester: John Wiley & Sons, 1994.
  3. G. Lee and W. G. Kim, "Emotion recognition using pitch parameters of speech," Journal of Korean Institute of Intelligent Systems, vol. 25, no. 3, pp. 272-278, 2015. http://dx.doi.org/10.5391/jkiis.2015.25.3.272
  4. W. G. Kim, "Robust speech recognition parameters for emotional variation," Journal of Korean Institute of Intelligent Systems, vol. 15, no. 6, pp. 655-660, 2005. http://dx.doi.org/10.5391/jkiis.2005.15.6.655
  5. M. Bae, W. Lee, and D. Kim, "On a new vocoder technique by the non-uniform sampling," in Proceeding of IEEE Military Communications Conference (MILCOM' 96), McLean, VA, 1996, pp. 649-652. http://dx.doi.org/10.1109/milcom.1996.569428
  6. M. Budaes and L. Goras, "On speech signal reconstruction from local extreme values," in Proceeding of International Symposium on Signals, Circuits and Systems (ISSCS), Lasi, Romania, 2005, pp. 315-318. http://dx.doi.org/10.1109/isscs.2005.1509917
  7. L. D. Davisson, "Data compression using straight line interpolation," IEEE Transactions on Information Theory, vol. 14, no.3, pp. 390-394, 1968. http://dx.doi.org/10.1109/tit.1968.1054160
  8. T. Fjallbrant, "Method of data reduction of sampled speech signals by using non-uniform sampling and a timevariable digital filter," Electronics Letters, vol. 13, no. 11, pp. 334-335, 1977. http://dx.doi.org/10.1049/el:19770243
  9. P. K. Ghosh and T. V. Sreenivas, "Dynamic programming based optimum non-uniform samples for speech reconstruction and coding," in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toulouse, France, 2006, pp. 1221-1224. http://dx.doi.org/10.1109/icassp.2006.1660247
  10. J. W. Mark and T. D. Todd, "A non-uniform sampling approach to data compression," IEEE Transactions on Communications, vol. 29, no.1, pp. 24-32, 1981. http://dx.doi.org/10.1109/TCOM.1981.1094872
  11. B. G. Iem, "A non-uniform sampling technique based on inflection point detection and its application to speech coding," Journal of the Acoustical Society of America, vol. 136, no. 2, pp. 903-909, 2014. http://dx.doi.org/10.1121/1.4884882
  12. B. G. Iem, "A non-uniform sampling technique and its application to speech coding," Journal of Korean Institute of Intelligent Systems, vol. 24, no. 1, pp. 28-32, 2014. http://dx.doi.org/10.5391/jkiis.2014.24.1.028

Cited by

  1. A Fixed Rate Speech Coder Based on the Filter Bank Method and the Inflection Point Detection vol.16, pp.4, 2016, https://doi.org/10.5391/IJFIS.2016.16.4.276