DOI QR코드

DOI QR Code

음성의 변곡점 추출 및 전송에 기반한 가변 데이터율 음성 부호화 기법

A Variable Data Rate Speech Coding Technique Based on the Inflection Point Detection of Speech

  • 임병관 (강릉원주대학교 전자공학과)
  • Iem, Byeong-Gwan (Department of Electronic engineering, Gangneung-Wonju National University)
  • 투고 : 2013.01.09
  • 심사 : 2013.03.25
  • 발행 : 2013.04.01

초록

A new variable rate speech coding technique is proposed. The method is based on the observation that the speech signal approximately looks linear for a very short period of time. The information transmitted is the location and data value of inflection points. If the distance between the inflection points is large, the mid point location and its data value are also delivered. Thus, the encoder transmits both the location and the data value for the inflection samples, but the location only for the non-inflection points. The location information is expressed using one bit for each sample, 0 for non-inflection and 1 for inflection point. At the receiver, using the interpolation, the decoder estimates the untransmitted sample values for non-inflection locations from the received sample values for the inflection samples. With 50 % of computational cost of the existing CVSD delta modulation, the proposed method is expected to achieve the data rate of 36 to 38 kbps and the SNR of 10 to 13 dB.

키워드

참고문헌

  1. A. M. Kondoz, Digital Speech, Wiley, 1994.
  2. B. Boashash, "Estimating and interpreting the instantaneous frequency of a signal-Part 2: algorithms and applications," Proc. of the IEEE , vol. 80, pp. 540-568, Apr. 1992. https://doi.org/10.1109/5.135378
  3. D. G. Childers, Speech Processing and Synthesis Toolboxes, Wiley, 2000.
  4. P. J. Davis, Interpolation and Approximation, Dover, 1975.
  5. L.R. Rabinar and R.W. Schafer, Digital Processing of Speech Signals, Prentice-Hall, 1978.

피인용 문헌

  1. Performance Analysis of A Variable Bit Rate Speech Coder vol.62, pp.12, 2013, https://doi.org/10.5370/KIEE.2013.62.12.1750