DOI QR코드

DOI QR Code

Frequency Band Selection Exited Linear Prediction Wideband Speech/Audio Coding Using SBR

SBR을 이용한 주파수 밴드선택 여기 선형예측 광대역 음성/오디오 부호화

  • 장성훈 (충북대학교 전파통신공학과) ;
  • 이인성 (충북대학교 전파통신공학과)
  • Received : 2013.05.21
  • Accepted : 2013.08.23
  • Published : 2013.11.30

Abstract

This paper is aimed to improve performance of Band-Selection speech/audio Coder reconstucted band spectrum that is not sent by the comfort noise. To improve the performance, we use the Spectral Band Replication(SBR) technique instead of substitution of Comfort noise. To synthesize SBR signal, the SBR algorithm is referenced in selected signals and the spectrum synthesized by SBR is injected to non-selected band. Each sub-band spectrum has been energy-weighted by real audio signal. We propose the enhanced the Band-Selection Coder that utilizes synthesized SBR signal from selected signal instead of comfort noise.

본 논문은 컴포트 노이즈(comfort noise)를 이용하는 주파수 밴드선택 음성/오디오 코덱에서 컴포트 노이즈 대신 SBR(Spectral Band Replication) 기술을 이용하여 여기 신호를 대체 함으로서 밴드 선택 광대역 음성/오디오 부호화기의 성능 향상을 목표로 한다. 비 전송 밴드에 SBR 기술로 합성된 신호를 삽입하기 위하여 부밴드 별로 전송된 신호를 활용하며, 각각의 부밴드 별로 에너지 가중치를 설정한다. 백색잡음 성분의 컴포트 노이즈 대신 전송신호에 의존하는 신호를 합성 함으로서 보다 높은 음질의 밴드 선택 부호화기를 제안하였다.

Keywords

References

  1. A. Spanias, "Speech coding: a tutorial review" Proc. IEEE, 82, 1541-1582 (1994). https://doi.org/10.1109/5.326413
  2. Kondoz A.M, Digital Speech: Coding for Low Bit Rate Communication Systems, 2nd Ed., (John Wiley & Sons, New Jersey, 2004), pp. 219-255.
  3. T. Painter, A. Spanias, "Perceptual coding of digital audio," Proc. IEEE, 88, 451-515 (2000). https://doi.org/10.1109/5.842996
  4. J. Schnitzler, P. Vary, "Signal processing: trends and perspectives in wideband speech coding," Elsevier, 80, 2267-2281 (2000). https://doi.org/10.1016/S0165-1684(00)00116-X
  5. T.J. Lee, K.O. Kang and W.W. Kim, "MPEG audio new standard: USAC technology," JBE, 16, 693-704 (2011). https://doi.org/10.5909/JEB.2011.16.5.693
  6. K. J. An, "Performance evaluation of the MPEG USAC according to the spectral band replication bandwidth," JBE, 15, 705-713 (2011). https://doi.org/10.5909/JEB.2011.16.5.705
  7. S. H. Jang, K. B. Hong and I. S. Lee, "Design of low bits rate transform excitation wide band speech and audio coder of analysis-by-synthesis structure" (in Korean), J. Acoust. Soc. Kr. 31, 472-479 (2012). https://doi.org/10.7776/ASK.2012.31.7.472
  8. 3GPP TS 26.304, ANSI-C code for the floating point Extended AMR Wideband codec, V9.0.0., 2009.
  9. ITU-R BS.1534, Method for the Subjective Assessment of Intermediate Sound Quality (MUSHRA), 2001.