Modified Generic Mode Coding Scheme for Enhanced Sound Quality of G.718 SWB

(Original Korean title: An Improved Generic Mode Coding Method for Enhancing the Sound Quality of the G.718 Super-Wideband Codec)

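  • Cho, K. (조근석) (Korea Advanced Institute of Science and Technology, Department of Electrical and Electronic Engineering) ;
  • Jeong, S. (정상배) (Gyeongsang National University, Department of Electronic Engineering (Engineering Research Institute))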
  • Received : 2012.08.03
  • Accepted : 2012.09.20
  • Published : 2012.09.30

Abstract

This paper describes a new algorithm for encoding the spectral shape and envelope in the generic mode of the G.718 super-wideband (SWB) codec. In the G.718 SWB coder, generic mode coding and sinusoidal enhancement are used to quantize modified discrete cosine transform (MDCT)-based parameters in the high-frequency band. In the generic mode, the high-frequency band is divided into sub-bands, and for each sub-band the best match under the selected similarity criterion is searched for in the coded, envelope-normalized wideband content. To improve quantization in the high-frequency region of speech and audio signals, a modified generic mode is proposed that improves on the generic mode of G.718 SWB. The proposed generic mode uses perceptual vector quantization of the spectral envelopes and an increased resolution for the spectral copy. The performance of the proposed algorithm is evaluated in terms of objective quality. Experimental results show that the proposed algorithm significantly improves sound quality.
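
The sub-band search described in the abstract can be illustrated with a minimal sketch. This is not the G.718 reference implementation or the proposed algorithm. It assumes normalized cross-correlation as the similarity criterion, and the function names `best_match` and `code_high_band`, the band size, and the toy coefficient values are all hypothetical choices made for illustration.

```python
import math


def best_match(source, target):
    """Return (lag, score) of the source segment most similar to target.

    Similarity is normalized cross-correlation (an assumed criterion;
    the abstract only says "selected similarity criteria").
    """
    n = len(target)
    t_energy = sum(x * x for x in target)
    best_lag, best_score = 0, -math.inf
    for lag in range(len(source) - n + 1):
        seg = source[lag:lag + n]
        s_energy = sum(x * x for x in seg)
        if s_energy == 0.0 or t_energy == 0.0:
            continue  # skip silent segments to avoid division by zero
        corr = sum(a * b for a, b in zip(seg, target))
        score = corr / math.sqrt(s_energy * t_energy)
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag, best_score


def code_high_band(low_band, high_band, band_size):
    """Split high_band into sub-bands and find a copy position for each.

    low_band stands in for the coded, envelope-normalized wideband
    content; the returned lags are what an encoder would transmit.
    """
    lags = []
    for start in range(0, len(high_band) - band_size + 1, band_size):
        target = high_band[start:start + band_size]
        lag, _ = best_match(low_band, target)
        lags.append(lag)
    return lags


# Toy data: each high-band sub-band is a scaled copy of a low-band segment.
low = [0.1, -0.3, 0.9, -0.5, 0.2, 0.7, -0.8, 0.4]
high = [2.0 * x for x in low[2:6]] + [0.5 * x for x in low[4:8]]
lags = code_high_band(low, high, band_size=4)
```

Because the correlation is normalized, the search recovers the copy position regardless of the gain applied to each sub-band. In a real coder the gain would be carried separately by the spectral envelope.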
