Low-bitrate Multichannel Audio Coding

저비트율 멀티채널 오디오 부호화

  • Jang, Inseon (Broadcasting Media Research Group, Digital Broadcasting Research Division, ETRI) ;
  • Seo, Jeongil (Broadcasting Media Research Group, Digital Broadcasting Research Division, ETRI) ;
  • Beak, Seungkwon (Broadcasting Media Research Group, Digital Broadcasting Research Division, ETRI) ;
  • Kang, Kyeongok (Broadcasting Media Research Group, Digital Broadcasting Research Division, ETRI)
  • 장인선 (한국전자통신연구원 디지털방송연구단 방송미디어연구그룹) ;
  • 서정일 (한국전자통신연구원 디지털방송연구단 방송미디어연구그룹) ;
  • 백승권 (한국전자통신연구원 디지털방송연구단 방송미디어연구그룹) ;
  • 강경옥 (한국전자통신연구원 디지털방송연구단 방송미디어연구그룹)
  • Published : 2005.09.01

Abstract

Technology for compressing low-bitrate multichannel audio coding is being standardized owing to the increasing need of consumer for multichannel audio contents. In this paper we propose the sound source location cue coding (SSLCC) for extremely compressing multichannel audio to be suitable at the narrow bandwidth transmission environment. To improve the compression capability of the conventional binaural cue coding(BCC), the SSLCC adopts the virtual source location information (VSLI) as a spatial cue parameter, a symmetric uniform quantizer, and Huffman coder. The objective and subjective assessment results show that the SSLCC provides lower bitrate and better audio quality than conventional BCC method.

멀티채널 오디오 컨텐츠에 대한 사용자의 욕구가 증가하고 있는 가운데 서비스를 위한 저비트율 멀티채널 오디오 부호화 기술에 대한 표준화가 진행되고 있다. 본 논문에서는 협대역폭 채널환경에 적합한 고압축율의 멀티채널 오디오 부호화 기술인 음원위치단서 부호화 기술(Sound Source Location Cue Coding; SSLCC)을 제안하고, SSLCC가 기존 BCC (Binaural Cue Coding)의 압축 성능을 향상시키기 위하여 단서로 이용한 VSLI (Virtual Source Location Information)를 설명한다. 또한 객관적 평가로 기존의 BCC 시스템과의 평균 비트율을 비교하고, MUSHRA (Multi-Stimulus test with Hidden Reference and Anchor) 방법을 이용한 주관 청취 평가를 수행하여 제안된 SSLCC 부호화 기법의 우수성을 확인하였다.

Keywords

References

  1. ISO/IEC JTC1/SC29/WG11 (MPEG), Document N6455, 'Call for Proposals on Spatial Audio Coding,' Munich, Mar. 2004
  2. C. Faller and F. Baumgarte, 'Efficient Representation of Spatial Audio using Perceptual Parameterization,' IEEE Workshop on Appl. of Sig. Proc. to Audio and Acoust., Oct. 2001
  3. C. Faller and F. Baumgarte, 'Binaural Cue Coding: A Novel and Efficient Representation of Spatial Audio,' Proceeding ICASSP 2002, Orlando, FL, May 2002
  4. F. Baumgarte and C. Faller, 'Estimation of Auditory Spatial Cues for Binaural Cue Coding (BCC),' Proceeding ICASSP 2002, Orlando, FL, May 2002
  5. ISO/IEC JTC1/SC29/WG11, Document M11264, 'A New Cue Parameter for Spatial Audio Coding,' Redmond, July 2004
  6. 문한길, 이정석, 서정일, 성굉모, '멀티채널 오디오 재생 시스템에서 가상 음원의 위치 정보를 이용한 압축 재생 방법,' 2004년도 한국음향학회 추계학술발표대회, 제23권 제2(s)호, 2004
  7. 문한길, '가상음원 정보를 이용한 MPEG-4 SAC 다채널 오디오 압축 기법,' 서울대학교박사학위논문, 2005년 2월
  8. S. Beack, J. Seo, H. Moon, and M. Hahn, 'Angle-based Virtual Source Location Representation for Spatial Audio Coding,' submitted to IEE Letter
  9. H. Moon, J. Seo, S. Beack, and K. Sung, 'A Multi-Channel Audio Compression Method with Virtual Source Location Information for MPEG-4 SAC,' submitted to IEEE Trans. on Consumer Electronics
  10. V. Pulkki, 'Localization of Amplitude-Panned Virtual Sources I: Three-dimensional Panning,' Journal Audio Engineering Society, vol. 49, no. 9, pp. 753-767, September 2001
  11. C. Faller and F. Baumgarte, 'Binaural Cue Coding Applied to Audio Compression with Flexible Rendering,' Proceeding AES 113th Convention, Los Angeles, CA, Oct. 2002
  12. C. Faller and F. Baumgarte, 'Binaural Cue Coding Part II: Schemes and Applications,' IEEE Trans. Speech Audio Processing, vol. 11, pp. 520-531, Nov. 2003 https://doi.org/10.1109/TSA.2003.818108
  13. ITU-R Recommendation, 'Multichannel Stereophonic Sound System with and Without Accompanying Picture,' International Telecommunication Union, BS. 775-1, Geneva, 1994
  14. S. Beack, J. Seo, H. Moon, and M. Hahn, 'Angle-based Virtual Source Location Representation for Spatial Audio Coding,' submitted to IEE Letter
  15. ISO/IEC 14496-3:2001, 'Information Technology-Generic Coding of Audio-Visual Objects, Part 3: Audio,' International Standard Organization, 2001
  16. ISO/IEC JTC1/SC29/WG11 (MPEG), Document N6691, 'Procedures for the Evaluation of Spatial Audio Coding Systems,' Redmond, July 2004
  17. G. A. Soulodre, and M. C. Lavoie, , 'Subjective Evaluation of Large and Small Impairments in Audio Codecs,' AES 17th International Conference, Florence, pp. 329-336, Sep. 1999