DOI QR코드

DOI QR Code

An Improved Synthesis Method of Parametric Stereo Coding Based on Tonality Information

토널리티 정보를 기반으로 한 파라메트릭 스테레오 부호화의 개선된 합성 기법

  • Lee, Tung chin (School of Electrical and Electronic Engineering, Yonsei University) ;
  • Park, Young-Cheol (Computer and Telecommunications Engineering Division, Yonsei University) ;
  • Youn, Dae Hee (School of Electrical and Electronic Engineering, Yonsei University)
  • 이동금 (연세대학교 전기전자공학과) ;
  • 박영철 (연세대학교 컴퓨터 정보통신공학부) ;
  • 윤대희 (연세대학교 전기전자공학과)
  • Received : 2014.02.08
  • Accepted : 2014.05.25
  • Published : 2014.06.25

Abstract

In this paper, we propose a synthesis method that can effectively suppress the ambience which affects tonal components in the PS decoder. Ambience component was obtained by using decorrelation filter and the weighting of the ambience in the decoder was determined through IC parameter. However, since the parameters are extracted in the sub-band domain, a low IC value could be analyzed even if the tonal component is dominant. The quality of the output signal may be degraded. To prevent this problem, the tonality was measured in the downmixed signal and the weighting of the ambience components were adjusted appropriately according to the measured tonality index. The performance of the proposed method was evaluated by simulations. Furthermore, the subjective test was performed and the results confirmed that the proposed method offers improved quality.

본 논문에서는 PS의 복호화과정에서 톤 성분에 영향을 주는 잔향 성분을 효과적으로 억제할 수 있는 합성 방법을 제안하였다. PS에서 잔향 성분은 비상관 필터를 이용하여 구할 수 있으며, 부호화단에서 분석된 IC 파라미터를 통해서 합성되는 잔향의 비중이 결정된다. 하지만 파라미터들은 서브밴드 도메인에서 분석되기 때문에, 톤 성분이 존재하는 대역에서도 낮은 IC값이 분석될 수 있고, 이는 출력 신호의 음질 열화를 야기시킨다. 본 논문에서는 이러한 문제를 보완하기 위해 복호화단으로 입력되는 다운믹스 신호의 토널리티를 측정하였고, 이 측정된 값을 통해 합성되는 잔향 성분의 비중을 조절해주었다. 실험은 시뮬레이션 결과를 통해 성능을 검증한 후에 주관적 음질 평가를 수행하였고, 전체적으로 음질 향상이 있음을 확인하였다.

Keywords

References

  1. T. Painter, A. Spanias, "Perceptual coding of digital audio," Proc. of IEEE, vol. 88, no. 4, pp. 451-515, Apr., 2000. https://doi.org/10.1109/5.842996
  2. ISO/IEC 11172-3, Coding of Moving Pictures and Associated Audio for Digital Storage Media up to about 1.5 Mbits/s (part 3: MPEG-Audio), August 1993.
  3. ISO/IEC 13818-7 Information Technology - Generic Coding of Moving Pictures and Associated Audio, Part 7: Adavanced Audio Coding, 1997.
  4. M. Wolters, K. Kjorling, D. Homm, and H. Purnhagen, "A closer look into MPEG-4 high efficiency AAC," in Proc. 115th AES Convention, New York, USA, October 2003.
  5. J. Breebaart, G. Hotho, J. Koppens, E. Schuijers, W. Oomen, and S. van de Par, "Background, Concept and Architecture for the Recent MPEG Suround Standard on Multichannel Audio Compression" J. Audio Eng. Soc. vol 55, pp. 331-351, 2007.
  6. J. Breebaart, S. van de Par, A. Kohlrausch and E. Schuijers, "Parametric Coding of Stereo Audio" EURASIP J. Appl. Signal Process., vol 9, pp. 1305-1322, 2004.
  7. G. Hotho, L. Villemoes, J. Breebaart, "A Backward-Compatible Multichannel Audio Codec," IEEE Trans. on Audio, Signal and Lang. Proc., Vol. 16, no. 1, pp. 83-93, Jan. 2008. https://doi.org/10.1109/TASL.2007.910768
  8. J.D. Johnston, "Transform coding of audio signal using perceptual noise criteria," IEEE J. on Sel. Areas in Comm., Vol. 6, no. 2, pp. 314-323, Feb., 1988. https://doi.org/10.1109/49.608
  9. H. Purnhagen: "Low Complexity Parametric Stereo Coding in MPEG-4," 7th Inter. Conf. on Audio Effects (DAFX-04), pp. 163-168, Naples, Italy, Oct., 2004.
  10. M. Neuendorf, et al., "Unified Speech and Audio Coding Scheme for High Quality at Low Bitrates," Proc. ICASSP, pp. 1-4, Taipei, Taiwan, Apr., 2009.
  11. Method for the Subjective Assessment of Intermediate Quality Level of Coding Systems 2003, ITU-R BS.1534-1.