A Sound Externalization Method for Realistic Audio Rendering in a Headphone Listening Environment

헤드폰 청취환경에서의 실감 오디오 재현을 위한 음상 외재화 기법

  • Kim, Yong-Guk (School of Information and Communications, Gwangju Institute of Science and Technology) ;
  • Chun, Chan-Jun (School of Information and Communications, Gwangju Institute of Science and Technology) ;
  • Kim, Hong-Kook (School of Information and Communications, Gwangju Institute of Science and Technology) ;
  • Lee, Yong-Ju (Realistic Acoustics Research Team, Electronics and Telecommunications Research) ;
  • Jang, Dae-Young (Realistic Acoustics Research Team, Electronics and Telecommunications Research) ;
  • Kang, Kyeong-Ok (Realistic Acoustics Research Team, Electronics and Telecommunications Research)
  • 김용국 (광주과학기술원 정보통신공학부) ;
  • 전찬준 (광주과학기술원 정보통신공학부) ;
  • 김홍국 (광주과학기술원 정보통신공학부) ;
  • 이용주 (한국전자통신연구원 실감음향연구팀) ;
  • 장대영 (한국전자통신연구원 실감음향연구팀) ;
  • 강경옥 (한국전자통신연구원 실감음향연구팀)
  • Received : 2010.06.19
  • Published : 2010.09.25

Abstract

In this paper, a sound externalization method is proposed for out-of-the-head localization in a headphone listening environment. In order to reduce timbre distortion by the conventional methods using a measured a head-related transfer function (HRTF) or early reflections, the proposed method integrates a model-based HRTF with reverberation. In addition, for improving frontal externalization performance, techniques such as decorrelation and spectral notch filtering are included. To evaluate the performance of the proposed externalization method, subjective listening tests are conducted by using different types of sound sources such as white noise, sound effects, speech, and music. It is shown from the test results that the proposed externalization method can localize sound sources farther away from out of the head than the conventional method.

본 논문에서는 헤드폰 재생 환경에서의 머리 밖 음상정위를 위한 음상 외재화(externalization) 기법을 제안한다. 제안된 기법에서는 기존의 머리전달함수(HRTF) 또는 초기 반사음 등을 이용한 외재화 기법들에서 발생하는 정위된 음성의 음색 왜곡을 줄이는 것에 그 초점을 맞춘다. 즉, 제안된 음상 외재화 기법은 모델 기반의 HRTF와 잔향 기법을 결합하고, 전방 음상 외재화의 성능 향상을 위하여 decorrelation 및 spectral notch 필터링 기법 등을 포함한다. 제안된 음상 외재화 기법의 성능을 평가하기 위하여 백색잡음, 효과음, 음성 및 오디오 등 다양한 장르의 음원을 이용하여, 평가자의 주관에 의한 청취평가를 수행하였다. 제안된 음상 외재화 알고리즘은 성능평가 결과에서 기존의 방법에 비해 더 좋은 외재화 거리 성능을 보였다.

Keywords

References

  1. D. R. Begault, 3-D Sound for Virtual Reality and Multimedia, Academic Press, Cambridge, MA, 1994.
  2. D. J. M. Robinson and R. G. Greenwood, "A binaural simulation which renders out-ofhead- localisation with low-cost digital signal processing of head-related transfer functions and pseudo reverberation," in Proc. of 104th AES Convention, Amsterdam, Netherlands, Preprint 4723, May 1998.
  3. 서정일, 이용주, 장인선, 유재현, 강경옥, "청취환경 차이에 따른 3차원 오디오 기술 개발 동향," 한국방송공학회지, 제13권, 제1호, pp. 82-96, 2008 년 3월.
  4. P. Rubak, "Headphone signal processing system for out-of-the-head localization," in Proc. of 90th AES Convention, Paris, France, Preprint 3063, Feb. 1991.
  5. T. Choi, Y. C. Park, and D. H. Youn, "Efficient out of head localization system for mobile applications," in Proc. of 120th AES Convention, Paris, France, Preprint 6758, May 2006.
  6. Y. G. Kim, H. K. Kim, Y. J. Lee, D. Y. Jang, I. Jang, and K. Kang, "A sound externalization algorithm based on a structural head-related transfer function model and reverberation," in Proc. of International Technical Conference on Circuits/ Systems, Computers and Communications, Jeju island, Korea, pp. 1316-1319, July 2009.
  7. J. Blauert, Spatial Hearing, MIT Press, 1997.
  8. 강성훈, 강경옥, 입체 음향, 기전연구사, 1997년 7월.
  9. W. G. Gardner, 3-D Audio Using Loudspeakers, Kluwers Academic Publishers, 1998.
  10. D. J. Kistler and F. L. Wightman, "A model of head-related transfer functions based on principal components analysis and minimumphase reconstruction," J. Acoust. Soc. Am., vol. 91, no. 3, pp. 1637-1647, Mar. 1992. https://doi.org/10.1121/1.402444
  11. C. P. Brown and R. O. Duda, "An efficient HRTF model for 3-D sound," in Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, pp. 298-301, Oct. 1997.
  12. J. A. Moorer, "About this reverberation business," Computer Music Journal, vol. 3, no. 2, pp. 13-28, June 1979. https://doi.org/10.2307/3680280
  13. J. B. Allen and D. A. Berkley, "Image method for efficiently simulating small-room acoustics," J. Acoust. Soc. Am., vol. 65, no. 4, pp. 943-951, Apr. 1979. https://doi.org/10.1121/1.382599
  14. G. S. Kendall, "The decorrelation of audio signals and its impact on spatial imagery," Computer Music Journal, vol. 19, no. 4, pp. 71-87, Winter 1995. https://doi.org/10.2307/3680992
  15. V. R. Algazi, R. O. Duda, D. M. Thompson, and C. Avendano, "The CIPIC HRTF database," in Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Electroacoustics, New Paltz, NY, pp. 99-102, Oct. 2001.
  16. W. G. Gardner and K. D. Martin, "HRTF measurements of a KEMAR," J. Acoust. Soc. Am., vol. 97, no. 6, pp. 3907-3908, June 1995. https://doi.org/10.1121/1.412407