DOI QR코드

DOI QR Code

Efficient Individualization Method of HRTFs Using Critical-band Based Spectral Cue Control

  • Hur, Yoo-Mi (Dept. of Electrical and Electronic Engineering, Yonsei University) ;
  • Park, Young-Cheol (Computer and Telecommunications Eng. Division, Yonsei University) ;
  • Lee, Seok-Pil (Korea Electronic Technology Institute (KETI)) ;
  • Youn, Dae-Hee (Dept. of Electrical and Electronic Engineering, Yonsei University)
  • Received : 2011.01.19
  • Accepted : 2011.04.11
  • Published : 2011.05.31

Abstract

Recently, 3-D audio technologies are commonly implemented through headphones. A major problem of the headphone-based 3-D audio is in-the-head localization, which occurs due to the inaccurate Head-Related Transfer Function (HRTF). Since the individual measurements of HRTFs are impractical, there have been several researches for HRTF customization. In this paper, an efficient method of customizing HRTFs for the sound externalization is proposed. Firstly, it is determined which part will be customized in HRTF through psychoacoustical experiments. Then, the method controlling spectral notches and envelopes to provide individual localization cues are described. Since the proposed method is based on a critical-band rate, the structure is much simpler than that of previous studies, but still effective. The performance was evaluated through a series of subjective tests, and the results confirmed that the customized HRTF using proposed method could replace the measured individual HRTF successfully.

Keywords

References

  1. C.-J. Tan, W.-s. Gan, "Direct concha excitation for the introduction of individualized hearing cues," J. Audio Eng. Soc., vol. 48, no. 7/8, pp. 642-653, July, 2000.
  2. D. R. Begault, "Auditory and non-auditory factors that potentially influence virtual acoustic imagery," AES 16th International Conference, paper no. 16-002, Mar, 1999.
  3. J. Blauert, Spatial Hearing, MIT Press, Cambridge, 1983.
  4. D. M. Green, "Psychoacoustics," CHABA Symposium on Sound Localization by Humans, National Academy of Sciences (unpublished ), 1988.
  5. W. M. Hartmann, A. Wittenberg, "On the externalization of sound images," J. Acoust. Soc. Am., vol. 99, no. 6, pp. 3678-3688, June, 1996. https://doi.org/10.1121/1.414965
  6. S. Shimada, N. Hayashi, and S. Hayashi, "A clustering method for sound localization transfer function," J. Audio Eng. Soc., vol. 42, no. 7/8, pp. 577-584, July, 1994.
  7. Y. Kahana, P. A. Nelson, M. Petyt, and S. Choi, "Numerical modeling of the transfer functions of a dummy-head and of the external ear," AES 16th International Conference, paper no. 16-029, Mar, 1999.
  8. M. J. Evans, J. A. S. Angus, and A. I. Tew, "Spherical harmonic spectra of head-related transfer functions," AES 103rd Convention, paper no. 4571, Sep, 1997.
  9. P. Rubak, "Headphone signal processing system for out-of-head localization," AES 90th Convention, paper no. 3063, Feb, 1991.
  10. Weinrich, S. Gert, "Improved externalization and frontal perception of headphone signals," AES 92nd Convention, paper no. 3291, Mar, 1992.
  11. F. E. Toole, "In-head localization of acoustic images," J. Acoust. Soc. Am., vol. 48, no. 4B, pp. 943-949, 1970. https://doi.org/10.1121/1.1912233
  12. L. Wightman, D. J. Kistler, "Headphone simulation of freefield listening I : Stimulus synthesis," J. Acoust. Soc. Am., vol. 85, no. 2, pp. 858-867, Feb, 1989. https://doi.org/10.1121/1.397557
  13. N. Sakamoto, T. Gotoh, Y. Kimura, "On out-of-head localization in headphone listening," J. Audio Eng. Soc., vol. 24, no. 9, pp. 710-716, Nov, 1976.
  14. G. S. Kendall, "The decorrelation of audio signals and its impact on spatial imagery," Computer Music Journal, vol. 19, no. 4, pp. 71-87, 1995. https://doi.org/10.2307/3680992
  15. D. R. Begault, E. M. Wenzel, A. S. Lee, and M. R. Anderson, "Direct comparison of the impact of head tracking, reverberation, and individualized Head-Related Transfer Functions on the spatial perception of a virtual speech source," AES 108th Convention, paper no. 5134, Feb, 2000.
  16. D. R. Begault, "Perceptual effects of synthetic reverberation on 3-dimensional audio systems," J. Audio Eng. Soc., vol. 40, no. 11, pp. 895-904, Nov, 1992.
  17. P. Satarzadeh, V. R. Algazi, and R. O. Duda, "Physical and filter pinna models based on anthropometry," AES 122nd Convention, paper no. 7098, May, 2007.
  18. D. Griesinger, "Binaural techniques for music reproduction," AES 8th International Conference, paper no. 8-026, May, 1990.
  19. A. J. Watkins, "Psychoacoustical aspects of synthesized vertical locale cues," J. Acoust. Soc. Am., vol. 63, no. 4, pp. 1152-1165, Apr, 1978. https://doi.org/10.1121/1.381823
  20. V. C. Raykar, R. Duraiswami, B. Yenanarayana, "Extracting frequencies of the pinna spectral notches in measured head related impulse responses," Perceptual Interfaces and Reality Lab., University of Maryland,Technical report CS-TR-4609, 2004.
  21. B. C. J. Moore, S. R. Oldfield, and G. J. Dooley, "Detection and discrimination of spectral peaks and notches at 1 and 8 kHz," J. Acoust. Soc. Am., vol. 85, no. 2, pp. 820-836, 1989. https://doi.org/10.1121/1.397554