http://dx.doi.org/10.7776/ASK.2019.38.6.637

Analyses on limitations of binaural sound based on the first order Ambisonics for virtual reality audio  

Chang, Ji-Ho (Division of Physical Metrology, Korea Research Institute of Standards and Science)
Cho, Wan-Ho (Division of Physical Metrology, Korea Research Institute of Standards and Science)
Abstract
This paper analyzes the limitations of binaural sound reproduced over headphones based on Ambisonics for Virtual Reality (VR) audio. VR audio can be delivered as binaural sound that compensates for the head rotation of a listener. Ambisonics is widely used to record and reproduce the ambient sound field around a listener in VR audio, and First Order Ambisonics (FOA) is still used for VR audio because of its simplicity. However, the maximum frequency attainable with this order is too low to reproduce the ear signals perfectly, and thus the binaural reproduction has inherent limitations in terms of spectrum and sound localization. This paper investigates these limitations by comparing the signals arriving at the ear positions in the reference field and in the reproduced field. An incident wave is defined as the reference field and is reproduced over virtual loudspeakers. Frequency responses, inter-aural level differences, and inter-aural phase differences are compared. The results show that, in general, above the maximum cutoff frequency the reproduced levels decrease and horizontal localization is provided only around the forward direction.
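To give a rough sense of why the maximum frequency of FOA is considered low, the sketch below applies a commonly quoted rule of thumb for Ambisonics, k r ≈ N, where N is the order, r the radius of the region to be reproduced, and k the wavenumber. This is an illustration under stated assumptions, not necessarily the criterion used in the paper: the function name, the head radius of 0.0875 m, and the speed of sound of 343 m/s are all assumed values chosen for the example.

```python
import math


def ambisonics_max_frequency(order: int, radius_m: float,
                             speed_of_sound: float = 343.0) -> float:
    """Rule-of-thumb upper frequency (Hz) from k * r = N.

    With k = 2 * pi * f / c, the condition k * r = N gives
    f = N * c / (2 * pi * r). This is an approximation, not an exact limit.
    """
    if order < 0:
        raise ValueError("order must be non-negative")
    if radius_m <= 0:
        raise ValueError("radius_m must be positive")
    return order * speed_of_sound / (2.0 * math.pi * radius_m)


# Assumed head radius; a typical textbook value, not taken from the paper.
HEAD_RADIUS_M = 0.0875

if __name__ == "__main__":
    for order in (1, 3, 5):
        f_max = ambisonics_max_frequency(order, HEAD_RADIUS_M)
        print(f"order {order}: about {f_max:.0f} Hz")
```

Under these assumptions the first-order estimate falls in the range of several hundred hertz, well below the upper end of the audible band, which is consistent with the abstract's statement that ear signals cannot be reproduced perfectly at higher frequencies.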
Keywords
Binaural reproduction; First order Ambisonics; Virtual reality audio; Inter-aural level difference