Browse > Article

Development of a Listener Position Adaptive Real-Time Sound Reproduction System  

Lee, Ki-Seung (건국대학교 정보통신대학 전자공학부)
Lee, Seok-Pil (전자부품연구원 방송통신융합 연구센터)
Abstract
In this paper, a new audio reproduction system was developed in which the cross-talk signals would be reasonably cancelled at an arbitrary listener position. To adaptively remove the cross-talk signals according to the listener's position, a method of tracking the listener position was employed. This was achieved using the two microphones, where the listener direction was estimated using the time-delay between the two signals from the two microphones, respectively. Moreover, room reverberation effects were taken into consideration where linear prediction analysis was involved. To remove the cross-talk signals at the left-and right-ears, the paths between the sources and the ears were represented using the KEMAR head-related transfer functions (HRTFs) which were measured from the artificial dummy head. To evaluate the usefulness of the proposed listener tracking system, the performance of cross-talk cancellation was evaluated at the estimated listener positions. The performance was evaluated in terms of the channel separation ration (CSR), a -10 dB of CSR was experimentally achieved although the listener positions were more or less deviated. A real-time system was implemented using a floating-point digital signal processor (DSP). It was confirmed that the average errors of the listener direction was 5 degree and the subjects indicated that 80 % of the stimuli was perceived as the correct directions.
Keywords
Cross-talk cancellation; direction of arrival (DOA) estimation; real-time implementation;
Citations & Related Records
연도 인용수 순위
  • Reference
1 O. Kirkeby, P. A. Nelson and H. Hamada, "Fast deconvolution of multichannel systems usng regularization", IEEE Trans. Speech and Audio Process., vol. 6, no. 2, pp. 189-194, 1998.   DOI   ScienceOn
2 N. Sakamoto, T. Gotoh, T. Kogure and M. Shimbo, "Controlling soundimage localization in stereophonic reproduction", J. Audio Eng. Soc., vol. 29, no. 11, pp. 794-799, 1981.
3 B. Gardner and K. Martin, KEMAR HRTF data, ftp://sound. media.mit.edu/ pub/Data/KEMAR (last viewed 8/21/08), 1994.
4 M. Jian, A. C. Kot and M. H. Er, "Performance study of time delay estimation in a room enviroment", Proc. IEEE ICASSP, pp. V554-V557, 1998.
5 C. H. Knapp ad G. C. Carter, "The generalized correlation method for estimation of time delay", IEEE Trans. Acoust., Speech, Singal Process., vol. 24, no. 4, pp. 320-327, 1976.   DOI
6 J. R. Hopgood and P. J. W. Rayner, "A probabilistic framework for subband autoregressive model applied to room acoustics", Proc. IEEE Signal Processing Workshop on Statistical Signal Processing, pp. 492-495, 2001.
7 TMS320C6713 Floating-point digital signal processor, Texas Instruments, Nov., 2002.
8 TMS320C6000 Programmer's Guide, Texas Instruments, Aug., 2002.
9 J. J. Lopez, F. Orduna and A. Gonzalez, "Modeling and measurement of cross-talk cancellation zones for small displacements of the listener in transaural sound reproduction with different loudspeaker arrangements", AES 109th Convention, 2000.
10 M. R. Bai and C.-C. Lee, "Comprehensive analysis of loudspeaker span effects on crosstalk cancellation in spatial sound reproduction", AES 120th Convention, 2006.
11 J. Rose , P. Nelson, B. Rafaely and T. Takeuchi, "Sweet spot size of virtual acoustic imaging systems at asymmetric listener locations," J. Acoust. Soc. Am., vol. 112, no. 5, pp. 1992-2002, 2002.   DOI   ScienceOn
12 C. Kyriakakis, T. Holman, J.-S. Lim, H. Hong and H. Neven, "Signal processing, acoustics, and psychoacoustics for high quality desktop audio," J. Visual Comm. and Image Rep., vol. 8, no. 1, pp. 51-61, 1998.
13 P. G. Georgiou, A. Mouchtaris, S. I. Roumeliotis and C. Kyriakakis, "Immersive sound rendering using laser-based tracking", AES 109th Convention, 2000.
14 S. Kim, S. Jang, D. Kong and S. Bang, "Adaptive virtual surround sound rendering method for an arbitrary listening position", AES 30th Int. Conference, 2007.
15 S. Merchel and S. Groth, "Analysis and implementation of a stereophonic play back system for adjusting the sweet spot to the listener's position", AES 126th Convention, 2009.