Browse > Article
http://dx.doi.org/10.9728/dcs.2016.17.2.71

On the speaker's position estimation using TDOA algorithm in vehicle environments  

Lee, Sang-Hun (Saman Corporation)
Choi, Hong-Sub (Dept. of Electronic Eng., Daejin Univ.)
Publication Information
Journal of Digital Contents Society / v.17, no.2, 2016 , pp. 71-79 More about this Journal
Abstract
This study is intended to compare the performances of sound source localization methods used for stable automobile control by improving voice recognition rate in automobile environment and suggest how to improve their performances. Generally, sound source location estimation methods employ the TDOA algorithm, and there are two ways for it; one is to use a cross correlation function in the time domain, and the other is GCC-PHAT calculated in the frequency domain. Among these ways, GCC-PHAT is known to have stronger characteristics against echo and noise than the cross correlation function. This study compared the performances of the two methods above in automobile environment full of echo and vibration noise and suggested the use of a median filter additionally. We found that median filter helps both estimation methods have good performances and variance values to be decreased. According to the experimental results, there is almost no difference in the two methods' performances in the experiment using voice; however, using the signal of a song, GCC-PHAT is 10% more excellent than the cross correlation function in terms of the recognition rate. Also, when the median filter was added, the cross correlation function's recognition rate could be improved up to 11%. And in regarding to variance values, both methods showed stable performances.
Keywords
GCC-PHAT; median filter; sound source localization; TDOA;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 S. H. Kim, J. Y. Ahn, "Speech Recognition System in Car Noise Environment", DMC, vol 10, No 1, 121-127, 2009.
2 J. W. Choi, H. S. Park, K. H. Kim, "The Human Vehicle Interface System for Integrating and Managing the In-Vehicle Interactions with IT Devices," JESK. vol 30, No.5, 651-657, 2011.
3 S. M. Hwang, Y. J. Park, "Sound Source Localization Using HRTF Database," KAIST, 2005.
4 Y. J. Park, M. W. Lee, S. H. Min, Y. H. Han, "Sound Localization based on LP Residual and Hilbert Transform for Intelligent Robot," KIISS vol. 18, No 2 ,2008.
5 W. Rhee, J. S. Choi, "Experimental Studies for Noise Source Positioning Using TDOA Algorithm," KSNVE, 2006.
6 B. Jang, D. Y. Sim, C. D. Kim, C. B. Lee, K. H. Cga, "A Study for Estimating Absolute Position od Sound Source Using Reference Microphone and Beamforming Method," KSPC ,1997.
7 .C. H. Knapp, G. C. Carter, "The generalized correlation method for estimation of time delay," IEEE vol. ASSP-24, 320-327, 1976.
8 . B. C. Park, K. D. Ban, K. C. Kwak, H. S. Yoon, "Performance analysis of GCC-PHAT-based sound source localization for intelligent robots," KROS, 2007.