http://dx.doi.org/10.5050/KSNVN.2008.18.8.823

Factors for Speech Signal Time Delay Estimation  

Kwon, Byoung-Ho (Department of Mechanical Engineering, KAIST)
Park, Young-Jin (Department of Mechanical Engineering, KAIST)
Park, Youn-Sik (Department of Mechanical Engineering, KAIST)
Publication Information
Transactions of the Korean Society for Noise and Vibration Engineering / v.18, no.8, 2008, pp. 823-831
Abstract
Because it requires only a light computational load and a small database, sound source localization based on the time delay of arrival (the TDOA method) is applied in many research fields, such as robot auditory systems and teleconferencing. Time delay estimation, the most important part of the TDOA method, has been studied broadly. However, studies of the factors that affect time delay estimation are insufficient, especially for applications in real environments. In 1997, Brandstein and Silverman reported that the performance of time delay estimation deteriorates as the reverberation time of the room increases. Even when the reverberation time of the room is the same, however, estimation performance differs depending on which part of the signal is used. To find the reason, we studied and analyzed the factors that affect time delay estimation using speech signals and room impulse responses. The results show that the performance of time delay estimation changes with the R/D ratio and the signal characteristics even when the reverberation time is the same. We also define a performance index (PI) that shows a tendency similar to that of the R/D ratio, and propose a method that uses the PI to improve the performance of time delay estimation.
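As a rough illustration of the time delay estimation step described in the abstract, the following minimal sketch estimates the delay between two microphone signals with generalized cross correlation. The PHAT weighting, the function name `gcc_phat_delay`, the 16 kHz sampling rate, and the synthetic test signal are assumptions made for this example; the abstract does not state which weighting or parameters the authors used.

```python
import numpy as np

def gcc_phat_delay(x, y, fs, max_delay=None):
    """Estimate the delay of y relative to x, in seconds.

    Uses generalized cross correlation with PHAT weighting
    (an assumed choice; other GCC weightings exist).
    """
    n = len(x) + len(y)                      # zero-pad to avoid circular wrap-around
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)
    cross = Y * np.conj(X)                   # cross-power spectrum
    cross /= np.abs(cross) + 1e-12           # PHAT: keep phase, discard magnitude
    cc = np.fft.irfft(cross, n=n)            # generalized cross correlation
    max_shift = n // 2
    if max_delay is not None:
        max_shift = max(1, min(int(fs * max_delay), max_shift))
    # Reorder so that lags run from -max_shift to +max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    lag = int(np.argmax(cc)) - max_shift     # lag (in samples) of the peak
    return lag / fs

# Hypothetical test signal: white noise delayed by 12 samples.
fs = 16000
rng = np.random.default_rng(0)
x = rng.standard_normal(4096)
true_lag = 12
y = np.concatenate((np.zeros(true_lag), x[:-true_lag]))
estimated_delay = gcc_phat_delay(x, y, fs)
```

In a reverberant room the correlation function contains additional peaks from reflections, which is one way the degradation mentioned in the abstract can appear in practice.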
Keywords
Time Delay Estimation; The Generalized Cross Correlation Method; Reverberation; R/D Ratio (reverberant-to-direct sound energy ratio)
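The R/D ratio named in the keywords can be illustrated from a room impulse response. This is only a sketch under stated assumptions: the direct sound is taken to be the strongest tap of the impulse response, everything within a short window around it (2.5 ms here) counts as direct energy, and the result is expressed in dB. The function name `rd_ratio_db` and the window length are hypothetical, and the paper's exact definition may differ.

```python
import numpy as np

def rd_ratio_db(h, fs, direct_window_ms=2.5):
    """Reverberant-to-direct energy ratio of impulse response h, in dB.

    Assumes the strongest tap is the direct path and that the direct
    sound lies within +/- direct_window_ms of that tap.
    """
    h = np.asarray(h, dtype=float)
    peak = int(np.argmax(np.abs(h)))                  # assumed direct-path arrival
    half = int(round(direct_window_ms * 1e-3 * fs))   # window half-width in samples
    start = max(0, peak - half)
    end = peak + half + 1
    direct_energy = np.sum(h[start:end] ** 2)
    reverberant_energy = np.sum(h ** 2) - direct_energy
    return 10.0 * np.log10(reverberant_energy / direct_energy)

# Hypothetical impulse response: a unit direct tap plus one weak reflection.
fs = 16000
h = np.zeros(1000)
h[10] = 1.0    # direct sound
h[500] = 0.1   # single reflection, well outside the direct window
rd_db = rd_ratio_db(h, fs)
```

Two rooms with the same reverberation time can give different values here, for example when the source-to-microphone distance changes the strength of the direct tap.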
References
1 Brandstein, M. S. and Silverman, H. F., 1997, "A Practical Methodology for Speech Source Localization with Microphone Arrays", Computer Speech and Language, Vol. 11, No. 2, pp. 91-126
2 Yoon, J.-R., Ro, Y.-J. and Kim, D.-H., 1998, "Effect of Noise Source Bandwidth on Noise Source Imaging Technique", Proceedings of the KSNVE Annual Autumn Conference, pp. 122-126
3 Chan, Y. T., So, H. C. and Ching, P. C., 1999, "Approximation Maximum Likelihood Delay Estimation via Orthogonal Wavelet Transform", IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. ASSP-47, No. 4, pp. 1193-1198
4 Brandstein, M. S. and Silverman, H. F., 1997, "A Robust Method for Speech Signal Time Delay Estimation in Reverberant Rooms", Acoustics, Speech and Signal Processing, Vol. 1, pp. 375-378
5 Ahn, S. T., 1999, "A Study on FIR Filter Design for Natural Sounding Artificial Reverberation", Master's Thesis, KAIST
6 Cabot, R. C., 1981, "A Note on the Application of the Hilbert Transform to Time Delay Estimation", IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. ASSP-29, No. 3, pp. 607-609
7 Vaccaro, R. J., Ramalingam, C. S. and Tufts, D. W., 1992, "Least-squares Time-delay Estimation for Transient Signals in a Multipath Environment", Journal of the Acoustical Society of America, Vol. 92, No. 1, pp. 210-218
8 Kwon, B. H., Park, Y. J. and Park, Y.-S., 2007, "Sound Source Localization Method Applied to Robot System", Proceedings of the KSNVE Annual Autumn Conference, KSNVE07A03-02
9 Hwang, S. M., Park, Y.-S. and Park, Y. G., 2005, "Detection of Speaker Position for Robot Using HRTF", Proceedings of the KSNVE Annual Autumn Conference, pp. 637-640
10 Sasaki, Y., Kagami, S. and Mizoguchi, H., 2006, "Multiple Sound Source Mapping for a Mobile Robot by Self-motion Triangulation", Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems
11 Wang, H. and Chu, P., 1997, "Voice Source Localization For Automatic Camera Pointing System In Videoconferencing", Acoustics, Speech, and Signal Processing, ICASSP-97, Vol. 1, pp. 187-190
12 Knapp, C. H. and Carter, G. C., 1976, "The Generalized Correlation Method for Estimation of Time Delay", IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. ASSP-24, No. 4, pp. 320-327