DOI QR코드

DOI QR Code

음성 신호를 이용한 시간지연 추정에 미치는 영향들에 관한 연구

Factors for Speech Signal Time Delay Estimation

  • 권병호 (한국과학기술원 기계공학과) ;
  • 박영진 (한국과학기술원 기계공학과) ;
  • 박윤식 (한국과학기술원 기계공학과)
  • 발행 : 2008.08.20

초록

Since it needs the light computational load and small database, sound source localization method using time delay of arrival(TDOA method) is applied at many research fields such as a robot auditory system, teleconferencing and so on. Researches for time delay estimation, which is the most important thing of TDOA method, had been studied broadly. However studies about factors for time delay estimation are insufficient, especially in case of real environment application. In 1997, Brandstein and Silverman announced that performance of time delay estimation deteriorates as reverberant time of room increases. Even though reverberant time of room is same, performance of estimation is different as the specific part of signals. In order to know that reason, we studied and analyzed the factors for time delay estimation using speech signal and room impulse response. In result, we can know that performance of time delay estimation is changed by different R/D ratio and signal characteristics in spite of same reverberant time. Also, we define the performance index(PI) to show a similar tendency to R/D ratio, and propose the method to improve the performance of time delay estimation with PI.

키워드

참고문헌

  1. Hwang, S. M., Park, Y.-S. and Park, Y. G., 2005, "Detection of Speaker Position for Robot Using HRTF", Proceedings of the KSNVE Annual Autumn Conference, pp. 637-640
  2. Sasaki, Y., Kagami, S. and Mizoguchi, H., 2006, "Multiple Sound Source Mapping for a Mobile Robot by Self-motion Triangulation", Proceeding of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems
  3. Brandstein, M. S. and Silverman, H. F., 1997, "A Practical Methodology for Speech Source Localization with Microphone Arrays", Computer Speech and Language, Vol. 11, No. 2, pp. 91-126 https://doi.org/10.1006/csla.1996.0024
  4. Kwon, B. H., Park, Y. J. and Park, Y.-S., 2007, "Sound Source Localization Method Applied to Robot System", Proceedings of the KSNVE Annual Autumn Conference, KSNVE07A03-02
  5. Wang, H. and Chu, P., 1997, "Voice Source Localization For Automatic Camera Pointing System In Videoconferencing", Acoustics, Speech, and Signal Processing, ICASSP-97, Vol. 1, pp. 187-190 https://doi.org/10.1109/ICASSP.1997.599595
  6. Knapp, C. H. and Carter, G. C., 1976, "The Generalized Correlation Method for Estimation of Time Delay", IEEE Trans. on Acoustics, Speech and Signal Processing, Vol. AS-24, No. 4, pp. 320-327
  7. Vaccaro, R. J., Ramalingam, C. S. and Tufts, D. W., 1992, "Least-squares Time-delay Estimation for Transient Signals in a Multipath Environment", Journal of Acoustic Society of America, Vol. 92, No. 1, pp. 210-218 https://doi.org/10.1121/1.404285
  8. Cabot, R. C., 1981, "A Note on the Application of the Hilbert Transform to Time Delay Estimation", IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. ASSP-29, No. 3, pp. 607-609
  9. Chan, Y. T., So, H. C. and Ching, P. C., 1999, "Approximation Maximum Likelihood Delay Estimation via Orthogonal Wavelet Trnasform", IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. ASSP-47, No. 4, pp. 1193-1198
  10. Brandstein, M. S. and Silverman, H. F., 1997, "A Robust Method for Speech Signal Time Delay Estimation in Reverberant Rooms", Acoustics, Speech and Signal Processing, Vol. 1, pp. 375-378
  11. Ahn, S. T., 1999, "A Study on FIR Filter Design for Natural Sounding Artificial Reverberation", Master's Thesis, KAIST
  12. Yoon, J.-R., Ro, Y.-J. and Kim, D.-H., 1998, "Effect of Noise Source Bandwidth on Noise Source Imaging Technique," Proceedings of the KSNVE Annual Autumn Conference, pp. 122-126