Robust Speech Detection Based on Useful Bands for Continuous Digit Speech over Telephone Networks

  • Ji, Mi-Kyongi (School of Engineering, Information and Communications University) ;
  • Suh, Young-Joo (School of Engineering, Information and Communications University) ;
  • Kim, Hoi-Rin (School of Engineering, Information and Communications University) ;
  • Kim, Sang-Hun (Speech Information Technology Center, ETRI)
  • 발행 : 2003.09.01

초록

One of the most important problems in speech recognition is to detect the presence of speech in adverse environments. In other words, the accurate detection of speech boundary is critical to the performance of speech recognition. Furthermore the speech detection problem becomes severer when recognition systems are used over the telephone network, especially wireless network and noisy environment. Therefore this paper describes various speech detection algorithms for continuous digit recognition system used over wire/wireless telephone networks and we propose a algorithm in order to improve the robustness of speech detection using useful band selection under noisy telephone networks. In this paper, we compare some speech detection algorithms with the proposed one, and present experimental results done with various SNRs. The results show that the new algorithm outperforms the other speech detection methods.

키워드

참고문헌

  1. L. R. Rabiner and M. R. Sambur, 'An algorithm for determining the end-points of isolated utterances,' Bell Syst. Tech. J., 54, 297-315, Feb. 1975
  2. M. H. Savoji, 'A robust algorithm for accurate endpointing of speech,' Speech Commun., 8, 45-60, 1989 https://doi.org/10.1016/0167-6393(89)90067-8
  3. L. Lamel, L. Rabiner, A. Rosenberg, and J. Wilpon, 'An improved endpoint detector for isolated word recognition,' IEEE ASSP Mag., 29, 777-785, 1981 https://doi.org/10.1109/TASSP.1981.1163642
  4. B. Reaves,'"Comments on an improved endpoint detector for isolated word recognition,' IEEE Trans., Signal Processing, 39, 526-527, Feb. 1991 https://doi.org/10.1109/78.80847
  5. J. C. Junqua, B. Mak, and B. Reaves, 'A robust algorithm for word boundary detection in the presence of noise,' IEEE Trans. Speech Audio Processing, 2, 406-412, July 1994 https://doi.org/10.1109/89.294354
  6. J. B. Allen, 'Cochlear modeling,' IEEE Acoust., Speech, Signal Processing Mag., 2, 3-29, 1985
  7. J. L. Shen, J. W. Hung, L. S. Lee, 'Robust entropybased endpoint detection for speech recognition in noisy environments,' Proc. Int. Cont. on Spoken Lang. Processing, 3, 1015, Sydney, 1998
  8. J. G. Wilpon and L. R. Rabiner, 'Application of hidden Markov models to automatic speech endpoint detection,' Computer Speech and Language, 2, 321-341, 1987 https://doi.org/10.1016/0885-2308(87)90015-5