http://dx.doi.org/10.7776/ASK.2012.31.3.161

Robust End Point Detection for Robot Speech Recognition Using Double Talk Detection  

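Moon, Sung-Kyu (Interdisciplinary Program in Visual Information Processing, Korea University)
Park, Jin-Soo (School of Electronics, Electrical and Radio Engineering, Korea University)
Ko, Han-Seok (School of Electronics, Electrical and Radio Engineering, Korea University)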
Abstract
This paper presents a robust speech end-point detector that uses double talk detection for a speech recognition robot operating under echoic conditions. The proposed method combines the result of a conventional end-point detector with the result of a double talk detector. We tested the proposed method in an isolated word recognition system in an echoic environment. The proposed algorithm outperformed the available techniques by 30% in speech recognition rate.
Keywords
Double talk detector; Speech end-point detector; Speech recognition robot; Acoustic echo cancellation;
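The abstract states only that the end-point detector result and the double talk detector result are combined; it does not say how. As a rough illustration, the sketch below assumes a frame-wise logical AND, uses a short-time energy threshold as a stand-in for the conventional end-point detector, and uses a Geigel-style level comparison as a stand-in for the double talk detector. All function names, parameters, and thresholds are hypothetical and are not taken from the paper.

```python
from typing import List, Optional, Sequence, Tuple


def energy_epd(mic: Sequence[float], frame_len: int, threshold: float) -> List[bool]:
    """Stand-in conventional EPD: flag frames whose mean energy exceeds a threshold."""
    flags = []
    for start in range(0, len(mic) - frame_len + 1, frame_len):
        frame = mic[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        flags.append(energy > threshold)
    return flags


def geigel_dtd(mic: Sequence[float], far: Sequence[float], frame_len: int,
               history: int, ratio: float = 0.5) -> List[bool]:
    """Geigel-style detector (assumed here, not the paper's): flag near-end speech
    when the microphone peak exceeds ratio * the recent far-end (loudspeaker) peak."""
    flags = []
    for start in range(0, len(mic) - frame_len + 1, frame_len):
        end = start + frame_len
        lo = max(0, end - history)  # look back over the last `history` far-end samples
        far_peak = max(abs(x) for x in far[lo:end])
        mic_peak = max(abs(x) for x in mic[start:end])
        flags.append(mic_peak > ratio * far_peak)
    return flags


def endpoints(flags: Sequence[bool]) -> Optional[Tuple[int, int]]:
    """Return (first, last) active frame indices, or None if no frame is active."""
    active = [i for i, f in enumerate(flags) if f]
    return (active[0], active[-1]) if active else None


def detect_user_speech(mic: Sequence[float], far: Sequence[float], frame_len: int,
                       energy_threshold: float, history: int) -> Optional[Tuple[int, int]]:
    """Assumed combination rule: a frame counts as user speech only if the EPD
    and the double talk detector both flag it (frame-wise AND)."""
    epd = energy_epd(mic, frame_len, energy_threshold)
    dtd = geigel_dtd(mic, far, frame_len, history)
    return endpoints([e and d for e, d in zip(epd, dtd)])
```

Under this assumed rule, frames that contain only the robot's own loudspeaker echo can pass the energy threshold but fail the level comparison, so they no longer shift the detected start or end of the user's utterance.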