Browse > Article
http://dx.doi.org/10.6109/jkiice.2010.14.3.613

A Phase-related Feature Extraction Method for Robust Speaker Verification  

Kwon, Chul-Hong (대전대학교 정보통신공학과)
Abstract
Additive noise and channel distortion strongly degrade the performance of speaker verification systems, as it introduces distortion of the features of speech. This distortion causes a mismatch between the training and recognition conditions such that acoustic models trained with clean speech do not model noisy and channel distorted speech accurately. This paper presents a phase-related feature extraction method in order to improve the robustness of the speaker verification systems. The instantaneous frequency is computed from the phase of speech signals and features from the histogram of the instantaneous frequency are obtained. Experimental results show that the proposed technique offers significant improvements over the standard techniques in both clean and adverse testing environments.
Keywords
speaker verification; robustness; additive noise; channel mismatch; phase-related feature extraction;
Citations & Related Records
연도 인용수 순위
  • Reference
1 D.S. Kim, "Perceptual Phase Redundancy in Speech," Proc. ICASSP, pp. 1383-1386, 2000.
2 H.A. Murthy and V. Gadde, "The Modified Group Delay Function and its Application to Phoneme Recognition," Proc. ICASSP, pp. 68-71,2003.
3 P. Maragos, J.F. Kaiser and T.F. Quatieri, "Energy Separation in Signal Modulations with Application to Speech Analysis," IEEE Trans. on Signal Processing, vol. 41, pp. 3024-3051, 1993.   DOI   ScienceOn
4 D.A. Reynolds, T.F. Quatieri and R.B. Dunn, "Speaker Verification Using Adapted Gaussian Mixture Models," Digital Signal Processing, vol. 10, pp. 19-41, 2000.   DOI   ScienceOn
5 Noisex-92, http://www.speech.cs.cmu.edu/comp. speech/Sectionl/Datajnoisex.html.
6 R.J. Mammone, X. Zhang and R.P. Ramachandran, "Robust Speaker Recognition : a Feature-based Approach," IEEE Signal Processing Magazine, pp. 58-70, 1996.
7 J. Ortega-Garcia and J. Gonzalez-Rodriguez, "Overview of Speech Enhancement Techniques for Automatic Speaker Recognition," IEEE Trans. Speech and Audio Processing, pp. 929-932,1996.
8 J.M. Naik. "Speaker Verification," IEEE Communication Magazine, pp. 42-49, 1990.
9 L.R Rabiner and R.W. Schafer, Discrete-time Speech Signal Processing, Principles and Practice, Prentice Hall, NJ, 1978.
10 H. Pobloth and W.B. Kleijn, "On Phase Prception in Speech," Proc. ICASSP, pp. 29-32, 1999.
11 J. Campbell, "Speaker Recognition: a Tutorial," Proc. IEEE, vol. 85, pp. 1437-1462, 1997.   DOI   ScienceOn
12 D.A Reynolds and R.C. Rose, "Robust Text-independent Speaker Identification Using Gaussian Mixture Speaker Models," IEEE Trans. Speech and Audio Processing, vol. 3, no. 1, pp. 72-83, 1995.   DOI   ScienceOn