[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.6109/jkiice.2010.14.3.613

A Phase-related Feature Extraction Method for Robust Speaker Verification

Kwon, Chul-Hong (대전대학교 정보통신공학과)

Publication Information

Journal of the Korea Institute of Information and Communication Engineering / v.14, no.3, 2010 , pp. 613-620 More about this Journal

Abstract

Additive noise and channel distortion strongly degrade the performance of speaker verification systems, as it introduces distortion of the features of speech. This distortion causes a mismatch between the training and recognition conditions such that acoustic models trained with clean speech do not model noisy and channel distorted speech accurately. This paper presents a phase-related feature extraction method in order to improve the robustness of the speaker verification systems. The instantaneous frequency is computed from the phase of speech signals and features from the histogram of the instantaneous frequency are obtained. Experimental results show that the proposed technique offers significant improvements over the standard techniques in both clean and adverse testing environments.

Keywords

speaker verification; robustness; additive noise; channel mismatch; phase-related feature extraction;

Citations & Related Records

Reference

1	D.S. Kim, "Perceptual Phase Redundancy in Speech," Proc. ICASSP, pp. 1383-1386, 2000.
2	H.A. Murthy and V. Gadde, "The Modified Group Delay Function and its Application to Phoneme Recognition," Proc. ICASSP, pp. 68-71,2003.
3	P. Maragos, J.F. Kaiser and T.F. Quatieri, "Energy Separation in Signal Modulations with Application to Speech Analysis," IEEE Trans. on Signal Processing, vol. 41, pp. 3024-3051, 1993. DOI ScienceOn
4	D.A. Reynolds, T.F. Quatieri and R.B. Dunn, "Speaker Verification Using Adapted Gaussian Mixture Models," Digital Signal Processing, vol. 10, pp. 19-41, 2000. DOI ScienceOn
5	Noisex-92, http://www.speech.cs.cmu.edu/comp. speech/Sectionl/Datajnoisex.html.
6	R.J. Mammone, X. Zhang and R.P. Ramachandran, "Robust Speaker Recognition : a Feature-based Approach," IEEE Signal Processing Magazine, pp. 58-70, 1996.
7	J. Ortega-Garcia and J. Gonzalez-Rodriguez, "Overview of Speech Enhancement Techniques for Automatic Speaker Recognition," IEEE Trans. Speech and Audio Processing, pp. 929-932,1996.
8	J.M. Naik. "Speaker Verification," IEEE Communication Magazine, pp. 42-49, 1990.
9	L.R Rabiner and R.W. Schafer, Discrete-time Speech Signal Processing, Principles and Practice, Prentice Hall, NJ, 1978.
10	H. Pobloth and W.B. Kleijn, "On Phase Prception in Speech," Proc. ICASSP, pp. 29-32, 1999.
11	J. Campbell, "Speaker Recognition: a Tutorial," Proc. IEEE, vol. 85, pp. 1437-1462, 1997. DOI ScienceOn
12	D.A Reynolds and R.C. Rose, "Robust Text-independent Speaker Identification Using Gaussian Mixture Speaker Models," IEEE Trans. Speech and Audio Processing, vol. 3, no. 1, pp. 72-83, 1995. DOI ScienceOn

KSCI

A Phase-related Feature Extraction Method for Robust Speaker Verification 열악한 환경에 강인한 화자인증을 위한 위상 기반 특징 추출 기법

A Phase-related Feature Extraction Method for Robust Speaker Verification