Browse > Article

Isolated-Word Speech Recognition in Telephone Environment Using Perceptual Auditory Characteristic  

Choi, Hyung-Ki (Dept. of Electronic Eng. Chonbuk National University)
Park, Ki-Young (Dept. of Information & Communication Eng. Jeonju Technical College)
Kim, Chong-Kyo (Dept. of Electrical & Control Eng. Chonbuk National University)
Publication Information
Abstract
In this paper, we propose GFCC(gammatone filter frequency cepstrum coefficient) parameter which was based on the auditory characteristic for accomplishing better speech recognition rate. And it is performed the experiment of speech recognition for isolated word acquired from telephone network. For the purpose of comparing GFCC parameter with other parameter, the experiment of speech recognition are carried out using MFCC and LPCC parameter. Also, for each parameter, we are implemented CMS(cepstral mean subtraction)which was applied or not in order to compensate channel distortion in telephone network. Accordingly, we found that the recognition rate using GFCC parameter is better than other parameter in the experimental result.
Keywords
Citations & Related Records
연도 인용수 순위
  • Reference
1 Lawrence Rabiner, Biing-Hwang Juang, Fundamentals of Speech Recognition, Prentice-Hall, 1993
2 정호영, 김도영, 은종관, 이수영, '청각구조를 이용한 잡음 음성의 인식 성능 향상', 한국음향학회지 제14권, 제5호, 1995
3 Rivarol Vergin, Douglas O'Shaughnessy, 'Generalized Mel Frequency Cepstral Coefficients for Large-Vocabulary Speaker-Independent Continuous-Speech Recognition,' IEEE Trans. on Speech & Audio Processing, vol. 7, no. 5, pp. 512-532, 1999
4 M. Slaney, 'An Efficient Implementation of the Patterson-Holdworth Auditory Filter Bank,' Apple Computer Tech. Report #35, 1993
5 John R. Deller, Jr., John G. Proakis, John H. L. Hansen, Discrete-Time Processing of Speech Signals, Macmillan, 1993
6 C. J. Moore and R. Glasberg, 'Suggested Formulae for Calculating Auditory-Filter Bandwidths and Excitation Patterns,' J. Acoust. Soc Am., vol. 74, pp. 750-753, Sep. 1983
7 조태현, 김유진, 이재영, 정재호, '전화선 채널이 화자확인 시스템의 성능에 미치는 영향', 한국음향학회지 제18권, 제5호, 1999
8 J. M. Kates, 'A Time-Domain Digital Cochlear Model,' IEEE Trans. on Signal Processing, vol. 39, no. 12, pp. 2573-2592, Dec. 1991