Journal of the Korean Institute of Telematics and Electronics (대한전자공학회논문지)
- Volume 24 Issue 5
- /
- Pages.905-913
- /
- 1987
- /
- 1016-135X(pISSN)
An Isolated Word Recognition Using the Mellin Transform
Mellin 변환을 이용한 격리 단어 인식
Abstract
This paper presents a speaker dependent isolated digit recognition algorithm using the Mellin transform. Since the Mellin transform converts a scale information into a phase information, attempts have been made to utilize this scale invariance property of the Mellin transform in order to alleviate a time-normalization procedure required for a speech recognition. It has been found that good results can be obtained by taking the Mellin transform to the features such as a ZCR, log energy, normalized autocorrelation coefficients, first predictor coefficient and normalized prediction error. We employed a difference function for evaluating a similarity between two patterns. When the proposed algorithm was tested on Korean digit words, a recognition rate of 83.3% was obtained. The recognition accuracy is not compatible with the other technique such as LPC distance however, it is believed that the Mellin transform can effectively perform the time-normalization processing for the speech recognition.
Keywords