HMM-based Speech Recognition using DMS Model and Fuzzy Concept

Ann, Tae-Ock;

doi:10.5762/KAIS.2008.9.4.964

Journal of the Korea Academia-Industrial cooperation Society (한국산학기술학회논문지)

Volume 9 Issue 4
/
Pages.964-969
/
2008
/
1975-4701(pISSN)
/
2288-4688(eISSN)

The Korea Academia-Industrial cooperation Society (한국산학기술학회)

DOI QR Code

HMM-based Speech Recognition using DMS Model and Fuzzy Concept

DMS 모델과 퍼지 개념을 이용한 HMM에 기초를 둔 음성 인식

Ann, Tae-Ock

안태옥 (호원대학교 컴퓨터게임학부)

Published : 2008.08.31

https://doi.org/10.5762/KAIS.2008.9.4.964 Citation PDF

Download PDF

⟨ Previous Next ⟩

Abstract

This paper proposes a HMM-based recognition method using DMSVQ(Dynamic Multi-Section Vector Quantization) codebook by DMS(Dynamic Multi-Section) model and fuzzy concept, as a study for speaker- independent speech recognition. In this proposed recognition method, training data are divided into several dynamic section and multi-observation sequences which are given proper probabilities by fuzzy rule according to order of short distance from DMSVQ codebook per each section are obtained. Thereafter, the HMM using this multi-observation sequences is generated, and in case of recognition, a word that has the most highest probability is selected as a recognized word. Other experiments to compare with the results of recognition experiments using proposed method are implemented as a data by the various conventional recognition methods under the equivalent environment. Through the experiment results, it is proved that the proposed method in this study is superior to the conventional recognition methods.

본 논문은 화자 독립의 음성인식을 위한 연구로서, DMS(Dynamic Multi-Section) 모델에 의한 DMSVQ(Dynamic Multi-Section Vector Quantization) 코드북과 퍼지 개념을 이용한 HMM(Hidden Markov Model) 음성인식 방법을 제안한다. 제안된 인식 방법에서는 학습 데이터를 동적으로 몇 개의 구간(section)으로 분할한 후, 각 구간마다 DMSVQ 코드북(codebook)으로 부터 거리값이 작은 순으로 퍼지 법칙을 적용함으로써 적당한 확률값을 준 다중 관측열(multi-observation sequences)을 구한다. 그런 다음, 이 다중 관측열을 이용하여 HMM을 작성하고, 인식시에는 관측 확률값이 가장 높은 것을 인식된 것으로 선택한다. 제안된 방법에 의한 인식 실험은 기존의 다양한 인식 실험들과 비교를 위해 동일한 조건하에서 같은 데이터로 수행 하였다. 실험 결과로서, 본 연구에서 제안한 방법이 기존의 방법들보다 우수한 방법임을 입증하였다.

Keywords

References

Hiroaki Sakoe and Seibi Chiba, "Dynamic Programming Algorithm Optimization for Spoken Word Recognition",IEEE Trans. on Acoustics, Speech and Signal Processing, Vol. ASSP-26, No. 1, pp. 43-49, Feb. 1978. https://doi.org/10.1109/TASSP.1978.1163055
R. M. Gray, " Vector Quantization", IEEE ASSP Magazine, Vol. 1, pp. 4-29, Apr. 1984 https://doi.org/10.1109/MASSP.1984.1162228
D. K. Burton, J. E. Shore and J. T. Buck, " Isolated-Word Speech Recognition using Multisection Vector Quantization Codebooks", IEEE Trans. of Acoustics, Speech, Signal Processing, Vol. ASSP-33, No. 4, Aug. 1985. https://doi.org/10.1109/TASSP.1985.1164650
Tae Ock Ann and Sun hyub Kim, "An automatic Speech Recognition of Computer Using Time Sequential Vector Quantization", The Institute of Electronics Engineers of Korea, Vol. 27, No. 7, July. 1990.
Tae Ock Ann and Young Kyu Byun, "A Study on Speech Recognition using DMS Model", The Acoustical Society of Korea, Vol. 13, No. 2E, pp. 41-50, Dec. 1994.
L. R. Rabiner and B. H. Juang, " An Intorduction to Hidden Markov Models", IEEE ASSP Magazine, JAN. 1986.
T. O. Ann, Y. G. Byun and S. H. Kim, "Korean Speech Recognition using DHMM", The Acoustical Society of Korea, Vol. 10. No. 1, pp. 52-61, Feb. 1991.
안태옥, 변용규, 김순협, “MSVQ를 이용한 HMM에 의한 단독어 인식”, 대한전자 공학회, 제 27권 제 9호, pp. 158-165, Sep. 1990.
안태옥, “Speech Recognition using MSHMM based on Fuzzy Concept", 한국 음향학회지, 제16권 2E호, pp. 55-61, Sep. 1997.