Combination Tandem Architecture with Segmental Features for Robust Speech Recognition

강인한 음성 인식을 위한 탠덤 구조와 분절 특징의 결합

  • 윤영선 (한남대학교 정보통신공학과) ;
  • 이윤근 (한국전자통신연구원 음성처리연구팀)
  • Published : 2007.06.30

Abstract

It is reported that the segmental feature based recognition system shows better results than conventional feature based system in the previous studies. On the other hand, the various studies of combining neural network and hidden Markov models within a single system are done with expectations that it may potentially combine the advantages of both systems. With the influence of these studies, tandem approach was presented to use neural network as the classifier and hidden Markov models as the decoder. In this paper, we applied the trend information of segmental features to tandem architecture and used posterior probabilities, which are the output of neural network, as inputs of recognition system. The experiments are performed on Auroral database to examine the potentiality of the trend feature based tandem architecture. From the results, the proposed system outperforms on very low SNR environments. Consequently, we argue that the trend information on tandem architecture can be additionally used for traditional MFCC features.

Keywords