Dimension Reduction Method of Speech Feature Vector for Real-Time Adaptation of Voice Activity Detection

Park Jin-Young;Lee Kwang-Seok;Hur Kang-In;

Journal of the Institute of Convergence Signal Processing (융합신호처리학회논문지)

Volume 7 Issue 3
/
Pages.116-121
/
2006
/
2765-1134(pISSN)

The Korea Institute of Convergence Signal Processing (한국융합신호처리학회)

Dimension Reduction Method of Speech Feature Vector for Real-Time Adaptation of Voice Activity Detection

음성구간 검출기의 실시간 적응화를 위한 음성 특징벡터의 차원 축소 방법

박진영 (동아대학교 전자공학과) ;
이광석 (진주산업대학교 전자공학과) ;
허강인 (동아대학교 전자공학과)

Published : 2006.07.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

In this paper, we propose the dimension reduction method of multi-dimension speech feature vector for real-time adaptation procedure in various noisy environments. This method which reduces dimensions non-linearly to map the likelihood of speech feature vector and noise feature vector. The LRT(Likelihood Ratio Test) is used for classifying speech and non-speech. The results of implementation are similar to multi-dimensional speech feature vector. The results of speech recognition implementation of detected speech data are also similar to multi-dimensional(10-order dimensional MFCC(Mel-Frequency Cepstral Coefficient)) speech feature vector.

본 논문에서는 다양한 잡음환경에서의 실시간 적응화 기법을 적용하기 위한 선결 과제로 다차원 음성 특정 벡터를 저차원으로 축소하는 방법을 제안한다. 제안된 방법은 특징 벡터를 확률 우도 값으로 매핑시켜 비선형적으로 축소하는 방법으로 음성 / 비음성의 분류는 우도비 검증 (Likelihood Ratio Test; LRT) 을 이용하여 분류하였다. 실험 결과 고차원 특징 벡터를 이용하여 분류한 결과와 대등하게 분류됨을 확인할 수 있었다. 그리고, 제안된 방법에 의해 검출된 음성 데이터를 이용한 음성인식 실험에서도 10차 MFCC(Mel-Frequency Cepstral Coefficient)를 사용하여 분류한 경우와 대등한 인식률을 보여주었다.

Journal of the Institute of Convergence Signal Processing (융합신호처리학회논문지)

Dimension Reduction Method of Speech Feature Vector for Real-Time Adaptation of Voice Activity Detection

음성구간 검출기의 실시간 적응화를 위한 음성 특징벡터의 차원 축소 방법

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)