Voice Activity Detection based on DBN using the Likelihood Ratio

Kim, S.K.;Lee, S.M.;

재활복지공학회논문지 (Journal of rehabilitation welfare engineering & assistive technology)

Vol. 8, No. 3 / pp. 145-150 / 2014 / pISSN 1976-7102

한국재활복지공학회 (Rehabilitation Engineering And Assistive Technology Society of Korea)

Korean title: 우도비를 이용한 DBN 기반의 음성 검출기

Kim, S.K. (School of Electronic Engineering, Inha University);
Lee, S.M. (School of Electronic Engineering, Inha University)

Received: 2014.07.25
Reviewed: 2014.08.19
Published: 2014.08.31


Abstract

In this paper, we propose a novel voice activity detection (VAD) algorithm that uses the likelihood ratio (LR) of each frequency band, determined from the input signal, as the input layer of a deep belief network (DBN). Conventional statistical model-based VADs decide whether a segment contains speech with a decision rule based on the geometric mean of the likelihood ratios. The proposed VAD replaces this decision rule with a DBN trained to minimize the probability of detection error. We compared the proposed DBN-based algorithm, under a variety of conditions in stationary and nonstationary noise environments, with a support vector machine (SVM)-based VAD, which itself improves on the statistical model-based VAD. The proposed algorithm improved the overall misclassification probability by [0.7, 2.7] over the conventional SVM-based algorithm.
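The conventional decision rule that the abstract refers to can be illustrated with a short sketch. It is not taken from the paper: it assumes the complex-Gaussian statistical model commonly used for LR-based VAD, in which the log likelihood ratio of band k is gamma_k * xi_k / (1 + xi_k) - log(1 + xi_k), with gamma_k the a posteriori SNR and xi_k the a priori SNR. The function names, the threshold value, and the choice to pass the a priori SNR in directly (rather than estimating it) are hypothetical choices made for illustration.

```python
import math


def band_log_likelihood_ratios(noisy_power, noise_power, prior_snr):
    """Return the log likelihood ratio of each frequency band.

    Assumes a complex-Gaussian model for speech and noise (an assumption of
    this sketch, not a detail stated in the abstract).
    """
    log_lrs = []
    for x, n, xi in zip(noisy_power, noise_power, prior_snr):
        gamma = x / n  # a posteriori SNR of this band
        log_lrs.append(gamma * xi / (1.0 + xi) - math.log(1.0 + xi))
    return log_lrs


def geometric_mean_decision(log_lrs, threshold=0.0):
    """Conventional rule: compare the log of the geometric mean of the
    per-band LRs (the mean of the log LRs) with a threshold.

    The threshold value is hypothetical; in practice it is tuned.
    """
    score = sum(log_lrs) / len(log_lrs)
    return score > threshold, score


# Two bands whose observed power is well above the noise estimate.
frame_log_lrs = band_log_likelihood_ratios([4.0, 4.0], [1.0, 1.0], [3.0, 3.0])
is_speech, score = geometric_mean_decision(frame_log_lrs)
```

In the scheme described in the abstract, the vector of per-band likelihood ratios (here `frame_log_lrs`) would be fed to a trained DBN in place of the call to `geometric_mean_decision`; the DBN itself is not sketched here.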
