A Study on Speech Recognition in a Running Automobile

;;

The Journal of the Acoustical Society of Korea (한국음향학회지)

Volume 19 Issue 5
/
Pages.3-8
/
2000
/
1225-4428(pISSN)
/
2287-3775(eISSN)

The Acoustical Society of Korea (한국음향학회)

A Study on Speech Recognition in a Running Automobile

주행중인 자동차 환경에서의 음성인식 연구

양진우 (춘천기능대학 전자과) ;
김순협 (광운대학교 컴퓨터공학과)

Published : 2000.07.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

In this paper, we studied design and implementation of a robust speech recognition system in noisy car environment. The reference pattern used in the system is DMS(Dynamic Multi-Section). Two separate acoustic models, which are selected automatically depending on the noisy car environment for the speech in a car moving at below 80km/h and over 80km/h are proposed. PLP(Perceptual Linear Predictive) of order 13 is used for the feature vector and OSDP (One-Stage Dynamic Programming) is used for decoding. The system also has the function of editing the phone-book for voice dialing. The system yields a recognition rate of 89.75% for male speakers in SI (speaker independent) mode in a car running on a cemented express way at over 80km/h with a vocabulary of 33 words. The system also yields a recognition rate of 92.29% for male speakers in SI mode in a car running on a paved express way at over 80km/h.

본 논문은 주행중인 자동차 환경에서의 음성인식에 대하여 연구하였다. 여기에서 사용한 기준패턴(reference pattern)은 DMS(Dynamic Multi-Section)이며, 인식율을 높이기 위하여 2모델을 제안하였다. 또한 가변적인 차량의 잡음환경에 강인하기 위하여 일반주행(80km/h 이내), 고속주행(80km/h 이상)등으로 나누었으며 차량의 잡음에 따라 자동으로 선택하도록 하였다. 음성의 특징 벡터와 인식 알고리즘은 PLP(Perceptual Linear Predictive) 13차와 OSDP(One-Stage Dynamic Programming)를 사용하였다. 그리고 핸드폰을 사용하는 운전자의 안전을 위하여 음성으로 전화를 걸 수 있도록 하는 전화번호 등록 및 제어기능의 Voice Dialing 기능을 추가하였다. 실험결과 주행중인 자동차 환경에서 자주 사용되는 차량 편의장치 제어명령 33개에 대하여 중부, 영동 고속도로(시멘트 도로 80km/h이상)에서 남성 화자독립 89.75%의 인식율을 구하였으며, 경부고속도로(아스팔트 도로 80km/h이상)에서는 남성화자독립 92.29%의 인식율을 구하였다.

Keywords

References

1998년도 한국음향학회 학술발표대회 논문집 v.17 no.2(s) "주행중인 자동차 환경에서의 고립단어 음성인식 연구" 유봉근;이정기;김순협;박찬식;이순재
J.Acoust.Soc.Am. v.87 no.4 "Perceptual Linear Predictive (PLP) Analysis of Speech" H.Hermansky
IEEE Transaction on Acoustics,Speech, and Signal Processing v.ASSP-32 no.2 "The Use of a One-Stage Dynamic Programming Algorithm for Connected Word Recognition" Hermann Ney
IEEE ASSP Magazine v.1 "Vector Quantization" R.M.Gray
한국 통신 학회 v.16 no.2 "개선된 MSVQ 인식 시스템을 이용한 단독어 인식에 관한 연구" 안태욱 外
박사학위 논문 "Fuzzy와 MSVQ 에 기초를 둔 MSHMM 을 이용한 음성인식에 관한 연구" 안태옥
박사학위 논문 "DMS 모델을 이용한 단독어 인식에 관한 연구" 변용규

The Journal of the Acoustical Society of Korea (한국음향학회지)

A Study on Speech Recognition in a Running Automobile

주행중인 자동차 환경에서의 음성인식 연구

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)