Design and Implementation of a Bimodal User Recognition System using Face and Audio

Kim Myung-Hun;Lee Chi-Geun;So In-Mi;Jung Sung-Tae;

한국컴퓨터정보학회논문지 (Journal of the Korea Society of Computer and Information)

제10권5호
/
Pages.353-362
/
2005
/
1598-849X(pISSN)
/
2383-9945(eISSN)

한국컴퓨터정보학회 (Korean Society of Computer Information)

얼굴과 음성 정보를 이용한 바이모달 사용자 인식 시스템 설계 및 구현

Design and Implementation of a Bimodal User Recognition System using Face and Audio

김명훈 (원광대학교 컴퓨터 공학과) ;
이지근 (원광대학교 컴퓨터 공학과) ;
소인미 (원광대학교 컴퓨터 공학과) ;
정성태 (원광대학교 전기전자및정보공학부)

발행 : 2005.11.01

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

최근 들어 바이모달 인식에 관한 연구가 활발히 진행되고 있다. 본 논문에서는 음성 정보와 얼굴정보를 이용하여 바이모달 시스템을 구현하였다. 얼굴인식은 얼굴 검출과 얼굴 인식 두 부분으로 나누어서 실험을 하였다. 얼굴 검출 단계에서는 AdaBoost를 이용하여 얼굴 후보 영역을 검출 한 뒤 PCA를 통해 특징 벡터 계수를 줄였다. PCA를 통해 추출된 특징 벡터를 객체 분류 기법인 SVM을 이용하여 얼굴을 검출 및 인식하였다. 음성인식은 MFCC를 이용하여 음성 특징 추출을 하였으며 HMM을 이용하여 음성인식을 하였다. 인식결과, 단일 인식을 사용하는 것보다 얼굴과 음성을 같이 사용하였을 때 인식률의 향상을 가져왔고, 잡음 환경에서는 더욱 높은 성능을 나타냈었다.

Recently, study of Bimodal recognition has become very active. In this paper we propose a Bimodal user recognition system that uses face information and audio information. Face recognition consists of face detection step and face recognition step. Face detection uses AdaBoost to find face candidate area. After finding face candidates, PCA feature extraction is applied to decrease the dimension of feature vector. And then, SVM classifiers are used to detect and recognize face. Audio recognition uses MFCC for audio feature extraction and HMM is used for audio recognition. Experimental results show that the Bimodal recognition can improve the user recognition rate much more than audio only recognition, especially in the Presence of noise.

한국컴퓨터정보학회논문지 (Journal of the Korea Society of Computer and Information)

얼굴과 음성 정보를 이용한 바이모달 사용자 인식 시스템 설계 및 구현

Design and Implementation of a Bimodal User Recognition System using Face and Audio

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)