Face Recognition and Preprocessing Technique for Speaker Identification in hard of hearing broadcasting

Kim, Nayeon; Cho, Sukhee; Bae, Byungjun; Ahn, ChungHyun

Proceedings of the Korean Society of Broadcast Engineers Conference (한국방송∙미디어공학회:학술대회논문집)

2020.07a, pp. 450-452, 2020

The Korean Institute of Broadcast and Media Engineers (한국방송∙미디어공학회)

A Study on Face Recognition Algorithms and Preprocessing for Speaker Identification in Broadcasting for the Hearing Impaired

Kim, Nayeon (Korea University of Science and Technology) ;
Cho, Sukhee (Electronics and Telecommunications Research Institute) ;
Bae, Byungjun (Korea University of Science and Technology) ;
Ahn, ChungHyun (Electronics and Telecommunications Research Institute)

김나연 (과학기술연합대학원대학교) ;
조숙희 (한국전자통신연구원) ;
배병준 (과학기술연합대학원대학교) ;
안충현 (한국전자통신연구원)

Published : 2020.07.13

Abstract

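This paper reviews deep-learning-based face recognition algorithms and applies them to actor face recognition, with the aim of identifying speakers and displaying emotion-expressing captions in broadcasting for the hearing impaired. First, as an approach to actor face recognition, we examine the structure of the ResNet-50-based VGGFace2 model, a deep-learning face recognition algorithm based on one-shot learning. Building on this model, we then apply various preprocessing methods and measure the resulting accuracy, in order to explore how actors' faces can be recognized in actual broadcasting for the hearing impaired.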
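
The one-shot identification and accuracy measurement mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each face has already been converted to an embedding vector by some model and that each actor is enrolled with a single reference embedding. The vectors below are toy values rather than VGGFace2 outputs, and the names `identify`, `accuracy`, `gallery`, and the `threshold` value are hypothetical choices made for illustration. The abstract does not describe the authors' actual matching rule, so cosine similarity is used here only as one common option.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity between two embedding vectors of equal length."""
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


def identify(query, gallery, threshold=0.5):
    """Return (name, score) for the closest enrolled actor.

    `gallery` maps an actor name to ONE reference embedding (one-shot).
    If the best score is below `threshold` (an assumed value), the face
    is treated as unknown and the name is None.
    """
    best_name, best_score = None, -1.0
    for name, reference in gallery.items():
        score = cosine_similarity(query, reference)
        if score > best_score:
            best_name, best_score = name, score
    if best_score < threshold:
        return None, best_score
    return best_name, best_score


def accuracy(samples, gallery, threshold=0.5):
    """Fraction of (embedding, true_name) samples identified correctly."""
    correct = sum(
        1 for emb, label in samples
        if identify(emb, gallery, threshold)[0] == label
    )
    return correct / len(samples)


# Toy data: hypothetical 3-dimensional embeddings, one per enrolled actor.
gallery = {
    "actor_a": [1.0, 0.0, 0.0],
    "actor_b": [0.0, 1.0, 0.0],
}
samples = [
    ([0.9, 0.1, 0.0], "actor_a"),
    ([0.2, 0.8, 0.1], "actor_b"),
    ([0.6, 0.5, 0.0], "actor_b"),  # closer to actor_a, so misidentified
]

print(identify([0.9, 0.1, 0.0], gallery))
print(accuracy(samples, gallery))
```

Under this setup, comparing preprocessing methods would amount to recomputing the sample embeddings after each preprocessing variant and calling `accuracy` again on the same gallery.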
