Development and Enhancement of Automatic Caption Generation System based on Speech-to-Text for the Hearing Impaired

Choi, Mi-Ae;Kim, Seung-Hyun;Jo, Min-Ae;Park, Dong-young;Kim, Yong-Ho;Yoon, Jong-hoo;

Proceedings of the Korean Society of Broadcast Engineers Conference (한국방송∙미디어공학회:학술대회논문집)

2020.07a
/
Pages.465-468
/
2020

The Korean Institute of Broadcast and Media Engineers (한국방송∙미디어공학회)

Development and Enhancement of Automatic Caption Generation System based on Speech-to-Text for the Hearing Impaired

청각장애인을 위한 음성-자막 자동 변환 시스템 개발 및 음성 인식률 고도화

Choi, Mi-Ae (Telecommunications Technology Association) ;
Kim, Seung-Hyun (Telecommunications Technology Association) ;
Jo, Min-Ae (Telecommunications Technology Association) ;
Park, Dong-young (Telecommunications Technology Association) ;
Kim, Yong-Ho (Telecommunications Technology Association) ;
Yoon, Jong-hoo (Telecommunications Technology Association)

최미애 (한국정보통신기술협회) ;
김승현 (한국정보통신기술협회) ;
조민애 (한국정보통신기술협회) ;
박동영 (한국정보통신기술협회) ;
김용호 (한국정보통신기술협회) ;
윤종후 (한국정보통신기술협회)

Published : 2020.07.13

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

인터넷 미디어, OTT, VOD 등 신규미디어가 비장애인의 정보제공 매체로 널리 확대되나, 자막 서비스를 제공하지 않아 청각장애인의 정보 격차가 더욱 심화되고 있다. 청각장애인의 미디어 접근성 제고를 위해 음성인식 서버 및 스마트 폰·태블릿 앱 간 연계를 통해 음성을 인식하여 자동으로 자막을 생성하고 표시하는 음성-자막 자동 변환 시스템을 개발하였고 음성인식률을 높이기 위해 뉴스/시사/다큐 장르 영상 콘텐츠의 음성에 대해 학습용 데이터를 제작하여 음성인식 성능을 고도화 시켰다. 본 논문에서는 청각장애인을 위한 음성-자막 자동 변환시스템 구성과 음성인식률 비교 평가 결과를 보여준다.

Proceedings of the Korean Society of Broadcast Engineers Conference (한국방송∙미디어공학회:학술대회논문집)

Development and Enhancement of Automatic Caption Generation System based on Speech-to-Text for the Hearing Impaired

청각장애인을 위한 음성-자막 자동 변환 시스템 개발 및 음성 인식률 고도화

Abstract

Keywords