DOI QR코드

DOI QR Code

Subtitle Automatic Generation System using Speech to Text

음성인식을 이용한 자막 자동생성 시스템

  • 손원섭 (순천대학교 컴퓨터공학과) ;
  • 김응곤 (순천대학교 컴퓨터공학과)
  • Received : 2020.12.04
  • Accepted : 2021.02.17
  • Published : 2021.02.28

Abstract

Recently, many videos such as online lecture videos caused by COVID-19 have been generated. However, due to the limitation of working hours and lack of cost, they are only a part of the videos with subtitles. It is emerging as an obstructive factor in the acquisition of information by deaf. In this paper, we try to develop a system that automatically generates subtitles using voice recognition and generates subtitles by separating sentences using the ending and time to reduce the time and labor required for subtitle generation.

최근 COVID-19로 인한 온라인 강의 영상과 같은 많은 영상이 생성되고 있는데 노동 시간의 한계와 비용의 부족 등으로 인해 자막을 보유한 영상이 일부분에 불과하여 청각장애인들의 정보 취득에 방해 요소로 대두되고 있다. 본 논문에서는 음성인식을 이용하여 자막을 자동으로 생성하고 종결 어미와 시간을 이용해 문장을 분리하여 자막을 생성함으로써 자막 생성에 드는 시간과 노동력을 줄일 수 있도록 하는 시스템을 개발하고자 한다.

Keywords

References

  1. Y. Baek, H. Lee, and J. Oh, "A Study on the Near Field IoT Medical Receipt System Based on Uncontact," J. of the Korea Institute of Electronic Communication Sciences, vol. 18, no. 1, 2020, pp. 73-110.
  2. Y. Sun, "A Semantic Diagnosis and Tracking System to Prevent the Spread of COVID-19," J. of the Korea Institute of Electronic Communication Sciences, vol. 15, no. 3, 2020, pp. 611-616. https://doi.org/10.13067/JKIECS.2020.15.3.611
  3. J. Park, "How do Creators Work? A Critical Study on the Production Experience of Personal Media," Journal of media economics & culture, vol. 18, no. 1, 2020, pp. 73-110. https://doi.org/10.21328/jmec.2020.2.18.1.73
  4. S. Kim, "Machine Learning based Automatic Caption Generation System for Speaker Diarization," Docate, Graduate School of Korea University of Technology and Education, 2019.
  5. J. Choi, "Independent component analysis based on frequency domain model for speech source signal extraction," J. of the Korea Institute of Electronic Communication Sciences, vol. 15, no. 5, 2020, pp. 807-812. https://doi.org/10.13067/JKIECS.2020.15.5.807
  6. Y. Kim and M. Chung, "Improving Performance of Continuous Speech Recognition Using Error Pattern Training and Post Processing Module," Journal of Korean Information Science Society, vol. 27, no. 1B, 2000, pp. 441-443.
  7. K. Ok, J. Park, W. Lee, and J. Ho, "An Improved Adaptive Job Allocation Method for Multiprocessor Systems," Journal of The KIPS Transactionsty, vol. 6, no. 6, 1999, pp. 1502-1510.
  8. Y. Son, "An Extended Conflict-Resolution Method using Multiple Semaphor Scheme," Journal of Bulletin of the Institute for industrial Science, vol. 15, no. 2, 1992, pp. 139-147.
  9. E. Choi, "A Study of Shortened Conclusive-Endings for Korean Language Education: Focused on Compound Forms of Double Conclusive-Endings," Journal of the International Network for Korean Language and Culture, vol 8, no 1, 2011, pp. 205-230.
  10. J. Kim and S. Kim, "Server construction for game engine development using Node.js+Nginx," Journal of the Korean Society of Information Technology, vol. 13, no. 122, 2015, pp. 109-114.
  11. Y. Bea, S. Jung, and W. Soh, "Comparative Analysis of the Virtual Machine and Containers Methods through the Web Server Configuration," Journal of the Korea Institute of Information and Communication Engineering, vol 18. no. 11, 2014, pp. 2670-2677. https://doi.org/10.6109/jkiice.2014.18.11.2670
  12. S. Shin, K. Kim, J. Jang, W. Sohn, and C. Park, "Securing Reverse Proxy Server for defending DDOS attack," Korean Society of Electronics Engineers Conference, Pyeongchang, Korea, June, 2003, vol. 2003, no. 11, pp. 430-433.