A Study on the Use of Speech Recognition Technology for Content-based Video Indexing and Retrieval

내용기반 비디오 색인 및 검색을 위한 음성인식기술 이용에 관한 연구

  • 손종목 (경북대학교 전자전기공학부) ;
  • 배건성 (경북대학교 전자전기공학부) ;
  • 강경옥 (한국전자통신연구원 방송미디어연구부) ;
  • 김재곤 (한국전자통신연구원 방송미디어연구부)
  • Published : 2001.02.01

Abstract

An important aspect of video program indexing and retrieval is the ability to segment video program into meaningful segments, in other words, the ability of content-based video program segmentation. In this paper, a new approach using speech recognition technology has been proposed for content-based video program segmentation. This approach uses speech recognition technique to synchronize closed caption with speech signal. Experimental results demonstrate that the proposed scheme is very promising for content-based video program segmentation.

비디오 프로그램 색인 및 검색에 있어서 비디오 프로그램을 의미 있는 부분으로 분할하는 것, 즉 내용기반 비디오 프로그램 분할은 중요하다. 본 논문에서는 내용기반 비디오 프로그램 분할을 위해 음성인식기술을 이용하는 새로운 방법을 제안한다. 제안한 방법은 음성신호와 캡션 (Closed Caption)의 정확한 동기를 위해 음성인식 기법을 사용한다. 실험을 통하여 내용기반 비디오 프로그램 분할을 위해 제안한 방법의 가능성을 확인하였다.

Keywords

References

  1. Int. Conf. on Acoustics, Speech and Signal Processing v.Ⅵ A Hidden Markov Model Framework for Video Segmentation Using Audio and Image Features John S. Boreczky;Lynn D. Wilcox
  2. Proc. European Conf. on Speech Communication and Technology v.5 Sound Channel Video Indexing Claude Montacie;Marie-Jose Caraty
  3. Proc. of ARPA Speech Recognition Workshop $INFORMEDIA^{TM}$ :News-On-Demand Experiments In Speech Recognition Howard D. Wactlar;Alexander G. Hauptmann;Michael J. Witbrock
  4. Proc. of the DARPA Broadcast news Transcription and Understanding Workshop An Overview of the AT&T Spoken Document Retrieval System John Choi;Don Hindle;Julia Hirschberg;Ivan Magrin-Chagnolleau;Christine Nakatani;Fernando Pereira;Amit Singhal;Steve Whittaker
  5. Int. Conf. on Acoustics, Speech and Signal Processing v.Ⅱ Speaker Identification based Text to Audio Alignment for Audio Retrieval System Deb Roy;Carl Malamud
  6. Int. Conf. on Acoustics, Speech and Signal Processing v.Ⅱ Detection of Target Speakers in Audio Databases Ivan Magrin-Chagnolleau;Aaron E. Rosenberg;S. Parthasarathy
  7. Ph. D. thesis, CMU Acoustical and Environmental Robustness in Autimatic Speech Recognition Alejandro Acero
  8. IEEE Trans. on Acoustics, Speech and Signal Processing v.32 no.6 Speech Enhancement Using a Minimum Mean-Square Erro Short-Time Spectral Amplitude Estimator Yariv Ephraim;David Malah
  9. Int. Conf. on Acoustics, Speech and Signal Processing v.2 Acoustic Modeling of Subword Units for Speech Recognition C.-H. Lee;L.R. Rabiner;R. Pieraccinit;Jay G. Wilpon
  10. IEEE Trans. on Acoustics, Speech and Signal Processing IEEE Trans. On ASSP v.38 no.9 The Segmental K-Means Algorithm for Estimation Parameters of Hidden Markov Models B.-H. Juang;L.R. Rabiner
  11. Proc. of IEEE v.77 no.2 A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition L.R. Rabiner
  12. Ph. D thesis, CMU Efficient algorithms for Speech Recognition Mosur K. Ravishankar
  13. 한국음향학회지 v.18 no.4 HMM 인식기에서 상태별 다중 특징 파라미터 가중 손종목;배건성