A Study on the Use of Speech Recognition Technology for Content-based Video Indexing and Retrieval

;;;;

The Journal of the Acoustical Society of Korea (한국음향학회지)

Volume 20 Issue 2
/
Pages.16-20
/
2001
/
1225-4428(pISSN)
/
2287-3775(eISSN)

The Acoustical Society of Korea (한국음향학회)

A Study on the Use of Speech Recognition Technology for Content-based Video Indexing and Retrieval

내용기반 비디오 색인 및 검색을 위한 음성인식기술 이용에 관한 연구

손종목 (경북대학교 전자전기공학부) ;
배건성 (경북대학교 전자전기공학부) ;
강경옥 (한국전자통신연구원 방송미디어연구부) ;
김재곤 (한국전자통신연구원 방송미디어연구부)

Published : 2001.02.01

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

An important aspect of video program indexing and retrieval is the ability to segment video program into meaningful segments, in other words, the ability of content-based video program segmentation. In this paper, a new approach using speech recognition technology has been proposed for content-based video program segmentation. This approach uses speech recognition technique to synchronize closed caption with speech signal. Experimental results demonstrate that the proposed scheme is very promising for content-based video program segmentation.

비디오 프로그램 색인 및 검색에 있어서 비디오 프로그램을 의미 있는 부분으로 분할하는 것, 즉 내용기반 비디오 프로그램 분할은 중요하다. 본 논문에서는 내용기반 비디오 프로그램 분할을 위해 음성인식기술을 이용하는 새로운 방법을 제안한다. 제안한 방법은 음성신호와 캡션 (Closed Caption)의 정확한 동기를 위해 음성인식 기법을 사용한다. 실험을 통하여 내용기반 비디오 프로그램 분할을 위해 제안한 방법의 가능성을 확인하였다.

Keywords

References

Int. Conf. on Acoustics, Speech and Signal Processing v.Ⅵ A Hidden Markov Model Framework for Video Segmentation Using Audio and Image Features John S. Boreczky;Lynn D. Wilcox
Proc. European Conf. on Speech Communication and Technology v.5 Sound Channel Video Indexing Claude Montacie;Marie-Jose Caraty
Proc. of ARPA Speech Recognition Workshop $INFORMEDIA^{TM}$ :News-On-Demand Experiments In Speech Recognition Howard D. Wactlar;Alexander G. Hauptmann;Michael J. Witbrock
Proc. of the DARPA Broadcast news Transcription and Understanding Workshop An Overview of the AT&T Spoken Document Retrieval System John Choi;Don Hindle;Julia Hirschberg;Ivan Magrin-Chagnolleau;Christine Nakatani;Fernando Pereira;Amit Singhal;Steve Whittaker
Int. Conf. on Acoustics, Speech and Signal Processing v.Ⅱ Speaker Identification based Text to Audio Alignment for Audio Retrieval System Deb Roy;Carl Malamud
Int. Conf. on Acoustics, Speech and Signal Processing v.Ⅱ Detection of Target Speakers in Audio Databases Ivan Magrin-Chagnolleau;Aaron E. Rosenberg;S. Parthasarathy
Ph. D. thesis, CMU Acoustical and Environmental Robustness in Autimatic Speech Recognition Alejandro Acero
IEEE Trans. on Acoustics, Speech and Signal Processing v.32 no.6 Speech Enhancement Using a Minimum Mean-Square Erro Short-Time Spectral Amplitude Estimator Yariv Ephraim;David Malah
Int. Conf. on Acoustics, Speech and Signal Processing v.2 Acoustic Modeling of Subword Units for Speech Recognition C.-H. Lee;L.R. Rabiner;R. Pieraccinit;Jay G. Wilpon
IEEE Trans. on Acoustics, Speech and Signal Processing IEEE Trans. On ASSP v.38 no.9 The Segmental K-Means Algorithm for Estimation Parameters of Hidden Markov Models B.-H. Juang;L.R. Rabiner
Proc. of IEEE v.77 no.2 A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition L.R. Rabiner
Ph. D thesis, CMU Efficient algorithms for Speech Recognition Mosur K. Ravishankar
한국음향학회지 v.18 no.4 HMM 인식기에서 상태별 다중 특징 파라미터 가중 손종목;배건성

The Journal of the Acoustical Society of Korea (한국음향학회지)

A Study on the Use of Speech Recognition Technology for Content-based Video Indexing and Retrieval

내용기반 비디오 색인 및 검색을 위한 음성인식기술 이용에 관한 연구

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)