Browse > Article

A Method of Generating Table-of-Contents for Educational Video  

Lee Gwang-Gook (Division of Electrical and Computer Engineering, Hanyang University)
Kang Jung-Won (Broadcasting Media Research Group, Digital Broadcasting Research Division, ETRI)
Kim Jae-Gon (Broadcasting Media Research Group, Digital Broadcasting Research Division, ETRI)
Kim Whoi-Yul (Division of Electrical and Computer Engineering, Hanyang University)
Publication Information
Journal of Broadcast Engineering / v.11, no.1, 2006 , pp. 28-41 More about this Journal
Abstract
Due to the rapid development of multimedia appliances, the increasing amount of multimedia data enforces the development of automatic video analysis techniques. In this paper, a method of ToC generation is proposed for educational video contents. The proposed method consists of two parts: scene segmentation followed by scene annotation. First, video sequence is divided into scenes by the proposed scene segmentation algorithm utilizing the characteristics of educational video. Then each shot in the scene is annotated in terms of scene type, existence of enclosed caption and main speaker of the shot. The ToC generated by the proposed method represents the structure of a video by the hierarchy of scenes and shots and gives description of each scene and shot by extracted features. Hence the generated ToC can help users to perceive the content of a video at a glance and. to access a desired position of a video easily. Also, the generated ToC automatically by the system can be further edited manually for the refinement to effectively reduce the required time achieving more detailed description of the video content. The experimental result showed that the proposed method can generate ToC for educational video with high accuracy.
Keywords
Multimedia Ontology; Semantic Retrieval; Semantic Integration;
Citations & Related Records
연도 인용수 순위
  • Reference
1 J. Bescos, J. M. Menendez, G. Cisneros, J. Cabrera, and J. M. Martinez, 'A Unified Approach to Gradual Shot Transition Detection', in Proceedings of International Conference on Image Processing, Vol. III, pp. 949-952, 2000
2 M. Yeung and B. L. Yeo, 'Time-constrained clustering for segmentation of video into story units,' in Proceedings of ICPR, Vol. C, Vienna, Austria, Aug. 1996, pp. 375-380
3 A. Hanjalic, R. L. Legendijk, and J. Biemond, 'Automated High-Level Movie Segmenation for Advanced Video-Retirieval Systems', in IEEE Transactions of Circuits and Systems for Video Technology, Vol. 9, No. 4, June 1999
4 B. L. Yeo and B. Liu, 'Rapid Scene Analysis on Compressed Videos,' in IEEE Transactions on Circuits and Systems for Video Technology, 5(6): 533-544, Dec. 1995   DOI   ScienceOn
5 W. Tavananpong, 'Shot Clustering Techniques for Video Browsing,' in IEEE Transactions on Multimedia, Vol. 6, No. 5, August 2004
6 D. A. Reynolds.: A Gaussian Mixture Modeling Approach to Text-Independent Speaker Identification. PhD thesis. Electrical Engineering Department, Georgia Institute of Technology, 2000
7 C. Wolf, J.-M. Jolion, F. Chassaing, 'Text Localization, Enhancement and Binarization in Multimedia Document' in Proceedings of 16th International Conference on Pattern Recognition, Volume: 2 , 11-15 Aug. 2002
8 M. Xu, N. C. Maddage, C. Xu, M. Kankanhali and Q. Tian, 'Creating Audio Keywords for Event Detection in Soccer Video,' in Proceedings of International Conference on Multimedia and Expo, pp. 281-284, 2003
9 'MPEG-7 Visual part of experimentation Model Version 10.0, 'ISO/IEC JTC1/SC29/WG11, N4063, Singapore, Mar. 2001
10 Y. Yusoff, W. Christmas, and J. Kittler, 'Video Shot Cut Detection Using Adaptive Thresholding,' in Proceedings of the 11th British Machine Vision Conference, pp. 362-372, 2000
11 W.Zhou, A.Vellaikal, and C. J. Kuo, 'Rule-based Video Classification System for Basketball Video Indexing,' in Proceedings of ACM Multimedia 2000 workshops, 2000
12 A. Ekin, A. M. Tekalp and R. Mehrotra, 'Automatic Soccer Video Analysis and Summarization,' in IEEE Transactions on Image Processing, Vol. 12, No. 7, July 2003
13 Winston H.-M. Hsu, L. Kennedy, C.-W. Huang, S.-F. Chang, C.-Y. Lin, G. Iyengar, 'News Video Story Segmentation using Fusion of Multi-Level Multi-modal Features in TRECVID 2003,' in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, Montreal, Canada, May 17-21, 2004
14 H. Sundaram, S.-F. Chang, 'Computable scenes and structures in films,' in IEEE Transactions on Multimedia, Volume: 4 , Issue: 4 , Dec. 2002, Pages:482 - 491   DOI   ScienceOn
15 A. Girgensohn and J. Foote, 'Video Classification using Transform Coefficients,' in Proceedings of International Conference on Acoustics, Speech, and Signal, vol. 6, pp. 3045-3048, 1999., March 15, 1999