Browse > Article
http://dx.doi.org/10.5909/JBE.2007.12.4.311

Multi-modal Detection of Anchor Shot in News Video  

Yoo, Sung-Yul (School of Electrical Engineering, Kookmin University)
Kang, Dong-Wook (School of Electrical Engineering, Kookmin University)
Kim, Ki-Doo (School of Electrical Engineering, Kookmin University)
Jung, Kyeong-Hoon (School of Electrical Engineering, Kookmin University)
Publication Information
Journal of Broadcast Engineering / v.12, no.4, 2007 , pp. 311-320 More about this Journal
Abstract
In this paper, an efficient detection algorithm of an anchor shot in news video is presented. We observed the audio visual characteristics of news video and proposed several low level features which are appropriate for detecting an anchor shot in news video. The overall structure of the proposed algorithm is composed of 3 stages: the pause detection, the audio cluster classification, and the matching with motion activity stage. We used the audio features as well as the motion feature in order to improve the indexing accuracy and the simulation results show that the performance of the proposed algorithm is quite satisfactory.
Keywords
news video indexing; multi-modal feature; MFCC; and motion activity;
Citations & Related Records
연도 인용수 순위
  • Reference
1 C.G.M. Snoek and M. Worring, 'Multimodal Video Indexing: A Review of the State-of-the-art,' Multimedia Tools and Applications, vol.25, no.1, pp.5-35, 2005   DOI   ScienceOn
2 L. Chaisorn, T.-S. Chua, and C.-H. Lee, 'A Multi-Modal Approach to Story Segmentation for News Video,' World Wide Web, vol. 6, no.2, pp.187-208, 2003   DOI
3 D. Li, I.K Sethi, N. Dimitrova, and T. McGee, 'Classification of General Audio Data for Content based Retrieval,' Pattern Recognition Letters, vol.22, no.5, pp.533-544, 2005
4 I.K. Sethi, and G.P.R. Sarvarayudu, 'Hierarchical Classifier Design using Mutual Information,' IEEE Transactions on Pattern Recognition Machine Intelligence, vol. 4, no.4, pp.441-445, 1982   DOI   ScienceOn
5 W. Hsu, L. Kennedy, C-W. Huang, S.-F. Chang, C.-Y. Lin, and G. Iyengar, 'News Video Story Segmentation using Fusion of Multi-level Multi-modal Features in TRECVID 2003,' Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol.3, pp.645-648, 2004
6 S. Quadri, S. Krishnan, and L. Guan, 'Indexing of NFL Video using MPEG 7 Descriptors and MFCC Features,' Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing, vol.2, pp.429-432, 2005
7 W. Qi, L. Gu. H. Jiang, X.-R. Chen, and H.-J. Zhang, 'Integrating Visual, Audio and Text Analysis for News Video,' Proc. IEEE International Conference on Image Processing, vol.3, pp.520-523, 2000
8 X. Wu, C.-W. Ngo, and Q. Li, 'Threading and Autodocumenting News Videos,' IEEE Signal Processing Magazine, vol.23, no.3, pp.59-68, 2006
9 P. Salembier B.S. Manjunath and T. Sikora, 'Introduction to MPEG 7: Multimedia Content Description Interface,' John Wiley and Sons, England, UK, 2002