Browse > Article
http://dx.doi.org/10.13088/jiis.2012.18.2.047

Video Scene Detection using Shot Clustering based on Visual Features  

Shin, Dong-Wook (Department of Computer Science and Engineering, Hanyang University)
Kim, Tae-Hwan (BK21 AIS Team, Hanyang University)
Choi, Joong-Min (Department of Computer Science and Engineering, Hanyang University)
Publication Information
Journal of Intelligence and Information Systems / v.18, no.2, 2012 , pp. 47-60 More about this Journal
Abstract
Video data comes in the form of the unstructured and the complex structure. As the importance of efficient management and retrieval for video data increases, studies on the video parsing based on the visual features contained in the video contents are researched to reconstruct video data as the meaningful structure. The early studies on video parsing are focused on splitting video data into shots, but detecting the shot boundary defined with the physical boundary does not cosider the semantic association of video data. Recently, studies on structuralizing video shots having the semantic association to the video scene defined with the semantic boundary by utilizing clustering methods are actively progressed. Previous studies on detecting the video scene try to detect video scenes by utilizing clustering algorithms based on the similarity measure between video shots mainly depended on color features. However, the correct identification of a video shot or scene and the detection of the gradual transitions such as dissolve, fade and wipe are difficult because color features of video data contain a noise and are abruptly changed due to the intervention of an unexpected object. In this paper, to solve these problems, we propose the Scene Detector by using Color histogram, corner Edge and Object color histogram (SDCEO) that clusters similar shots organizing same event based on visual features including the color histogram, the corner edge and the object color histogram to detect video scenes. The SDCEO is worthy of notice in a sense that it uses the edge feature with the color feature, and as a result, it effectively detects the gradual transitions as well as the abrupt transitions. The SDCEO consists of the Shot Bound Identifier and the Video Scene Detector. The Shot Bound Identifier is comprised of the Color Histogram Analysis step and the Corner Edge Analysis step. In the Color Histogram Analysis step, SDCEO uses the color histogram feature to organizing shot boundaries. The color histogram, recording the percentage of each quantized color among all pixels in a frame, are chosen for their good performance, as also reported in other work of content-based image and video analysis. To organize shot boundaries, SDCEO joins associated sequential frames into shot boundaries by measuring the similarity of the color histogram between frames. In the Corner Edge Analysis step, SDCEO identifies the final shot boundaries by using the corner edge feature. SDCEO detect associated shot boundaries comparing the corner edge feature between the last frame of previous shot boundary and the first frame of next shot boundary. In the Key-frame Extraction step, SDCEO compares each frame with all frames and measures the similarity by using histogram euclidean distance, and then select the frame the most similar with all frames contained in same shot boundary as the key-frame. Video Scene Detector clusters associated shots organizing same event by utilizing the hierarchical agglomerative clustering method based on the visual features including the color histogram and the object color histogram. After detecting video scenes, SDCEO organizes final video scene by repetitive clustering until the simiarity distance between shot boundaries less than the threshold h. In this paper, we construct the prototype of SDCEO and experiments are carried out with the baseline data that are manually constructed, and the experimental results that the precision of shot boundary detection is 93.3% and the precision of video scene detection is 83.3% are satisfactory.
Keywords
Video Scene Detection; Shot Clustering; Shot Boundary Identification; Video Parsing; Visual Feature Extraction;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 김광백, 윤홍원, 노영욱, "컬러 정보와 퍼지 C-means 알고리즘을 이용한 주차관리 시스템 개발", 지능정보연구, 8권 1호(2002), 87-101.
2 이연호, 오경진, 신위살, 조근식, "링크드 데이터를 이용한 협업적 비디오 어노테이션 및 브라우징시스템", 지능정보연구, 17권 3호(2011), 203-219.
3 허진경, 김향태, "히스토그램 분포도 역추적 변경에 의한 영상 강조", 지능정보연구, l권 8호(2004), 1-11.
4 Amiri, A., N. Abdollahi, M. Jafari, M. Fathy, "Hierarchical Key-Frame Based Video Shot Clustering Using Generalized Trace Kernel", Communicationsin Computer and Information Science, Vol.241, No.5(2011), 251-257.
5 Chasanis, V., A. Likas, and N. Galatsanos, "Scene Detection in Videos Using Shot Clustering and Sequence Alignment", IEEE Transactionson Multimedia, Vo1.11, No.1(2009), 89-100.   DOI
6 Gao, X., J. Li, and Y. Shi, "A Video Shot Boundary Detection Algorithm Based on Feature Tracking", In Proceedings of the Rough Sets and Knowledge Technology, (2006), 651-658.
7 Gargi, U., R. Kasturi, and S. Strayer, "Performance characterization of video-shot-change detection methods", IEEE Transactions on Circuits and Systems for Video Technology, Vol.10(2000), 1-13.
8 Hanjalic, A., R. Lagendijk, and J. Biemond, "Automated high-level movie segmentation for advanced video-retrieval systems", IEEE Transactionson Circuitsand Systems for Video Technology, Vol.9, No.4(1999), 580-588.   DOI   ScienceOn
9 Huang, C., H. Lee, and C. Chen, "Shot Change Detection via Local Keypoint Matching", IEEE Transactionson Multimedia, Vol.10, No.6(2008), 1097-1108.   DOI
10 Lee, M., Y. Yang, and S. Lee, "Automatic video parsing using shot boundary detection and camera operation analysis", Journal of the Pattern Recognition Society, Vol.34, No.3(2001), 711-719.   DOI   ScienceOn
11 Lu, H., Y. Tan, and X. Xue, "Real-Time, Adaptive, and Locality-Based Graph Partitioning Method for Video Scene Clustering", IEEE Transactionson Circuitsand Systems for Video Technology, Vol.21, No.11(2011), 1747-1759.   DOI
12 Manning, C. and P. Raghavan, H. Schutze, "Introduction to Information Retrieval", Cambridge University Press, 2008.
13 Mohonta, P., S. Saha, and B. Chanda, "A Hueristic Algorithm for Video Scene Detection Using Shot Cluster Sequence Analysis", In Proceedingsof the 7th Indian Conferenceon Computer Vision, Graphicsand Image Processing, (2010), 464-471.
14 Pass, G., R. Zabih, and J. Miller, "Comparing Images Using Color Coherence Vectors", ACMConferenceon Multimedia, (1996), 65-74.
15 Rasheed Z. and M. Shah, "Detection and representation of scene in videos", IEEE Transactionsof Multimedia, Vol.7, No.6(2005), 1097- 1105.   DOI
16 Sakarya, U., Z. Telatar, "Video scene detection using graph-based representations", Signal Processing Image Communication, Vol.25, No.10(2010), 774-783.   DOI   ScienceOn
17 Sangoh, J., "Histogram-Based Color Image Retrieval", Technical Report, Psych221/EE362 Project, 2001.
18 Yeung, M. and B. Yeo, "Segmentation of video by clustering and graph analysis", Journal of Computer Visionand Image Understanding, Vol.71, No.1(1998), 97-109.
19 Sobel, I. and G. Feldman, "A 3x3 Isotropic Gradient Operator for Image Processing", InProceedings Pattern Classification and Scene Analysis, (1973), 271-272.
20 Truong, B., S. Venkatesh, and C. Dorai, "Scene extraction in motion picture", IEEE Transactionson Circuitsand Systems for Video Technology, Vol.13, No.1(2003), 5-15.   DOI   ScienceOn
21 Yeung, M. and B. Yeo, "Time-Constrained Clustering for Segmentation of Video into Story Units", InProceedings of 13th International Conference on Pattern Recognition, Vol.3(1996), 375-380.
22 Zhu, S. and Y. Liu, "Video scene segmentation and semantic representation using a novel scheme", Multimedia Tools and Applications, Vol.42, No.2(2009), 183-205.   DOI   ScienceOn