• Title/Summary/Keyword: Video frame type

Search Result 73, Processing Time 0.02 seconds

Efficient Shot Change Detection Using Clustering Method on MPEG Video Frames (MPEG 비디오 프레임에서 FCM 클러스터링 기법을 이용한 효과적인 장면 전환 검출)

  • Lim, Seong-Jae;Lee, Bae-Ho
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2000.10a
    • /
    • pp.751-754
    • /
    • 2000
  • In this paper, we propose an efficient method to detect abrupt shot changes in compressed MPEG video data by using reference ratios among video frames. The reference ratios among video frames imply the degree of similarities among adjacent frames by prediction coded type of each frames. A shot change is detected if the similarity degrees of a frame and its adjacent frames are low. This paper proposes an efficient shot change detection algorithm by using Fuzzy c-means(FCM) clustering algorithm. The FCM clustering uses the shot change probabilities evaluated in the mask matching of reference ratios and difference measure values based on frame reference ratios.

  • PDF

Video analysis using re-constructing of motion vectors on MPEG compressed domain (압축영역에서 움직임 벡터의 재추정을 이용한 비디오 해석 기법)

  • Kim, Nak-U;Kim, Tae-Yong;Gang, Eung-Gwan;Choe, Jong-Su
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.39 no.3
    • /
    • pp.78-87
    • /
    • 2002
  • A macroblock(MB) in MPEG coded domain can have zero, one, or two motion vectors depending on its frame type and prediction direction (forward-, backward-, or hi-directionally). In this paper, we propose a method that converts these motion vectors on MPEG coded domain as a uniform set, independent of the frame type and the direction of prediction, and directly utilizes these re-analyzed motion vectors for understanding video contents. Also, using this frame-type-independent motion vector, we propose novel methods for detecting and tracking moving objects with frame-based detection accuracy on the compressed domain. These algorithms are performed directly from the MPEG bitstreams after VLC decoding with little time consumption. Experimental results show validity and outstanding performance of our methods.

A Fast Intra Skip Detection Algorithm for H.264/AVC Video Encoding

  • Kim, Byung-Gyu;Kim, Jong-Ho;Cho, Chang-Sik
    • ETRI Journal
    • /
    • v.28 no.6
    • /
    • pp.721-731
    • /
    • 2006
  • A fast intra skip detection algorithm based on the ratedistortion (RD) cost for an inter frame (P-slices) is proposed for H.264/AVC video encoding. In the H.264/AVC coding standard, a robust rate-distortion optimization technique is used to select the best coding mode and reference frame for each macroblock (MB). There are three types of intra predictions according to profiles. These are $16{\times}16$ and $4{\times}4$ intra predictions for luminance and an $8{\times}8$ intra prediction for chroma. For the high profile, an $8{\times}8$ intra prediction has been added for luminance. The $4{\times}4$ prediction mode has 9 prediction directions with 4 directions for $16{\times}16$ and $8{\times}8$ luma, and $8{\times}8$ chrominance. In addition to the inter mode search procedure, an intra mode search causes a significant increase in the complexity and computational load for an inter frame. To reduce the computational load of the intra mode search at the inter frame, the RD costs of the neighborhood MBs for the current MB are used and we propose an adaptive thresholding scheme for the intra skip extraction. We verified the performance of the proposed scheme through comparative analysis of experimental results using joint model reference software. The overall encoding time was reduced up to 32% for the IPPP sequence type and 35% for the IBBPBBP sequence type.

  • PDF

Scen based MPEG video traffic modeling considering the correlations between frames (프레임간 상관관계를 고려한 장면기반 MPEG 비디오 트래픽 모델링)

  • 유상조;김성대;최재각
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.23 no.9A
    • /
    • pp.2289-2304
    • /
    • 1998
  • For the performance analysis and traffic control of ATM networks carrying video sequences, need an appropriate video traffic model. In this paper, we propose a new traffic model for MPEG compressed videos which are widely used for any type of video applications at th emoment. The proposed modeling scheme uses scene-based traffic characteristics and considers the correlation between frames of consecutiv GOPs. Using a simple scene detection algorithm, scene changes are modeled by state transitions and the number of GOPs of a scene state is modeled by a geometric distirbution. Frames of a scene stte are modeled by mean I, P, and B frame size. For more accurate traffic modeling, quantization errors (residual bits) that the state transition model using mean values has are compensated by autoregressive processes. We show that our model very well captures the traffic chracteristics of the original videos by performance analysis in terms of autocorrelation, histogram of frame bits genrated by the model, and cell loss rate in the ATM multiplexer with limited buffers. Our model is able to perrorm translations between levels (i.e., GOP, frame, and cell levels) and to estimate very accurately the stochastic characteristics of the original videos by each level.

  • PDF

Fuzzy Logic Based Temporal Error Concealment for H.264 Video

  • Lee, Pei-Jun;Lin, Ming-Long
    • ETRI Journal
    • /
    • v.28 no.5
    • /
    • pp.574-582
    • /
    • 2006
  • In this paper, a new error concealment algorithm is proposed for the H.264 standard. The algorithm consists of two processes. The first process uses a fuzzy logic method to select the size type of lost blocks. The motion vector of a lost block is calculated from the current frame, if the motion vectors of the neighboring blocks surrounding the lost block are discontinuous. Otherwise, the size type of the lost block can be determined from the preceding frame. The second process is an error concealment algorithm via a proposed adapted multiple-reference-frames selection for finding the lost motion vector. The adapted multiple-reference-frames selection is based on the motion estimation analysis of H.264 coding so that the number of searched frames can be reduced. Therefore the most accurate mode of the lost block can be determined with much less computation time in the selection of the lost motion vector. Experimental results show that the proposed algorithm achieves from 0.5 to 4.52 dB improvement when compared to the method in VM 9.0.

  • PDF

Algorithm for Realization Nonlinear Compressed Domain Video/Audio Editor (비선형 압축 영상 편집기 구현 알고리즘)

  • 박종준;정민교;이진호;송문호;김운경
    • Proceedings of the IEEK Conference
    • /
    • 1999.11a
    • /
    • pp.1045-1048
    • /
    • 1999
  • In this paper, we report on a set of new algorithms to realize a nonlinear compressed-domain video/audio editor that overcomes various realization problems. For efficiency, the underlying algorithm, which uses a central data structure in the form of doubled linked lists, performs soft edits of cut and paste (which, in turn, involves soft implementations of frame type conversion) and addresses problems relating to video/audio synchronization and random access, and decoder buffer control.

  • PDF

A Wavelet-Based Video Watermarking Approach Robust to Re-encoding

  • Yoo, Kil-Sang;Lee, Won-Hyung
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.33 no.1C
    • /
    • pp.124-130
    • /
    • 2008
  • We present in this paper a method of digital watermarking for video data based on the discrete wavelet transform. In the proposed method, a watermark signal is inserted into the decompressed bitstream while detection is performed using the uncompressed video. This method allows detection if video has been manipulated or its format changed. We embed the watermark in the lowest frequency components of each frame in the un-coded video by using wavelet transform. The watermark can be extracted directly from the decoded video without access to the original video. Experimental results show that the proposed method gives the watermarked video of better quality and is robust against MPEG coding, down sampling and re-encoding to other type of video format such as MPEG4, H.264

Motion Flow Analysis using Bi-directional Prediction-Independent Framework in MPEG Compressed Domain (압축 영역에서의 양방향 예측 구조를 이용한 움직임 흐름 분석)

  • 김낙우;김태용;최종수
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.5
    • /
    • pp.13-22
    • /
    • 2004
  • Because video sequence consists of dynamic objects in nature, the object motion in video is an effective feature in describing the contents of video sequence and motion feature plays an important role in video retrieval. In this paper, we propose a method that converts motion vectors (MVs) to a uniform set on MPEG coded domain, independent of the frame type and the direction of prediction, and utilizes these normalized MVs (N-MVs) as motion descriptor to understand video contents. We describe a frame-type independent representation of the various types of frames presented in an MPEG video in which all frames can be considered equivalently, without full-decoding. In the experiments, we show that the proposed method is better than the conventional one in terms of performance.

A Scheduler and Scheduling Algorithm for Time Slot Assignment based on Wavelength (파장 단위의 Time Solt 할당을 위한 스케줄러 및 스케줄링 알고리즘)

  • Kim Kyoung-Mok;Oh Young-Hwan
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.1B
    • /
    • pp.1-7
    • /
    • 2004
  • Increase of internet users and new type of applied traffic such as game, news, distributed computing, online image conference, and real time audio and video have leaded to demand for more bandwidth for each application. This algorithm represents a complex optical exchanger having typical wavelength switching function and time-slotted transmission function. Performance assessment of the proposed OXC (Optical Cross connect) sttucture defines LFS (Limit Frame Size) and VFS (Variable Frame Size) for classification by packet type and calculates the channel effect and loss probability depending the demanded bandwidth by access node increase. Optical exchanger in this type of structure can guarantee future network expansion as well as decrease of frame collision resulted from node increase.

Stereoscopic Conversion based on Key Frames (키 프레임 기반 스테레오스코픽 변환 방법)

  • 김만배;박상훈
    • Journal of Broadcast Engineering
    • /
    • v.7 no.3
    • /
    • pp.219-228
    • /
    • 2002
  • In this paper, we propose a new method of converting 2D video into 3D stereoscopic video, called stereoscopic conversion. In general, stereoscopic images are produced using the motion informations. However unreliable motion informations obtained especially from block-based motion estimation cause the wrong generation of stereoscopic images. To solve for this problem, we propose a stereoscopic conversion method based upon the utilization of key frame that has the better accuracy of estimated motion informations. As well, as generation scheme of stereoscopic images associated with the motion type of each key frame is proposed. For the performance evaluation of our proposed method, we apply it to five test images and measure the accuracy of key frame-based stereoscopic conversion. Experimental results show that our proposed method has the accuracy more than about 90 percent in terms of the detection ratio of key frames.