• Title/Summary/Keyword: hierarchical B picture

PSNR Estimation of H.264/AVC Bitstream for Hierarchical B-Picture Structure (계층적 B-픽쳐 구조를 고려한 H.264/AVC 비트열의 PSNR 예측)

  • Seo, Jung-Dong;Sohn, Kwang-Hoon
    • Journal of Broadcast Engineering / v.16 no.6 / pp.996-1008 / 2011
  • This paper proposes a PSNR estimation algorithm for H.264/AVC bitstreams coded with a hierarchical B-picture structure. The proposed method consists of a modeling method for the DCT coefficients of I-pictures and an error estimation method for blocks encoded in skip mode. The modeling method selects the more appropriate of the Laplacian and Cauchy models, and the skip-mode error is estimated from the MSE values of the reference pictures. Experimental results show that the modeling method predicts PSNR more accurately than either the Laplacian or the Cauchy model alone, and that the skip-mode error estimation improves PSNR estimation for the hierarchical B-picture structure.
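
The relation between MSE and PSNR that underlies this kind of estimation can be sketched as follows. This is a minimal illustration, not the authors' algorithm: the 8-bit peak value and the rule of averaging the reference-picture MSEs for a skipped block (`estimate_skip_mse`, a hypothetical helper) are assumptions, since the abstract does not state how the reference MSE values are combined.

```python
import math


def psnr_from_mse(mse: float, peak: float = 255.0) -> float:
    """Return PSNR in dB for a given mean squared error.

    peak=255 assumes 8-bit samples (an assumption, not from the abstract).
    """
    if mse <= 0:
        return math.inf  # identical pictures: PSNR is unbounded
    return 10.0 * math.log10(peak * peak / mse)


def estimate_skip_mse(reference_mses):
    """Hypothetical rule: approximate the error of a skip-mode block by the
    mean of the MSE values of its reference pictures."""
    return sum(reference_mses) / len(reference_mses)


if __name__ == "__main__":
    skip_mse = estimate_skip_mse([4.0, 6.0])
    print(f"estimated skip MSE: {skip_mse:.2f}")
    print(f"estimated PSNR: {psnr_from_mse(skip_mse):.2f} dB")
```

Because PSNR is a logarithmic function of MSE, errors are accumulated in the MSE domain and converted to dB only at the end.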

Image Coding Using DCT and Block Hierarchical Segmentation Finite-State Vector Quantization (DCT와 블록 계층 분할 유한상태 벡터 양자화를 이용한 영상 부호화)

  • Jo, Seong-Hwan;Kim, Eung-Seong
    • The Transactions of the Korea Information Processing Society / v.7 no.3 / pp.1013-1020 / 2000
  • In this paper, we propose an algorithm that hierarchically segments the blocks of an image using the discrete cosine transform (DCT) and performs finite-state vector quantization (FSVQ) on each block. Using features of the DCT coefficients, the image is segmented hierarchically into large smooth blocks and small edge blocks, and the block hierarchy information is transmitted. A separate codebook is constructed for each level of the block hierarchy, and the encoder transmits codeword indices using FSVQ, so that hierarchical segmentation reduces the number of encoded bits. For the Zelda and Boat images, the new algorithm improves picture quality by 1.97 dB and 2.85 dB over side-match VQ (SMVQ), and by 1.78 dB and 1.85 dB over hierarchical FSVQ (HFSVQ), respectively.

Adaptive Multiview Video Coding Scheme Based on Spatiotemporal Correlation Analyses

  • Zhang, Yun;Jiang, Gang-Yi;Yu, Mei;Ho, Yo-Sung
    • ETRI Journal / v.31 no.2 / pp.151-161 / 2009
  • In this paper, we propose an adaptive multiview video coding scheme based on spatiotemporal correlation analyses using hierarchical B pictures (AMVC-HBP). By integrating multiple prediction structures, the scheme targets overall encoding performance, including high compression efficiency, low complexity, fast random access, and view scalability. We also propose an in-coding mode-switching algorithm that enables AMVC-HBP to adaptively select a better prediction structure during encoding without any additional complexity. Experimental results show that, on average, AMVC-HBP outperforms the previous multiview video coding scheme based on H.264/MPEG-4 AVC using hierarchical B pictures (MVC-HBP) by 21.5% in complexity, by about 20% in random access, and by 11% to 15% in view scalability. In addition, AMVC-HBP achieves a distinct coding gain over MVC-HBP for dense and fast-moving sequences.

Automatic RTP Time-stamping Method for SVC Video Transmission (SVC 비디오 전송을 위한 RTP 타임스탬프 자동 생성 방법)

  • Seo, Kwang-Deok;Jung, Soon-Heung;Kim, Jae-Gon;Yoo, Jeong-Ju
    • The Journal of Korean Institute of Communications and Information Sciences / v.33 no.6C / pp.471-479 / 2008
  • In this paper, we propose a novel algorithm that automatically generates the RTP timestamp value required for RTP packetization in order to transmit SVC video over various IP networks such as the Internet. Unlike conventional single-layer coding algorithms such as H.263, MPEG-4, and H.264, SVC generates a single multi-layered bitstream composed of a base layer and one or more enhancement layers in order to provide temporal, spatial, and SNR scalability simultaneously. In particular, because temporal scalability is provided by a hierarchical B-picture prediction structure, the encoding (or transmission) order and the display order of pictures in SVC coding are completely decoupled. Thus, the timestamp value specified in the header of each RTP packet does not increase monotonically with the display time instant of each picture. Until now, no method for automatically generating an RTP timestamp when SVC video is loaded into an RTP packet has been introduced. In this paper, a novel automatic RTP timestamp generation method that exploits the TID (temporal ID) field of the SVC NAL unit header is proposed to accommodate SVC video transmission.
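
The decoupling of coding order and display order can be seen in a small sketch. It lists one possible coding order for a dyadic hierarchical B-picture GOP together with temporal IDs, and derives RTP timestamps from the display index. This is an illustration under assumptions, not the method proposed in the paper: the depth-first coding order, the 30 fps frame rate, and the function names are all chosen here for the example; the 90 kHz clock is the rate conventionally used for video over RTP.

```python
def hierarchical_b_order(gop_size):
    """Return (display_index, temporal_id) pairs in one possible coding order.

    Assumes a dyadic GOP (gop_size is a power of two). The key picture at
    display index gop_size is coded first, then the B pictures by bisection.
    """
    if gop_size < 1 or gop_size & (gop_size - 1):
        raise ValueError("gop_size must be a power of two")
    order = [(gop_size, 0)]  # key picture, temporal ID 0

    def split(lo, hi, tid):
        if hi - lo < 2:
            return  # no picture lies strictly between lo and hi
        mid = (lo + hi) // 2
        order.append((mid, tid))  # B picture predicted from lo and hi
        split(lo, mid, tid + 1)
        split(mid, hi, tid + 1)

    split(0, gop_size, 1)
    return order


def rtp_timestamps(order, fps=30, clock_hz=90000):
    """Map each picture to an RTP timestamp based on its display index."""
    ticks_per_frame = clock_hz // fps
    return [display * ticks_per_frame for display, _ in order]


if __name__ == "__main__":
    order = hierarchical_b_order(8)
    for (display, tid), ts in zip(order, rtp_timestamps(order)):
        print(f"display={display} TID={tid} timestamp={ts}")
```

In transmission order the timestamps jump back and forth, which is why a packetizer cannot simply add a fixed increment per picture.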

MPEG Video Segmentation using Two-stage Neural Networks and Hierarchical Frame Search (2단계 신경망과 계층적 프레임 탐색 방법을 이용한 MPEG 비디오 분할)

  • Kim, Joo-Min;Choi, Yeong-Woo;Chung, Ku-Sik
    • Journal of KIISE:Software and Applications / v.29 no.1_2 / pp.114-125 / 2002
  • In this paper, we propose a hierarchical segmentation method that first segments video data into shots by detecting cuts and dissolves, and then determines the types of camera operations or object movements in each shot. In our previous work[1], each picture group was classified into one of three categories, Shot (scene change), Move (camera operation or object movement), and Static (almost no change between images), by analysing the DC (Direct Current) component of I (Intra) frames. For this process, we designed a two-stage hierarchical neural network whose inputs combine multiple features. The system then detects the exact shot position and the types of camera operations or object movements by searching the P (Predicted) and B (Bi-directional) frames of the current picture group selectively and hierarchically. The statistical distributions of macroblock types in P or B frames are used for accurate detection of the cut position, and another neural network, with macroblock types and motion vectors together as inputs, is used to detect dissolves, types of camera operations, and object movements. The proposed method reduces processing time by using only the DC coefficients of I frames without decoding and by searching P and B frames selectively and hierarchically. The proposed method classified the picture groups with an accuracy of 93.9-100.0% and the cuts with an accuracy of 96.1-100.0% on three different types of video data. It also classified the types of camera movements or object movements with accuracies of 90.13% and 89.28% on two different types of video data.

GOP Adaptation Coding of H.264/SVC Based on Precise Positions of Video Cuts

  • Liu, Yunpeng;Wang, Renfang;Xu, Huixia;Sun, Dechao
    • KSII Transactions on Internet and Information Systems (TIIS) / v.8 no.7 / pp.2449-2463 / 2014
  • Hierarchical B-frame coding was introduced into H.264/SVC to provide temporal scalability and improve coding performance. A content analysis-based adaptive group of picture structure (AGS) can further improve the coding efficiency, but damages the inter-frame correlation and temporal scalability of hierarchical B-frame to different degrees. In this paper, we propose a group of pictures (GOP) adaptation coding method based on the positions of video cuts. First, the cut positions are accurately detected by the combination of motion coherence (MC) and mutual information (MI); then the GOP is adaptively and proportionately set by the analysis of MC in one scene. In addition, we propose a binary tree algorithm to achieve the temporal scalability of any size of GOP. The results for test sequences and real videos show that the proposed method reduces the bit rate by up to about 15%, achieves a performance gain of about 0.28-1.67 dB over a fixed GOP, and has the advantages of better transmission resilience and video summaries.
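
One way a binary tree can yield temporal layers for a GOP of any size is repeated bisection with an integer midpoint. The sketch below is an assumption about the general idea, not the algorithm from the paper, and the names `temporal_layers` and `decodable` are hypothetical.

```python
def temporal_layers(gop_size):
    """Map each display index in 1..gop_size to a temporal layer.

    The last picture of the GOP is the key picture (layer 0). The remaining
    pictures are assigned by bisecting each interval, so gop_size does not
    have to be a power of two.
    """
    if gop_size < 1:
        raise ValueError("gop_size must be positive")
    layers = {gop_size: 0}
    stack = [(0, gop_size, 1)]  # (left ref, right ref, layer of the midpoint)
    while stack:
        lo, hi, layer = stack.pop()
        if hi - lo < 2:
            continue  # nothing strictly between the two references
        mid = (lo + hi) // 2
        layers[mid] = layer
        stack.append((lo, mid, layer + 1))
        stack.append((mid, hi, layer + 1))
    return layers


def decodable(layers, max_layer):
    """Display indices that remain when layers above max_layer are dropped."""
    return sorted(idx for idx, layer in layers.items() if layer <= max_layer)


if __name__ == "__main__":
    layers = temporal_layers(6)
    print(layers)
    print(decodable(layers, 1))  # pictures kept at a reduced frame rate
```

With a non-dyadic GOP the tree is unbalanced, so dropping a layer no longer halves the frame rate exactly, but each picture still depends only on pictures in lower layers.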

Loss-Aware Rate-Distortion Optimization for Redundant Picture in H.264/AVC (H.264/AVC를 위한 에러 적응형 예비 슬라이스)

  • Lee, Jin-Ho;Choi, Jin-Soo;Choi, Hae-Chul
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2008.11a / pp.85-88 / 2008
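  • The redundant picture is one of the tools in the H.264/AVC coding standard for compensating for errors when video is transmitted in error-prone environments. This paper presents an LA-RDO (loss-aware rate-distortion optimization) model that accounts for the overhead bits incurred by inserting redundant pictures and for the loss rate, and decides whether to insert a redundant picture on the basis of the proposed model. Experimental results show that the proposed method increases PSNR by 0.5 dB on average compared with HRP (hierarchical redundant picture) in the reference software (JM: Joint Model). Because it performs relatively better at low bit rates, the method appears well suited to wireless transmission environments that require low-bit-rate video, such as video conferencing.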

Temporal Prediction Structure for Multi-view Video Coding (다시점 비디오 부호화를 위한 시간적 예측 구조)

  • Yoon, Hyo-Sun;Kim, Mi-Young
    • Journal of Korea Multimedia Society / v.15 no.9 / pp.1093-1101 / 2012
  • Multi-view video is obtained by capturing one three-dimensional scene with many cameras at different positions. Multi-view video coding exploits inter-view correlations among pictures of neighboring views and temporal correlations among pictures of the same view. Because it uses many cameras, multi-view video coding requires a method to reduce computational complexity. In this paper, we propose an efficient prediction structure to improve the performance of multi-view video coding. The proposed prediction structure exploits the average distance between the current picture and its reference pictures. It divides every GOP into several small groups to decide the maximum index of the hierarchical B layers and the number of pictures in each B layer. Experimental results show that the proposed prediction structure performs well in both image quality and bit rate. Compared with the hierarchical B pictures of Fraunhofer-HHI, the proposed prediction structure achieved a PSNR gain of 0.07 to 0.13 dB and reduced the bit rate by 6.5 kbps.
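
The quantity the prediction structure relies on, the average distance between a picture and its reference pictures, can be computed directly from a reference table. The sketch below is only an illustration: the two example tables (`HIERARCHICAL_GOP8` and `FLAT_GOP8`) and the function name are assumptions made here, and the way the paper uses the distance to split a GOP is not reproduced.

```python
def average_reference_distance(references):
    """Mean absolute display-order distance between pictures and their refs.

    references maps a picture's display index to the display indices of the
    pictures it is predicted from.
    """
    distances = [
        abs(current - ref)
        for current, refs in references.items()
        for ref in refs
    ]
    if not distances:
        raise ValueError("no reference relations given")
    return sum(distances) / len(distances)


# Dyadic hierarchical B structure for a GOP of 8 (pictures 0 and 8 are keys).
HIERARCHICAL_GOP8 = {
    4: [0, 8],
    2: [0, 4], 6: [4, 8],
    1: [0, 2], 3: [2, 4], 5: [4, 6], 7: [6, 8],
}

# Illustrative alternative: every B picture predicted from the two key pictures.
FLAT_GOP8 = {i: [0, 8] for i in range(1, 8)}

if __name__ == "__main__":
    print(average_reference_distance(HIERARCHICAL_GOP8))
    print(average_reference_distance(FLAT_GOP8))
```

A shorter average distance generally means references that are more similar to the current picture, which is the motivation for using it as a design criterion.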

A Fast Mode Decision of Non-anchor Pictures in Multi-view Video Coding for 3D Applications (3D 응용을 위한 다시점 영상 부호화에서 비기준 화면의 빠른 모드결정 기법)

  • Jung, Choong-Hyun;Shin, Kwang-Mu;Park, Seong-Ho;Chung, Ki-Dong
    • Journal of Korea Multimedia Society / v.15 no.7 / pp.859-869 / 2012
  • Multi-view Video Coding (MVC), which exploits disparities between views, has been developed to improve the coding efficiency of multi-view video. However, MVC has high computational complexity because of disparity estimation. This paper proposes a fast mode decision for non-anchor pictures to reduce the computation time of MVC. The proposed method has two phases. In a hierarchical B-picture structure, the prediction modes of non-anchor pictures are highly correlated with those of the anchor pictures, so in the first phase the prediction mode of a non-anchor picture is selected by exploiting the macroblock regions of the anchor picture. In the second phase, the reference direction of the inter prediction mode is selected by exploiting the high correlation among the reference directions of the inter prediction modes of the 7 block sizes. Experimental results show that the proposed method saves about 44% of the encoding time on average with negligible loss in coding efficiency.

Improved Prediction Structure and Motion Estimation Method for Multi-view Video Coding (다시점 비디오 부호화를 위한 개선된 예측 구조와 움직임 추정 기법)

  • Yoon, Hyo Sun;Kim, Mi Young
    • Journal of KIISE / v.41 no.11 / pp.900-910 / 2014
  • Multi-view video is obtained by capturing one three-dimensional scene with many cameras at different positions. The computational complexity of multi-view video coding increases in proportion to the number of cameras. To reduce computational complexity while maintaining image quality, this paper proposes an improved prediction structure and motion estimation method. The proposed prediction structure exploits the average distance between the current picture and its reference pictures. It divides every GOP into several groups to decide the maximum index of the hierarchical B layers and the number of pictures in each B layer. The proposed motion estimation method uses a hierarchical search strategy consisting of a modified diamond search pattern, a progressive diamond search pattern, and a modified raster search pattern. Experimental results show that, compared with the JMVC (Joint Multiview Video Coding) reference model using the hierarchical B pictures of Fraunhofer-HHI and the TZ search method, the proposed prediction structure and motion estimation method reduce complexity by up to 40 to 70% while maintaining similar video quality and bit rates.