• Title/Summary/Keyword: Fast mode decision

Search Result 170, Processing Time 0.03 seconds

Efficient Rate Control by Fast Adaptive Mode Selection

  • Ryu, Chul
    • The Journal of the Acoustical Society of Korea
    • /
    • v.18 no.4E
    • /
    • pp.43-50
    • /
    • 1999
  • A fast converging coding algorithm that adaptively selects the modes of macroblocks is introduced. For a given frame, the optimal modes are selected based on the decision curves that minimize the overall distortion at a given bit rate. The method proposed in this paper is different from the conventional ones in that it does not manipulate the quantizer to meet the target bit rate but it satisfies the target bit rate by finding optimal modes of macroblocks which result consistent visual quality. Lagrange multiplier of the unconstrained cost function is controlled to trigger decision curves to generate appropriate modes to meet bit rate and the curve is obtained by utilizing simulated annealing optimization technique. The algorithm is implemented within H.261 video codec and simulation results demonstrate superior visual quality.

  • PDF

Fast Intra Prediction Mode Decision for HEVC (HEVC의 고속 화면내 예측 모드 결정 기법)

  • Chang, Yong-Jun;Kim, Dong-Hyun;Kim, Jae-Gon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2013.11a
    • /
    • pp.174-175
    • /
    • 2013
  • HEVC의 화면내 예측 부호화는 예측 모드를 최대 35개까지 확장함으로써 기존H.264/AVC에 비해서 향상된 부호화 효율을 갖는다. 반면 화면내 부호화의 복잡도 또한 크게 증가하여 복잡도 감소를 위한 고속 부호화 기법이 요구된다. HEVC의 차조모델인 HM에서는 화면내 부호화 고속화를 위해 RMD(Rough Mode Decision) 과정을 통하여 후보 모드를 결정하고 선택된 후보 모드에서 초종 예측 모드를 결정한다. 본 논문에서는 화면내 부호화의 복잡도 감소를 위하여 고속 화면내 예측 모드 결정 기법을 제시한다. 본 기법은 후보 모드를 결정하기 위한 RMD 과정에서의 탐색 모드 수와후보 모드로부터 최종 모드를 결정하기 위한 탐색 모드 수를 제한하는 방법을 결합하여 모드 결정을 위한 복잡도를 감소한다. 본 제안 기법은 실험결과 HM 12.0대비 1.0%의 비트 증가로 13.19%의 복잡도를 감소시킬 수 있었다.

  • PDF

Enhanced Adaptive Mode Decision of H.264 Based on Efficient AZCB Prediction (능률적 AZCB 예측 기반 H.264 적응 모드 결정 개선 알고리즘)

  • Kim, Yang-Soo;Kim, Yong-Goo;Choe, Yoon-Sik;Choi, Yung-Ho
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.56 no.11
    • /
    • pp.2036-2039
    • /
    • 2007
  • This paper proposes an enhanced adaptive mode decision scheme for fast H.264 encoders. By efficiently predicting AZCB (All Zero Coefficient Block), the proposed scheme can encode motion pictures in H.264 up to 2.86 and 1.68 times faster than JM9.3 and AMD [1], respectively. Besides, this scheme significantly reduces the encoding performance fluctuation of AMD across tested bit-rates and video sequences.

CNN-based Fast Split Mode Decision Algorithm for Versatile Video Coding (VVC) Inter Prediction

  • Yeo, Woon-Ha;Kim, Byung-Gyu
    • Journal of Multimedia Information System
    • /
    • v.8 no.3
    • /
    • pp.147-158
    • /
    • 2021
  • Versatile Video Coding (VVC) is the latest video coding standard developed by Joint Video Exploration Team (JVET). In VVC, the quadtree plus multi-type tree (QT+MTT) structure of coding unit (CU) partition is adopted, and its computational complexity is considerably high due to the brute-force search for recursive rate-distortion (RD) optimization. In this paper, we aim to reduce the time complexity of inter-picture prediction mode since the inter prediction accounts for a large portion of the total encoding time. The problem can be defined as classifying the split mode of each CU. To classify the split mode effectively, a novel convolutional neural network (CNN) called multi-level tree (MLT-CNN) architecture is introduced. For boosting classification performance, we utilize additional information including inter-picture information while training the CNN. The overall algorithm including the MLT-CNN inference process is implemented on VVC Test Model (VTM) 11.0. The CUs of size 128×128 can be the inputs of the CNN. The sequences are encoded at the random access (RA) configuration with five QP values {22, 27, 32, 37, 42}. The experimental results show that the proposed algorithm can reduce the computational complexity by 11.53% on average, and 26.14% for the maximum with an average 1.01% of the increase in Bjøntegaard delta bit rate (BDBR). Especially, the proposed method shows higher performance on the sequences of the A and B classes, reducing 9.81%~26.14% of encoding time with 0.95%~3.28% of the BDBR increase.

Fast Decision Method of Geometric Partitioning Mode and Block Partitioning Mode using Hough Transform in VVC (허프 변환을 이용한 VVC의 기하학 분할 모드 및 블록 분할 고속 결정 방법)

  • Lee, Minhun;Park, Juntaek;Bang, Gun;Lim, Woong;Sim, Donggyu;Oh, Seoung-Jun
    • Journal of Broadcast Engineering
    • /
    • v.25 no.5
    • /
    • pp.698-708
    • /
    • 2020
  • VVC (Versatile Video Coding), which has been developing as a next generation video coding standard. Compared to HEVC (High Efficiency Video Coding), VVC is improved by about 34% in RA (Random Access) configuration and about 30% in LDB (Low-Delay B) configuration by adopting various techniques such as recursive block partitioning structure and GPM (Geometric Partitioning Mode). But the encoding complexity is increased by about 10x and 7x, respectively. In this paper, we propose a fast decision method of GPM mode and block partitioning using directionality of block to reduce encoding complexity of VVC. The proposed method is to apply the Hough transform to the current block to identify the directionality of the block, thereby determining the GPM mode and the specific block partitioning method to be skipped in the rate-distortion cost search process. As a result, compared to VTM8.0, the proposed method reduces about 31.01% and 29.84% encoding complexity for RA and LDB configuration with 2.48% and 2.69% BD-rate loss, respectively.

Fast Intra-Mode Decision for H.264/AVC using Inverse Tree-Structure (H.264/AVC 표준에서 역트리 구조를 이용하여 고속으로 화면내 모드를 결정하는 방법)

  • Ko, Hyun-Suk;Yoo, Ki-Won;Seo, Jung-Dong;Sohn, Kwang-Hoon
    • Journal of Broadcast Engineering
    • /
    • v.13 no.3
    • /
    • pp.310-318
    • /
    • 2008
  • The H.264/AVC standard achieves higher coding efficiency than previous video coding standards with the rate-distortion optimization (RDO) technique which selects the best coding mode and reference frame for each macroblock. As a result, the complexity of the encoder have been significantly increased. In this paper, a fast intra-mode decision algorithm is proposed to reduce the computational load of intra-mode search, which is based on the inverse tree-structure edge prediction algorithm. First, we obtained the dominant edge for each $4{\times}4$ block from local edge information, then the RDO process is only performed by the mode which corresponds to dominant edge direction. Then, for the $8{\times}8$ (or $16{\times}16$) block stage, the dominant edge is calculated from its four $4{\times}4$ (or $16{\times}16$) blocks' dominant edges without additional calculation and the RDO process is also performed by the mode which is related to dominant edge direction. Experimental results show that proposed scheme can significantly improve the speed of the intra prediction with a negligible loss in the peak signal to noise ratio (PSNR) and a little increase of bits.

A Mode Selection Algorithm using Scene Segmentation for Multi-view Video Coding (객체 분할 기법을 이용한 다시점 영상 부호화에서의 예측 모드 선택 기법)

  • Lee, Seo-Young;Shin, Kwang-Mu;Chung, Ki-Dong
    • Journal of KIISE:Information Networking
    • /
    • v.36 no.3
    • /
    • pp.198-203
    • /
    • 2009
  • With the growing demand for multimedia services and advances in display technology, new applications for 3$\sim$D scene communication have emerged. While multi-view video of these emerging applications may provide users with more realistic scene experience, drastic increase in the bandwidth is a major problem to solve. In this paper, we propose a fast prediction mode decision algorithm which can significantly reduce complexity and time consumption of the encoding process. This is based on the object segmentation, which can effectively identify the fast moving foreground object. As the foreground object with fast motion is more likely to be encoded in the view directional prediction mode, we can properly limit the motion compensated coding for a case in point. As a result, time savings of the proposed algorithm was up to average 45% without much loss in the quality of the image sequence.

Transform Skip Mode Fast Decision Method for HEVC Encoding (HEVC 부호화를 위한 변환생략 모드 고속 선택 방법)

  • Yang, Seungha;Shim, Hiuk Jae;Lee, Dahee;Jeon, Byeungwoo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.39A no.4
    • /
    • pp.172-179
    • /
    • 2014
  • HEVC (High Efficiency Video Coding) fine-tuned many existing coding tools and adopted also many new coding techniques. As a result, HEVC has accomplished about 2 times of compression efficiency enhancement compared to the existing video coding standard of H.264/AVC. One of the newly adopted tools in HEVC is the transform skip scheme which performs quantization without transform. This technique improves coding efficiency especially with computer-generated images. However, the unavailability of global or local properties of general video signals demands encoder to decide whether performing transform or not for each TU (Transform Unit). The necessity of computing rate-distortion costs for this decision is one reason to increase encoder complexity. In this paper, a fast transform skip mode decision method is proposed, which is based on the fast decision of rate-distortion cost calculation for transform skip mode, by considering frequency characteristics of residual signal. The proposed method can reduce $4{\times}4$ TU encoding time by about 27.1% with only about 0.03% consequential decrement in BDBR.

The Algorithm of Angular Mode Selection for High Performance HEVC Intra Prediction (고성능 HEVC 화면내 예측을 위한 Angular 모드 선택 알고리즘)

  • Park, Seungyong;Ryoo, Kwangki
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2016.10a
    • /
    • pp.969-972
    • /
    • 2016
  • In this paper, we propose an algorithm of angular mode selection for high-performance HEVC intra prediction. HEVC intra prediction is used to remove the spatial redundancy. Intra prediction has a total of 35 modes and block size of $64{\times}64$ to $4{\times}4$. Intra prediction has a high amount of calculation and operational time due to performing all 35 modes for each block size for the best cost. The angular mode algorithm proposed has a simple difference between pixels of the original image and the selected angular mode. A decision is made to select one angular mode plus planar mode and DC mode to perform the intra prediction and determine the mode with the best cost. In effect, only three modes are executed compared to the traditional 35 modes. Performance evaluation index used are BD-PSNR and BD-Bitrate. For the proposed algorithm, BD-PSNR results averagely increased by 0.035 and BD-Bitrate decreased by 0.623 relative to the HM-16.9 intra prediction. In addition, the encoding time is decreased by about 6.905%.

  • PDF

A Non-parametric Fast Block Size Decision Algorithm for H.264/AVC Intra Prediction

  • Kim, Young-Ju
    • Journal of information and communication convergence engineering
    • /
    • v.7 no.2
    • /
    • pp.193-198
    • /
    • 2009
  • The H.264/ AVC video coding standard supports the intra prediction with various block sizes for luma component and a 8x8 block size for chroma components. This new feature of H.264/AVC offers a considerably higher improvement in coding efficiency compared to previous compression standards. In order to achieve this, H.264/AVC uses the Rate-distortion optimization (RDO) technique to select the best intra prediction mode for each block size, and it brings about the drastic increase of the computation complexity of H.264 encoder. In this paper, a fast block size decision algorithm is proposed to reduce the computation complexity of the intra prediction in H.264/AVC. The proposed algorithm computes the smoothness based on AC and DC coefficient energy for macroblocks and compares with the nonparametric criteria which is determined by considering information on neighbor blocks already reconstructed, so that deciding the best probable block size for the intra prediction. Also, the use of non-parametric criteria makes the performance of intra-coding not be dependent on types of video sequences. The experimental results show that the proposed algorithm is able to reduce up to 30% of the whole encoding time with a negligible loss in PSNR and bitrates and provides the stable performance regardless types of video sequences.