Search | Korea Science

CNN-based Fast Split Mode Decision Algorithm for Versatile Video Coding (VVC) Inter Prediction

Yeo, Woon-Ha;Kim, Byung-Gyu
- Journal of Multimedia Information System
- /
- v.8 no.3
- /
- pp.147-158
- /
- 2021
Versatile Video Coding (VVC) is the latest video coding standard developed by Joint Video Exploration Team (JVET). In VVC, the quadtree plus multi-type tree (QT+MTT) structure of coding unit (CU) partition is adopted, and its computational complexity is considerably high due to the brute-force search for recursive rate-distortion (RD) optimization. In this paper, we aim to reduce the time complexity of inter-picture prediction mode since the inter prediction accounts for a large portion of the total encoding time. The problem can be defined as classifying the split mode of each CU. To classify the split mode effectively, a novel convolutional neural network (CNN) called multi-level tree (MLT-CNN) architecture is introduced. For boosting classification performance, we utilize additional information including inter-picture information while training the CNN. The overall algorithm including the MLT-CNN inference process is implemented on VVC Test Model (VTM) 11.0. The CUs of size 128×128 can be the inputs of the CNN. The sequences are encoded at the random access (RA) configuration with five QP values {22, 27, 32, 37, 42}. The experimental results show that the proposed algorithm can reduce the computational complexity by 11.53% on average, and 26.14% for the maximum with an average 1.01% of the increase in Bjøntegaard delta bit rate (BDBR). Especially, the proposed method shows higher performance on the sequences of the A and B classes, reducing 9.81%~26.14% of encoding time with 0.95%~3.28% of the BDBR increase.
https://doi.org/10.33851/JMIS.2021.8.3.147 인용 PDF KSCI HTML

Fast Mode Decision Algorithm for Scalable Video Coding (SVC) Using Directional Information of Neighboring Layer (스케일러블 비디오 코딩에서 방향성 정보를 이용한 모드 결정 고속화 기법)

Jung, Hyun-Ki;Hong, Kwang-Soo;Kim, Byung-Gyu;Kim, Chang-Ki;Yoo, Jeong-Ju
- Journal of Broadcast Engineering
- /
- v.17 no.1
- /
- pp.108-121
- /
- 2012
As Scalable Video Coding (SVC) is a video compression standard extended from H.264/AVC, it is a way to provide scalability in terms of temporal, spatial and quality. Although the compression efficiency of SVC is increased due to the scalability in many aspect, it is essential to reduce the complexity in order to efficiently use because the complexity is relatively increased. To reduce the complexity of SVC in the paper, we propose fast mode decision algorithm to reduce the complexity of encoding process using direction information of B-picture by efficiently performing inter-layer prediction. The proposed algorithm is a fast mode decision algorithm that makes different from detection mode number of forward and backward, bi-direction in the way using best mode of base-layer up-sampled after simply SKIP mode detection using the direction information of best mode of base-layer up-sampled. The experimental results show that the proposed algorithm approach can achieve the maximum computational time saving about 53% with almost no loss of rate distortion (RD) performance in the enhancement layer.
https://doi.org/10.5909/JEB.2012.17.1.108 인용 PDF KSCI

CU Depth Decision Based on FAST Corner Detection for HEVC Intra Prediction (HEVC 화면 내 예측을 위한 FAST 에지 검출 기반의 CU 분할 방법)

Jeon, Seungsu;kim, Namuk;Jeon, Byeungwoo
- Journal of Broadcast Engineering
- /
- v.21 no.4
- /
- pp.484-492
- /
- 2016
The High efficiency video coding (HEVC) is the newest video coding standard that achieves coding efficiency higher than previous video coding standards such as H.264/AVC. In intra prediction, the prediction units (PUs) are derived from a large coding unit (LCU) which is partitioned into smaller coding units (CUs) sizing from 8x8 to 64x64 in a quad-tree structure. As they are divided until having the minimum depth, Optimum CU splitting is selected in RDO (Rate Distortion Optimization) process. In this process, HEVC demands high computational complexity. In this paper, to reduce the complexity of HEVC, we propose a fast CU mode decision (FCDD) for intra prediction by using FAST (Features from Accelerated Segment Test) corner detection. The proposed method reduces computational complexity with 53.73% of the computational time for the intra prediction while coding performance degradation with 0.7% BDBR is small compared to conventional HEVC.
https://doi.org/10.5909/JBE.2016.21.4.484 인용 PDF KSCI KPUBS HTML

Fast Intra Mode Decision for H.264/AVC by Using the Approximation of DCT Coefficient (H.264/AVC에서 DCT 계수의 근사화를 이용한 고속 인트라 모드 결정 기법)

La, Byeong-Du;Eom, Min-Young;Choe, Yoon-Sik
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.44 no.3
- /
- pp.23-32
- /
- 2007
The H.264/AVC video coding standard uses rate distortion optimization (RDO) method to improve the compression performance in the intra prediction. The complexity and computational load are increased more than previous standard by using this method, even though this standard selects the best coding mode for the current macroblock. This paper proposes a fast intra mode decision algorithm for H.264/AVC encoder based on dominant edge direction (DED). To apply the idea, this algorithm uses the approximation of discrete cosine transform (DCT) coefficient. By detecting the DED, 3 modes instead of 9 modes are chosen for RDO calculation to decide the best mode in the $4{\times}4$ luma block. As for the $16{\times}16$ luma and $8{\times}8$ chroma block, instead of 4 modes, only 2 modes are searched. Experimental results show that the computation time of the proposed algorithm is decreased to about 72% of the full search method with negligible quality loss.
PDF KSCI

Fast Mode Decision using Global Disparity Vector for Multi-view Video Coding (다시점 영상 부호화에서 전역 변이 벡터를 이용한 고속 모드 결정)

Han, Dong-Hoon;Cho, Suk-Hee;Hur, Nam-Ho;Lee, Yung-Lyul
- Journal of Broadcast Engineering
- /
- v.13 no.3
- /
- pp.328-338
- /
- 2008
Multi-view video coding (MVC) based on H.264/AVC encodes multiple views efficiently by using a prediction scheme that exploits inter-view correlation among multiple views. However, with the increase of the number of views and use of inter-view prediction among views, total encoding time will be increased in multiview video coding. In this paper, we propose a fast mode decision using both MB(Macroblock)-based region segmentation information corresponding to each view in multiple views and global disparity vector among views in order to reduce encoding time. The proposed method achieves on average 40% reduction of total encoding time with the objective video quality degradation of about 0.04 dB peak signal-to-noise ratio (PSNR) by using joint multi-view video model (JMVM) 4.0 that is the reference software of the multiview video coding standard.
https://doi.org/10.5909/JBE.2008.13.3.328 인용 PDF KSCI

Fast Mode Decision using Block Size Activity for H.264/AVC (블록 크기 활동도를 이용한 H.264/AVC 부호화 고속 모드 결정)

Jung, Bong-Soo;Jeon, Byeung-Woo;Choi, Kwang-Pyo;Oh, Yun-Je
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.44 no.2 s.314
- /
- pp.1-11
- /
- 2007
H.264/AVC uses variable block sizes to achieve significant coding gain. It has 7 different coding modes having different motion compensation block sizes in Inter slice, and 2 different intra prediction modes in Intra slice. This fine-tuned new coding feature has achieved far more significant coding gain compared with previous video coding standards. However, extremely high computational complexity is required when rate-distortion optimization (RDO) algorithm is used. This computational complexity is a major problem in implementing real-time H.264/AVC encoder on computationally constrained devices. Therefore, there is a clear need for complexity reduction algorithm of H.264/AVC such as fast mode decision. In this paper, we propose a fast mode decision with early $P8\times8$ mode rejection based on block size activity using large block history map (LBHM). Simulation results show that without any meaningful degradation, the proposed method reduces whole encoding time on average by 53%. Also the hybrid usage of the proposed method and the early SKIP mode decision in H.264/AVC reference model reduces whole encoding time by 63% on average.
PDF KSCI

Fast Intra-Prediction Mode Decision Algorithm using Predetermined Prediction Block Size in H.264/AVC (H.264/AVC의 인트라 예측에서 예측 블록 크기 정보를 이용한 빠른 예측 모드 결정 기법)

Kim, Young-ju
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2009.10a
- /
- pp.211-214
- /
- 2009
H.264/AVC의 인트라 예측에서 미리 현재 블록 내의 정보 및 이전 블록의 예측 모드 정보 등을 이용하여 현재 블록의 예측 부호화 블록 크기가 결정되었을 경우, 예측된 블록 크기에 적합한 예측 모드 결정이 요구된다. 이에 사전에 결정된 예측 블록 크기 정보와 주변 블록과의 화소 변화량을 계산하여 예측 모드를 결정하는 기법을 제안하고 성능을 평가한다.
PDF

Early Decision of Inter-prediction Modes in HEVC Encoder (HEVC 부호화기에서의 화면 간 예측모드 고속 결정)

Han, Woo-Jin;Ahn, Joon-Hyung;Lee, Jong-Ho
- Journal of Broadcast Engineering
- /
- v.20 no.1
- /
- pp.171-182
- /
- 2015
HEVC can increase the coding efficiency significantly compared with H.264/AVC however it requires much larger computational complexities in both encoder and decoder. In this paper, the decision process of inter-prediction modes in the HEVC reference software has been studied and a fast algorithm to reduce the computational complexity of encoder and decoder is introduced. The proposed scheme introduces a early decision criteria using the outputs of uni-directional predictions to skip the bi-directional prediction estimation. From the experimental results, it was proven that the proposed method can reduce the encoding complexity by 12.0%, 14.6% and 17.2% with 0.6%, 1.0% and 1.5% of coding efficiency penalty, respectively. In addition, the ratio of bi-directional prediction mode was reduced by 6.3%, 11.8% and 16.6% at the same level of coding efficiency penalty, respectively, which should lead to the decoder complexity reduction. Finally, the effects of the proposed scheme are maintained regardless of the use of the early skip decision algorithm which is implemented in the HEVC reference software.
https://doi.org/10.5909/JBE.2015.20.1.171 인용 PDF KSCI KPUBS HTML

Bayesian-theory-based Fast CU Size and Mode Decision Algorithm for 3D-HEVC Depth Video Inter-coding

Chen, Fen;Liu, Sheng;Peng, Zongju;Hu, Qingqing;Jiang, Gangyi;Yu, Mei
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.12 no.4
- /
- pp.1730-1747
- /
- 2018
Multi-view video plus depth (MVD) is a mainstream format of 3D scene representation in free viewpoint video systems. The advanced 3D extension of the high efficiency video coding (3D-HEVC) standard introduces new prediction tools to improve the coding performance of depth video. However, the depth video in 3D-HEVC is time consuming. To reduce the complexity of the depth video inter coding, we propose a fast coding unit (CU) size and mode decision algorithm. First, an off-line trained Bayesian model is built which the feature vector contains the depth levels of the corresponding spatial, temporal, and inter-component (texture-depth) neighboring largest CUs (LCUs). Then, the model is used to predict the depth level of the current LCU, and terminate the CU recursive splitting process. Finally, the CU mode search process is early terminated by making use of the mode correlation of spatial, inter-component (texture-depth), and inter-view neighboring CUs. Compared to the 3D-HEVC reference software HTM-10.0, the proposed algorithm reduces the encoding time of depth video and the total encoding time by 65.03% and 41.04% on average, respectively, with negligible quality degradation of the synthesized virtual view.
https://doi.org/10.3837/tiis.2018.04.018 인용 PDF KSCI

A fast Inter Mode Decision Based on Local Statistics in H.264/AVC (지역 통계를 이용한 H.264/AVC의 고속 인터 모드 예측)

Lee, Dong-Shik;Kim, Young-Mo
- Journal of Korea Multimedia Society
- /
- v.10 no.8
- /
- pp.997-1003
- /
- 2007
H.264/AVC enhances inter-frame coding performance adopting new intra and inter mode prediction in inter-frame motion prediction. H.264 encoder provides performance superior to existing standards with the prediction modes. Instead of enhanced performance, however, both predictions increase complexity of encoder and demand a lot of coding time. This paper proposes a method using local statistics of resultant intra mode, my and mode map to predict inter mode. There are relationship between intra and inter mode, and we can predict inter mode using neighboring macroblocks' resultant mode and motion vector according to the contents of frame. The experimental results show that the proposed algorithm reduces encoding time by 31% on average with a negligible loss of PSNR and bitrate.
PDF

Search Result 95, Processing Time 0.036 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)