Search | Korea Science

Neural Network based Video Coding in JVET

Choi, Kiho
- Journal of Broadcast Engineering / v.27 no.7 / pp.1021-1033 / 2022
After the Versatile Video Coding (VVC)/H.266 standard was completed, the Joint Video Experts Team (JVET) began to investigate new technologies that could significantly increase coding gain for the next-generation video coding standard. One direction investigates signal processing based tools, while the other investigates neural network based technology. Neural Network based Video Coding (NNVC) had not previously been studied in the standardization group, and this is its first trial of such an approach. After two years of research, JVET produced its first common software, called Neural Compression Software (NCS), with two NN-based in-loop filtering tools at the 27th meeting and began to maintain NN-based technologies for common experiments. The coding gains of the two filters in NCS-1.0 are reported to be 8.71% and 9.44% on average, respectively, in a random access scenario. All material related to NCS can be found in the JVET repository. In this paper, we provide a brief overview and review of the NNVC activity in JVET in order to convey trends and insight into the new direction of the video coding standard.
https://doi.org/10.5909/JBE.2022.27.7.1021

Research Trends in Neural Network based Video Coding Technology in JVET (JVET 신경망 기반 비디오 코딩 기술 연구 동향)

Choi, Kiho
- Broadcasting and Media Magazine / v.28 no.1 / pp.29-37 / 2023
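The Joint Video Experts Team (JVET), a body formed jointly by the international standardization organizations MPEG and VCEG, began studying next-generation coding technologies in preparation for a new standard after the completion of Versatile Video Coding (VVC)/H.266. Study is under way along two broad research directions: one is research on signal processing based technologies, which have been widely used in existing codecs, and the other is research on new coding technologies that use neural networks. Neural network based video coding has never been formally studied in standardization, and this is the first such attempt made in preparation for the next-generation standard. In this article, we review the trends in the neural network based video coding research newly under way in JVET, in order to provide insight into the new direction of video coding standards.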

Performance Analysis of Hybrid Equiangular Cubemap (HEC) for 360 Video Coding (360 비디오 부호화를 위한 HEC 투영 성능분석)

Kim, Nam-Cheol;Kim, Hyun-Ho;Yoon, Yong-Uk;Kim, Jae-Gon
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2018.11a / pp.141-142 / 2018
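360 video is attracting attention as immersive media with the spread of VR media, and JVET (Joint Video Experts Team) also includes 360 video coding in the standardization of VVC (Versatile Video Coding), which is in progress as the post-HEVC standard. JVET is currently considering various 2D projection methods of the sphere image for coding 360 video. These 2D projections introduce a conversion distortion in which the pixel samples of the sphere image are mapped non-uniformly onto the 2D image, and this degrades the coding efficiency of 360 video. In this paper, we introduce the existing EAC (Equi-Angular Cubemap) and HEC (Hybrid Equiangular Cubemap), which are improved projection methods of CMP (Cubemap Projection), and on this basis present an extended conversion method for HEC and verify its objective and subjective coding performance.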
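The non-uniform sampling of cubemap faces mentioned in the abstract above, and the way an equi-angular remapping counters it, can be illustrated with a short sketch. It assumes the commonly used EAC relation u_eac = (4/pi) * atan(u) for a face coordinate u in [-1, 1]; the function names are hypothetical, and the HEC variant analysed in the paper is not reproduced here.

```python
import math


def cmp_to_eac(u: float) -> float:
    """Map a cubemap (CMP) face coordinate u in [-1, 1] to its EAC coordinate."""
    return (4.0 / math.pi) * math.atan(u)


def eac_to_cmp(u_eac: float) -> float:
    """Inverse mapping: EAC coordinate in [-1, 1] back to the CMP coordinate."""
    return math.tan(u_eac * math.pi / 4.0)


def angular_step(u: float, du: float = 1e-3) -> float:
    """Angle (radians) covered by a small step du at position u on a CMP face."""
    return math.atan(u + du) - math.atan(u)


# On a plain CMP face, equal steps cover about twice the angle at the face
# centre as they do at the face edge, so the sphere is sampled non-uniformly.
centre_to_edge_ratio = angular_step(0.0) / angular_step(1.0 - 1e-3)

# In EAC coordinates, equal steps correspond to equal angles by construction.
round_trip = eac_to_cmp(cmp_to_eac(0.5))
```

The ratio of roughly two between centre and edge is the distortion that equi-angular variants of CMP are designed to reduce.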

Generation of Alternative Merge Candidates for Versatile Video Coding(VVC) (VVC를 위한 대체 움직임 정보 병합 후보 생성 기법)

Park, Dohyeon;Lee, Jinho;Kang, Jung Won;Kim, Jae-Gon
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2018.06a / pp.147-148 / 2018
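JVET (Joint Video Experts Team) has recently started the standardization of VVC (Versatile Video Coding), a new video compression standard. HM and VTM (Versatile Test Model), the reference SW codecs of the existing HEVC and of VVC respectively, use a motion information merge (Merge) mode for efficient inter prediction coding. In this paper, we present a method of generating Merge candidates that can serve as substitutes when the motion information of spatially neighboring blocks is not available during the construction of the VTM Merge candidate list. The proposed method was tested under the JVET CTC (Common Test Conditions), and the experimental results showed BD-rate reductions of 0.2%, 0.17%, and 0.12% for the Y, U, and V components, respectively.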

A Fast Decision Method of Quadtree plus Binary Tree (QTBT) Depth in JEM (차세대 비디오 코덱(JEM)의 고속 QTBT 분할 깊이 결정 기법)

Yoon, Yong-Uk;Park, Do-Hyun;Kim, Jae-Gon
- Journal of Broadcast Engineering / v.22 no.5 / pp.541-547 / 2017
The Joint Exploration Model (JEM), the reference SW codec of the Joint Video Exploration Team (JVET), which is exploring future video standard technology, provides a recursive Quadtree plus Binary Tree (QTBT) block structure. QTBT achieves enhanced coding efficiency by adding new block structures, at the expense of greatly increased computational complexity. In this paper, we propose a fast decision algorithm for the QTBT block partitioning depth that uses the rate-distortion (RD) costs of the upper and current depths to reduce the complexity of the JEM encoder. Experimental results show that the computational complexity of JEM 5.0 can be reduced by up to 21.6% and 11.0%, with BD-rate increases of 0.7% and 1.2%, in AI (All Intra) and RA (Random Access), respectively.
https://doi.org/10.5909/JBE.2017.22.5.541

A Method of Intra Mode Coding for Joint Exploration Model (JEM) (차세대 비디오 부호화 실험모델(JEM)의 화면내 예측 모드 부호화 기법)

Park, Dohyeon;Lee, Jinho;Kang, Jung Won;Kim, Jae-Gon
- Journal of Broadcast Engineering / v.23 no.4 / pp.495-502 / 2018
JVET (Joint Video Exploration Team), which explored evolving video coding technologies with capabilities beyond HEVC (High Efficiency Video Coding), released a reference software codec named the Joint Exploration Model (JEM) for performance verification of coding technologies. JEM has 67 intra prediction modes, extending the 35 modes of HEVC. As a result, the enhancement of coding performance is limited by the overhead of prediction mode coding. In this paper, we analyze the probabilities of prediction mode selection and then propose a more efficient intra prediction mode coding based on the analyzed mode occurrence. In addition, we propose a context modeling for CABAC (Context-Adaptive Binary Arithmetic Coding) of the proposed mode coding. Experimental results show a BD-rate gain of 0.02% in the AI (All Intra) coding structure compared to JEM 7.0. The context modeling needs further optimization to obtain additional coding performance.
https://doi.org/10.5909/JBE.2018.23.4.495

Geometry Padding for Segmented Sphere Projection (SSP) in 360 Video (360 비디오의 SSP를 위한 기하학적 패딩)

Kim, Hyun-Ho;Myeong, Sang-Jin;Yoon, Yong-Uk;Kim, Jae-Gon
- Journal of Broadcast Engineering / v.24 no.1 / pp.25-31 / 2019
360 video is attracting attention as immersive media, and is also considered in VVC (Versatile Video Coding), which is being developed in JVET (Joint Video Experts Team) as a new post-HEVC video coding standard. A 2D image projected from 360 video for compression may have discontinuities between the projected faces as well as inactive regions, and these may cause visual artifacts in the reconstructed video and decrease coding efficiency. In this paper, we propose an efficient geometric padding method to reduce these discontinuities and inactive regions in the SSP (Segmented Sphere Projection) format. Experimental results show that the proposed method improves subjective quality, with a minor loss of coding gain, compared to the existing SSP padding, which uses copy padding.
https://doi.org/10.5909/JBE.2019.24.1.25

Overview of VVC

Lee, Jong-Seok;Park, Jun-Taek;Choe, Han-Sol;Byeon, Ju-Hyeong;Sim, Dong-Gyu
- Broadcasting and Media Magazine / v.24 no.4 / pp.10-25 / 2019
This article describes VVC (Versatile Video Coding), the video compression standard technology being developed in JVET (Joint Video Experts Team), in which ISO/IEC MPEG and ITU-T VCEG participate.

An Efficient Frame Packing Method for Icosahedral Projection (ISP) in 360 Video (360 비디오의 ISP 를 위한 효과적인 프레임 패킹 기법)

Kim, Hyun-Ho;Yoon, Yong-Uk;Park, Do-Hyeon;Kim, Jae-Gon
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2017.11a / pp.6-7 / 2017
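360 video is a new type of media that provides a sense of immersion and has recently been drawing increasing attention. Accordingly, JVET (Joint Video Exploration Team), which is exploring next-generation video standard technologies, is discussing 360 video as a target of standardization along with SDR and HDR video. Various 2D projection methods for coding 360 video are currently being proposed in JVET. The image converted to 2D may contain discontinuities between projection faces and inactive regions, which degrade coding efficiency. In this paper, we present an effective frame packing method that reduces these discontinuities and inactive regions in ISP (Icosahedral Projection). The proposed method improves subjective quality and coding efficiency by efficiently arranging the discontinuous boundaries between projection faces. Experimental results showed BD-rate reductions of 1.0%, 1.0%, 1.27%, and 0.63% compared to the existing CISP (Compact ISP). An improvement in subjective quality over the existing CISP was also confirmed.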

Adaptive block tree structure for video coding

Baek, Aram;Gwon, Daehyeok;Son, Sohee;Lee, Jinho;Kang, Jung-Won;Kim, Hui Yong;Choi, Haechul
- ETRI Journal / v.43 no.2 / pp.313-323 / 2021
The Joint Video Exploration Team (JVET) has studied future video coding (FVC) technologies with a potential compression capability that significantly exceeds that of the High Efficiency Video Coding (HEVC) standard. The joint exploration test model (JEM), a common platform for the exploration of FVC technologies in the JVET, employs quadtree plus binary tree block partitioning, which enhances the flexibility of coding unit partitioning. Although separating the luminance and chrominance tree structures in I slices significantly improves coding efficiency for chrominance, this approach has intrinsic drawbacks that result in redundant block partitioning data. In this paper, an adaptive tree structure correlating luminance and chrominance of single and dual trees is presented. Our proposed method resulted in an average reduction of 0.24% in the Y Bjontegaard Delta rate relative to JEM 6.0 under the intra coding common test conditions.
https://doi.org/10.4218/etrij.2019-0217
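Several of the abstracts above report results as a Bjontegaard Delta rate (BD-rate), the average bitrate difference between two codecs at equal quality. The sketch below is a simplified illustration, not the tool used in any of the listed papers: it swaps in piecewise-linear interpolation of log-rate over PSNR for the cubic fit that is normally used, and the function names and sample points are made up for the example.

```python
import math


def _interp(x, xs, ys):
    """Linear interpolation of (xs, ys) at x; xs must be strictly increasing."""
    for i in range(len(xs) - 1):
        if xs[i] <= x <= xs[i + 1]:
            t = (x - xs[i]) / (xs[i + 1] - xs[i])
            return ys[i] + t * (ys[i + 1] - ys[i])
    raise ValueError("x lies outside the sampled range")


def _mean_log_rate(points, lo, hi):
    """Average of log10(rate) over the PSNR interval [lo, hi] (trapezoid rule)."""
    pts = sorted((psnr, math.log10(rate)) for rate, psnr in points)
    xs = [p for p, _ in pts]
    ys = [r for _, r in pts]
    knots = [lo] + [x for x in xs if lo < x < hi] + [hi]
    area = 0.0
    for a, b in zip(knots, knots[1:]):
        area += 0.5 * (_interp(a, xs, ys) + _interp(b, xs, ys)) * (b - a)
    return area / (hi - lo)


def bd_rate(anchor, test):
    """Approximate BD-rate in percent; negative means the test codec saves bits.

    anchor and test are lists of (bitrate, psnr) pairs with distinct PSNR values.
    """
    lo = max(min(p for _, p in anchor), min(p for _, p in test))
    hi = min(max(p for _, p in anchor), max(p for _, p in test))
    if hi <= lo:
        raise ValueError("the two curves share no PSNR range")
    diff = _mean_log_rate(test, lo, hi) - _mean_log_rate(anchor, lo, hi)
    return (10.0 ** diff - 1.0) * 100.0


# Made-up rate points: the test codec needs 10% fewer bits at every PSNR.
anchor_points = [(1000.0, 32.0), (2000.0, 35.0), (4000.0, 38.0), (8000.0, 41.0)]
test_points = [(0.9 * rate, psnr) for rate, psnr in anchor_points]
saving = bd_rate(anchor_points, test_points)
```

Under this sign convention, a negative value is a bitrate saving, which is why results are often quoted either as a negative BD-rate or as a positive "reduction".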
