Search | Korea Science

Analysis of Training Method for Matrix Weighted Intra Prediction (MIP) in VVC (VVC 행렬가중 화면내 예측(MIP) 학습기법 분석)

Park, Dohyeon;Kwon, Hyoungjin;Jeong, Seyoon;Kim, Jae-Gon
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2020.11a
- /
- pp.148-150
- /
- 2020
최근 VVC(Versatile Video Coding) 표준 완료 이후 JVET(Joint Video Experts Team)은 인공신경망 기반의 비디오 부호화를 위한 AhG(Ad-hoc Group) 구성하고 인공지능을 이용한 비디오 압축 기술들을 검증하고 있으며, MPEG(Moving Picture Experts Group)에서는 DNNVC(Deep Neural Network based Video Coding) 활동을 통해 딥러닝 기반의 차세대 비디오 부호화 표준 기술을 탐색하고 있다. 본 논문은 VVC 에 채택된 신경망 기반의 기술인 MIP(Matrix Weighted Intra Prediction)를 참조하여, MIP 모델의 학습에서 손실함수가 예측 성능에 미치는 영향을 분석한다. 즉, 예측의 왜곡(MSE)만을 고려한 경우와 예측오차의 부호화 비용도 함께 반영한 손실함수를 비교한다. 실험을 위해 HEVC(High Efficiency Video Coding) 화면내 예측 대비 평균적인 PSNR 향상 정도를 나타내는 성능 지표(��PSNR)를 정의한다. 실험결과 예측오차의 부호화 특성을 반영하는 손실함수를 이용한 학습이 MSE 만 고려한 학습 대비 ��PSNR 기준 평균 0.4dB 향상됨을 보였다.
PDF

Suboptimal video coding for machines method based on selective activation of in-loop filter

Ayoung Kim;Eun-Vin An;Soon-heung Jung;Hyon-Gon Choo;Jeongil Seo;Kwang-deok Seo
- ETRI Journal
- /
- v.46 no.3
- /
- pp.538-549
- /
- 2024
A conventional codec aims to increase the compression efficiency for transmission and storage while maintaining video quality. However, as the number of platforms using machine vision rapidly increases, a codec that increases the compression efficiency and maintains the accuracy of machine vision tasks must be devised. Hence, the Moving Picture Experts Group created a standardization process for video coding for machines (VCM) to reduce bitrates while maintaining the accuracy of machine vision tasks. In particular, in-loop filters have been developed for improving the subjective quality and machine vision task accuracy. However, the high computational complexity of in-loop filters limits the development of a high-performance VCM architecture. We analyze the effect of an in-loop filter on the VCM performance and propose a suboptimal VCM method based on the selective activation of in-loop filters. The proposed method reduces the computation time for video coding by approximately 5% when using the enhanced compression model and 2% when employing a Versatile Video Coding test model while maintaining the machine vision accuracy and compression efficiency of the VCM architecture.
https://doi.org/10.4218/etrij.2023-0085 인용 PDF

An efficient architecture for motion estimation processor satisfying CCITT H.261 (CCITT H.261를 위한 효율적인 구조의 움직임 추정 프로세서 VLSI 설계)

주락현;김영민
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.32B no.1
- /
- pp.30-38
- /
- 1995
In this paper, we propose an efficient architecture for motion estimation processor which performs one of essential functions in moving picture coding algorithms. Simple control mechanism of data flow in register array which stores pixel data, parallel processing of pixel data and pipelining scheme in arithmetic umit allow this architecture to process a 352*288 pixel image at the frame rate of 30fs, which is compatable with CCITT standard H.261.
PDF

MPEG G-PCC 국제표준 기술

Byeon, Ju-Hyeong;Choe, Han-Sol;Sim, Dong-Gyu
- Broadcasting and Media Magazine
- /
- v.26 no.2
- /
- pp.31-45
- /
- 2021
본 고는 ISO/IEC JTC 1/SC 29/WG 7 MPEG(Moving Picture Experts Group) 3DG(3D Graphics coding) 그룹에서 진행되고 있는 포인트 클라우드 데이터 압축 표준 기술 중 하나인 G-PCC(Geometry-based Point Cloud Compression) 표준에 대하여 설명하고자 한다. G-PCC는 포인트 클라우드의 기하 정보와 속성 정보를 3차원 공간에서 서로 다른 기술을 이용하여 압축하는 표준으로, 무손실 압축 방법의 경우 10:1의 압축율을 제공하고 손실 압축의 경우 35:1 정도의 압축율을 보인다. 본 고에서는 G-PCC의 기하 정보와 속성 정보의 압축 방법을 상세히 설명하고 같은 기능을 수행하는 압축 기술 간의 성능을 비교하고자 한다.
PDF KSCI

Wavelet Transform Coding for Image Conference (화상회의를 위한 웨이브렛 변환 부호화)

김정일
- Journal of the Korea Society of Computer and Information
- /
- v.4 no.3
- /
- pp.73-77
- /
- 1999
In this paper. wavelet transform coding for image conference is studied. Original video frames are transformed into hierarchical pyramidal images with multiresolution using the band property of wavelet transform coefficients. Moving information between neighboring frames is obtained from the low-resolution band. Also, to control the video coding procedure. a new picture set filter is proposed. This filter controls the compression ratio of each frame depending on the correlation to the reference frame by selectively eliminating less important high-resolution areas. Consequently. video quality can be preserved and bit rate can be controlled adaptively In the simulation, to test the performance of the proposed coding method, comparisons with the full search block matching algorithm and the differential image coding algorithm are made. Consequently. the proposed method shows a reasonably good performance over existing ones.

A Real Time Implementation of Picture Coder/Decoder Using AMBTC at the Data Rate of 10Mb/s (10Mb/s의 전송률을 갖는 AMBTC를 이용한 영상부호기/부호기의 실시간 구현)

고형화;이충웅
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.24 no.5
- /
- pp.849-855
- /
- 1987
This paper describes an implementation of the absolute moment block truncation coding(AMBTC) in real time for the moving picture data compression. We have realized a system composed of the encoder and decoder, and operated it using an NTSC TV signal. The encoder consists of a 4-1line buffer memory and a data processing block. Besides, there are signal conditioner and a control signal generator. Experimental results show that the quality of the processed image with a data rate of 10Mb/s is slightly degraded, but not objectionable, comparing data rate of 80Mb/s.
PDF

Design and Implementation of Real-time Moving Picture Encoder Based on the Fractal Algorithm (프랙탈 알고리즘 기반의 실시간 영상 부호화기의 설계 및 구현)

Kim, Jae-Chul;Choi, In-Kyu
- The KIPS Transactions:PartB
- /
- v.9B no.6
- /
- pp.715-726
- /
- 2002
In this paper, we construct real-time moving picture encoder based on fractal theory by using general purpose digital signal processors. The constructed encoder is implemented using two fixed-point general DSPs (ADSP2181) and performs image encoding by three stage pipeline structure. In the first pipeline stage, the image grabber acquires image data from NTSC standard image signals and stores digital image into frame memory. In the second stage, the main controller encode image dada using fractal algorithm. The last stage, output controller perform Huffman coding and result the coded data via RS422 port. The performance tests of the constructed encoder shows over 10 frames/sec encoding speed for QCIF data when all the frames are encoded. When we encode the images using the interframe and redundency based on the proposed algorithms, encoding speed increased over 30 frames/sec in average.
https://doi.org/10.3745/KIPSTB.2002.9B.6.715 인용 PDF KSCI

Performance Analysis of Scalable HEVC Coding Tools (HEVC 기반 스케일러블 비디오 부호화 툴의 성능 분석)

Kim, Yongtae;Choi, Jinhyuk;Choi, Haechul
- Journal of Broadcast Engineering
- /
- v.20 no.4
- /
- pp.497-508
- /
- 2015
Current communication networks consist of channels with various throughputs, protocols, and packet loss rates. Moreover, there are also diverse user multimedia consumption devices having different capabilities and screen sizes. Thus, a practical necessity of scalability on video coding have been gradually increasing. Recently, The Scalable High Efficiency Video Coding(SHVC) standard is developed by Joint Collaborative Team on Video Coding(JCT-VC) organized in cooperation with MPEG of ISO/IEC and VCEG of ITU-T. This paper introduces coding tools of SHVC including adopted and unadopted tools discussed in the process of the SHVC standardization. Furthermore, the individual tool and combined tool set are evaluated in terms of coding efficiency relative to a single layer coding structure. This analysis would be useful for developing a fast SHVC encoder as well as researching on a new scalable coding tool.
https://doi.org/10.5909/JBE.2015.20.4.497 인용 PDF KSCI KPUBS HTML

Study on Noise Filling algorithm of Unified Speech and Audio Coding (통합 음성/오디오 부호화기의 Noise Filling 알고리즘에 대한 연구)

Song, Jeongook;Kang, Hong-Goo
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2012.07a
- /
- pp.260-261
- /
- 2012
본 논문에서는 Unified Speech and Audio Coding (USAC)에 적용된 Noise Filling의 부호화 과정에서 음질 왜곡 정도에 따라 Noise level을 설정하는 방법을 제안한다. USAC는 Moving Picture Experts Group (MPEG)에서 표준화한 최신의 음성/오디오 통합 코덱으로 현존하는 코덱 중에 최고의 성능을 가지고 있다. 하지만, 복호화기 기술만 표준화하여, 인코더를 설계하는 방법에 따라 음질의 차이가 존재한다 현재 오픈 소스 기반으로 진행되고 있는 프로젝트 JAME에서는 이러한 음질 차이를 극복하고, USAC에 적용된 핵섬 인코더 기술의 성능을 최대화 할 수 있는 여러 가지 방법을 포함하고 있다. 그 중 Noise Filling은 저 전송률 부호화 과정에서 양자화 되지 않는 스펙트럼에 대하여 일정한 noise level을 넣어 인지적으로 음질을 향상시키는 방법이다. 제안된 Noise Filling 부호화 방법은 현재 프레임의 음질 왜곡 정도를 반영하여, noise-like 신호 성분을 더욱 정교하게 부호화 할 수 있게 하였다.
PDF

Overview and Performance analysis of the HEVC based 3D Video Coding (HEVC 기반 3차원 비디오 부호화 기법 성능 분석)

Park, Daemin;Choi, Haechul
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2013.11a
- /
- pp.186-189
- /
- 2013
최근 다양한 3D 콘텐츠들에 대한 사용자의 요구에 따라 HD(High Definition)화질 및 이를 넘어서는 고해상도(FHD(full high definition), UHD(ultra high definition))의 고품질 3D 방송 서비스에 대한 연구가 진행되고 있으며, 차세대 영상 기술로 주목되고 있는 3차원 비디오 기술은 사용자에게 실감 있는 영상을 제공할 수 있다, 하지만 많은 시점을 전부 촬영하는 것은 한계가 있으므로, 카메라의 깊이 정보를 이용하여, 전송하는 시점을 줄이고, 시점영상을 합성함으로써 사용하는 카메라의 수보다 더 많은 시점을 생성하는 방법이 필요하다. 현재 국제 표준화 기구인 MPEG(Moving Picture Experts Group)의 3차원 비디오 부호화(3D Video Coding, 3DVC)에서는 깊이영상을 가지는 3차원 비디오영상에 대한 효과적인 부호화 기술들에 대해 표준화가 진행되고 있다. 이에 본 논문은 HEVC 기반의 3D-HEVC에서 사용하는 표준 기술들에 대하여 소개하고, 현재 사용되고 있는 기술들에 대한 성능 평가를 분석 하였다.
PDF

Search Result 74, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)