• Title/Summary/Keyword: VVC

Search Result 121, Processing Time 0.023 seconds

Comparison of Image Compression Performance based on RoI Extraction Methods for Machines Vision (RoI 추출 방법에 따른 기계를 위한 영상 압축 성능 비교)

  • Lee, Yegi;Kim, Shin;Yoon, Kyoungro
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.06a
    • /
    • pp.146-149
    • /
    • 2022
  • 기존 RDO(Rate Distortion Optimization) 기반 압축 방식은 압축 성능에 초점을 두기 때문에 영상 내 인지 특성이 무시될 수 있다. 따라서 RoI(Region of Interest)을 기반으로 압축률을 조절하는 연구가 고안[1, 2, 3, 4] 되었으며, HVS(Human Visual System) 관점에서 영상 내 중요한 부분에 대해 더 높은 품질로 영상을 압축하는 연구가 대부분이다. 최근 인공지능 기술이 발전함에 따라 지능형 영상 분석에 대한 수요가 증가하고 있으며, 이에 따라 머신 비전을 위한 영상 부호화 및 효율적인 전송에 대한 필요성이 대두되고 있다. 본 논문에서는 VVC(Versatile Video Coding)의 dQP(delta Quantization Parameter)를 활용하여 RoI(Region of Interest) 기반압축 방법을 제안하고, 두가지의 RoI 추출 방식을 소개한다. Detectron2 Faster R-CNN X101-FPN [5]의 첫번째 탐지기를 통해 후보 영역 기반 RoI 을 추출하고, 두번째 탐지기를 통해 객체 기반 RoI 을 추출하여, 영상 내 객체 부분과 비객체 부분으로 나누어 서로 다른 압축률로 압축을 수행하였으며, 이에 따른 성능을 비교하고자 한다.

  • PDF

Compression of Multiscale Features of FPN for VCM (VCM 을 위한 FPN 다중 스케일 특징 압축)

  • Kim, Dong-Ha;Yoon, Yong-Uk;Lee, Jooyoung;Jeong, Se-Yoon;Kim, Jae-Gon;Jeong, Dae-Gwon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.06a
    • /
    • pp.143-145
    • /
    • 2022
  • MPEG-VCM(Video Coding for Machine)은 입력된 비디오 특징(feature)를 압축하는 Track1 과 입력 영상을 직접 압축하는 Track2 로 나뉘어 표준화가 진행중이다. 본 논문은 VCM Track 1 에 해당하는 Detectron2 FPN(Feature Pyramid Network)에서 추출한 다중 스케일 특징맵을 VVC 로 압축하는 MSFC(Multi-Scale Feature Compression)을 구조를 제안한다. 본 논문의 MSFC 에서는 다중 스케일 특징을 결합하여 부호화/복호화하는 기존의 구조에서 특징맵의 해상도를 줄여 압축하는 개선된 MSFC 를 제시한다. 제안 방법은 VCM 의 Track2 의 영상 앵커(image anchor) 보다 우수한 BPP-mAP 성능을 보이고 최대 -84.98%의 BD-rate 성능향상을 보인다.

  • PDF

A Feature Map Compression Method for Multi-resolution Feature Map with PCA-based Transformation (PCA 기반 변환을 통한 다해상도 피처 맵 압축 방법)

  • Park, Seungjin;Lee, Minhun;Choi, Hansol;Kim, Minsub;Oh, Seoung-Jun;Kim, Younhee;Do, Jihoon;Jeong, Se Yoon;Sim, Donggyu
    • Journal of Broadcast Engineering
    • /
    • v.27 no.1
    • /
    • pp.56-68
    • /
    • 2022
  • In this paper, we propose a compression method for multi-resolution feature maps for VCM. The proposed compression method removes the redundancy between the channels and resolution levels of the multi-resolution feature map through PCA-based transformation. According to each characteristic, the basis vectors and mean vector used for transformation, and the transformation coefficient obtained through the transformation are compressed using a VVC-based coder and DeepCABAC. In order to evaluate performance of the proposed method, the object detection performance was measured for the OpenImageV6 and COCO 2017 validation set, and the BD-rate of MPEG-VCM anchor and feature map compression anchor proposed in this paper was compared using bpp and mAP. As a result of the experiment, the proposed method shows a 25.71% BD-rate performance improvement compared to feature map compression anchor in OpenImageV6. Furthermore, for large objects of the COCO 2017 validation set, the BD-rate performance is improved by up to 43.72% compared to the MPEG-VCM anchor.

The Application of Frequency Modulated Quartz Oscillator Using a V.V.C. Diode. (VVC 다이오드를 사용한 수정주파수 변조기의 응용)

  • Jeong, Man-Yeong;Kim, Yeong-Ung;Kim, Byeong-Sik
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.9 no.5
    • /
    • pp.19-26
    • /
    • 1972
  • A newly developed quartz frequency modulator utili3ing a V. V. C. diode is briefly described. Its electrical characteristics-including modulation linearity, modulation distortion, and carrier frequency stability depending upon the variation of the environmental temperature and the applied power voltage, etc.-are suitable for the modulator of a mobile or a portable F.M. transmitter according to the experimental results. The excellent over-all electrical characteristics were proved from the experimental development of the two kinds of transceivers. One is the single channal transceiver which contains a direct frequency modulator at the carrier frequency of 52.750 MHg. The other is the dual channel transceiver (the frequencies are selected from about 40 channels without tuning adjustment) whose operational frequency is composed of a modulated frequency of 10.7 MHz and the frequency generated at a channel control oscillator, As mentioned above, it is realized that the electrical characteristics of this modulation method are suitable for portable F. M. transceivers.

  • PDF

A Feature Map Generation Method for MSFC-Based Feature Compression without Min-Max Signaling in VCM (VCM 의 MSFC 기반 특징 압축을 위한 Min-Max 시그널링을 제외한 특징맵 생성 기법)

  • Dong-Ha Kim;Yong-Uk Yoon;Jae-Gon Kim
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.11a
    • /
    • pp.79-81
    • /
    • 2022
  • MPEG-VCM(Video Coding for Machines)에서는 머신비전(machine vision) 네트워크의 백본(backbone)에서 추출된 이미지/비디오 특징 압축을 위한 표준화를 진행하고 있다. 현재 VCM 표준기술 탐색 과정에서 가장 좋은 압축 성능을 보이는 MSFC(Multi-Scale Feature compression) 기반 압축 네트워크 모델은 추출된 멀티-스케일 특징을 단일-스케일 특징으로 변환하여 특징맵으로 구성하고 이를 VVC 로 압축한다. 본 논문에서는 MSFC 기반 압축 모델에서 Min-Max 값 시그널링을 제외한 최소-최대(Min-Max) 정규화를 포함한 개선된 특징맵 생성 기법을 제시한다. 즉, 제안기법은 VCM 디코더에서의 특징맵 복원을 위한 Min-Max 값을 학습 기반으로 생성함으로써 Min-Max 시그널링의 비트 오버헤드 절감뿐만 아니라 별도의 시그널링 기제를 생략한 보다 단순한 전송 비트스트림 구성을 가능하게 한다. 실험결과 제안기법은 이미지 앵커(Anchor) 대비 BPP-mAP 성능에서 83.24% BD-rate 이득을 보이며, 이는 기존 MSFC 보다 1.74%정도 다소 떨어지지만 별도의 Min-Max 시그널링 없이도 기존의 성능을 유지할 수 있음을 보인다.

  • PDF

Joint Training of Neural Image Compression and Super Resolution Model (신경망 이미지 부호화 모델과 초해상화 모델의 합동훈련)

  • Cho, Hyun Dong;Kim, YeongWoong;Cha, Junyeong;Kim, DongHyun;Lim, Sung Chang;Kim, Hui Yong
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.06a
    • /
    • pp.1191-1194
    • /
    • 2022
  • 인터넷의 발전으로 수많은 이미지와 비디오를 손쉽게 이용할 수 있게 되었다. 이미지와 비디오 데이터의 양이 기하급수적으로 증가함에 따라, JPEG, HEVC, VVC 등 이미지와 비디오를 효율적으로 저장하기 위한 부호화 기술들이 등장했다. 최근에는 인공신경망을 활용한 학습 기반 모델이 발전함에 따라, 이를 활용한 이미지 및 비디오 압축 기술에 관한 연구가 빠르게 진행되고 있다. NNIC (Neural Network based Image Coding)는 이러한 학습 가능한 인공신경망 기반 이미지 부호화 기술을 의미한다. 본 논문에서는 NNIC 모델과 인공신경망 기반의 초해상화(Super Resolution) 모델을 합동훈련하여 기존 NNIC 모델보다 더 높은 성능을 보일 수 있는 방법을 제시한다. 먼저 NNIC 인코더(Encoder)에 이미지를 입력하기 전 다운 스케일링(Down Scaling)으로 쌍삼차보간법을 사용하여 이미지의 화소를 줄인 후 부호화(Encoding)한다. NNIC 디코더(Decoder)를 통해 부호화된 이미지를 복호화(Decoding)하고 업 스케일링으로 초해상화를 통해 복호화된 이미지를 원본 이미지로 복원한다. 이때 NNIC 모델과 초해상화 모델을 합동훈련한다. 결과적으로 낮은 비트량에서 더 높은 성능을 볼 수 있는 가능성을 보았다. 또한 합동훈련을 함으로써 전체 성능의 향상을 보아 학습 시간을 늘리고, 압축 잡음을 위한 초해상화 모델을 사용한다면 기존의 NNIC 보다 나은 성능을 보일 수 있는 가능성을 시사한다.

  • PDF

Suboptimal video coding for machines method based on selective activation of in-loop filter

  • Ayoung Kim;Eun-Vin An;Soon-heung Jung;Hyon-Gon Choo;Jeongil Seo;Kwang-deok Seo
    • ETRI Journal
    • /
    • v.46 no.3
    • /
    • pp.538-549
    • /
    • 2024
  • A conventional codec aims to increase the compression efficiency for transmission and storage while maintaining video quality. However, as the number of platforms using machine vision rapidly increases, a codec that increases the compression efficiency and maintains the accuracy of machine vision tasks must be devised. Hence, the Moving Picture Experts Group created a standardization process for video coding for machines (VCM) to reduce bitrates while maintaining the accuracy of machine vision tasks. In particular, in-loop filters have been developed for improving the subjective quality and machine vision task accuracy. However, the high computational complexity of in-loop filters limits the development of a high-performance VCM architecture. We analyze the effect of an in-loop filter on the VCM performance and propose a suboptimal VCM method based on the selective activation of in-loop filters. The proposed method reduces the computation time for video coding by approximately 5% when using the enhanced compression model and 2% when employing a Versatile Video Coding test model while maintaining the machine vision accuracy and compression efficiency of the VCM architecture.

Numerical optimization of flow uniformity inside an under body- oval substrate to improve emissions of IC engines

  • Om Ariara Guhan, C.P.;Arthanareeswaran, G.;Varadarajan, K.N.;Krishnan, S.
    • Journal of Computational Design and Engineering
    • /
    • v.3 no.3
    • /
    • pp.198-214
    • /
    • 2016
  • Oval substrates are widely used in automobiles to reduce the exhaust emissions in Diesel oxidation Catalyst of CI engine. Because of constraints in space and packaging Oval substrate is preferred rather than round substrate. Obtaining the flow uniformity is very challenging in oval substrate comparing with round substrate. In this present work attempts are made to optimize the inlet cone design to achieve the optimal flow uniformity with the help of CATIA V5 which is 3D design tool and CFX which is 3D CFD tool. Initially length of inlet cone and mass flow rate of exhaust stream are analysed to understand the effects of flow uniformity and pressure drop. Then short straight cones and angled cones are designed. Angled cones have been designed by two methodologies. First methodology is rotating flow inlet plane along the substrate in shorter or longer axis. Second method is shifting the flow inlet plane along the longer axis. Large improvement in flow uniformity is observed when the flow inlet plane is shifted along the direction of longer axis by 10, 20 and 30 mm away from geometrical centre. When the inlet plane is rotated again based on 30 mm shifted geometry, significant improvement at rotation angle of $20^{\circ}$ is observed. The flow uniformity is optimum when second shift is performed based on second rotation. This present work shows that for an oval substrate flow, uniformity index can be optimized when inlet cone is angled by rotation of flow inlet plane along axis of substrate.

A Method of Intra Mode Coding for Joint Exploration Model (JEM) (차세대 비디오 부호화 실험모델(JEM)의 화면내 예측 모드 부호화 기법)

  • Park, Dohyeon;Lee, Jinho;Kang, Jung Won;Kim, Jae-Gon
    • Journal of Broadcast Engineering
    • /
    • v.23 no.4
    • /
    • pp.495-502
    • /
    • 2018
  • JVET (Joint Video Exploration Team) which explored evolving technologies of video coding with capabilities beyond HEVC (High Efficiency Video Coding), released a references software codec named the Joint Exploration Model (JEM) for performance verification of coding technologies. JEM has 67 intra prediction modes that extend the 35 modes of HEVC for intra prediction. Therefore, the enhancement of the coding performance is limited due to the overhead of prediction mode coding. In this paper, we analyze the probabilities of prediction modes selections, and then we propose a more efficient intra prediction mode coding based on the results of analyzed mode occurrence. In addition, we propose a context modeling for CABAC (Context-Adaptive Binary Arithmetic Coding) of the proposed mode coding. Experimental results show that the BD-rate gain is 0.02% on the AI (All Intra) coding structure compared to JEM 7.0. We need to optimize context modeling for additional coding performance enhancement.

Emulsion Polymerization of Vinyl acetate-Butyl acrylate Copolymer (유화 중합에 의한 비닐 아세테이트-부틸 아크릴레이트 공중합체의 합성 연구)

  • 설수덕;임종민
    • Polymer(Korea)
    • /
    • v.28 no.2
    • /
    • pp.135-142
    • /
    • 2004
  • Poly(vinyl acetate) (PVAc) prepared by emulsion polymerization has broad applications for additives such as paint binder, adhesive for wood and paper due to its low glass transition temperature which help to plasticize substrate resins. Since emulsion polymerization has a disadvantage that surfactant and ionic initiator degrade properties of the product polymer, poly(vinyl acetate-co-butyl acrylate) (VVc-BA) was synthesized using potassium persulfate as catalyst and poly(vinyl alcohol) (PVA) as protective colloid to prevent the degradation. The copolymer latex product was internally plasticized and has enhanced colloid stability, adhesion, tensile strength and elongation. During VAc-BA emulsion polymerization, no coagulation and complete conversion occur with the reactant mixture of 0.7wt% potassium persulfate, 15wt% poly(vinyl alcohol) (PVA-217), and the balanced monomer that the weight ratio of vinyl acetate to butyl acrylate is 19. As the concentrations of PVA increase, the copolymerization becomes faster and polymer particles are more stable, resulting in enhanced mechanical stability of the VAc-BA copolymer. However, the size of the polymer particles decreases with increasing PVA contents. Properties of the VAc-BA copolymer, such as minimum film formation temperature, glass transition temperature, surface morphology, molecular weight and molecular weight distribution, tensile strength and elongation, were characterized using differential scanning calorimeter, transmission electron microscope and other instruments.