• Title/Summary/Keyword: random media

Search Result 364, Processing Time 0.026 seconds

A Context-based Fast Encoding Quad Tree Plus Binary Tree (QTBT) Block Structure Partition

  • Marzuki, Ismail;Choi, Hansol;Sim, Donggyu
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2018.06a
    • /
    • pp.175-177
    • /
    • 2018
  • This paper proposes an algorithm to speed up block structure partition of quad tree plus binary tree (QTBT) in Joint Exploration Test Model (JEM) encoder. The proposed fast encoding of QTBT block partition employs three spatially neighbor coded blocks, such as left, top-left, and top of current block, to early terminate QTBT block structure pruning. The propose algorithm is organized based on statistical similarity of those spatially neighboring blocks, such as block depths and coded block types, which are coded with overlapped block motion compensation (OBMC) and adaptive multi transform (AMT). The experimental results demonstrate about 30% encoding time reduction with 1.3% BD-rate loss on average compared to the anchor JEM-7.1 software under random access configuration.

  • PDF

Modified LIC for Bi Prediction to Reduce Complexity (양방향 예측에서 LIC 복잡도를 줄이기 위한 방법)

  • Choi, Hansol;Park, Juntaek;Lee, Jongseok;Sim, Donggyu;Bang, Gun;Kim, Huiyong
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2018.11a
    • /
    • pp.163-164
    • /
    • 2018
  • 본 논문에서는 양방향 예측 모드에서 LIC(Local Illuminance Compensation)의 복잡도 감소를 위한 방법을 제안한다. 제안하는 방법은 현재 CU 가 양방향 예측 모드일 경우 움직임 보상 단계에서 예측 블록 두 개를 가중 평균을 낸 뒤 LIC 를 수행함으로써 양방향 예측모드에서 LIC 를 한번만 수행하여 기존 LIC 대비 낮은 복잡도를 보인다. 제안하는 방법의 성능 평가를 위하여 RA(Random Access) 조건에서 MPEG 표준 실험 영상의 B, C, D 클래스를 이용하여 BD-rate 를 VTM-2.0.1 과 비교한다. 실험결과로서 본 논문에서 제안하는 방법이 VTM2.0.1 대비 BD-rate 성능 관점에서 Y, U, V 각각에 대하여 평균 0.17, 0.17, 0.23 의 성능 향상을 보인다.

  • PDF

Neural Network based Video Coding in JVET

  • Choi, Kiho
    • Journal of Broadcast Engineering
    • /
    • v.27 no.7
    • /
    • pp.1021-1033
    • /
    • 2022
  • After the Versatile Video Coding (VVC)/H.266 standard was completed, the Joint Video Exploration Team (JVET) began to investigate new technologies that could significantly increase coding gain for the next generation video coding standard. One direction is to investigate signal processing based tools, while the other is to investigate Neural Network based technology. Neural Network based Video Coding (NNVC) has not been studied previously, and this is the first trial of such an approach in the standard group. After two years of research, JVET produced the first common software called Neural Compression Software (NCS) with two NN-based in-loop filtering tools at the 27th meeting and began to maintain NN-based technologies for the common experiment. The coding performances of the two filters in NCS-1.0 are shown to be 8.71% and 9.44% on average in a random access scenario, respectively. All the material related to NCS can be found in the repository of the JVET. In this paper, we provide a brief overview and review of the NNVC activity studied in JVET in order to provide trend and insight for the new direction of video coding standard.

GAN-based Quality Enhancement of Compressed Video (GAN 을 이용한 압축된 동영상 품질 향상)

  • Yongseong Kim;Yujin Lee;Bumyoon Kim;Byeungwoo Jeon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.11a
    • /
    • pp.191-192
    • /
    • 2022
  • 본 논문에서는 딥러닝의 주요 기법 중 하나인 GAN 을 활용하여 압축된 영상의 품질을 개선하는 방법을 제안한다. 제안하는 GAN 의 생성자는 U-Net 과 ResNet 을 기반으로 구성되었으며, 판별자는 합성곱층과 전연결층으로 구성하였다. 네트워크의 학습은 HEVC (High Efficiency Video Coding)의 테스트 모델인 HM16.25 를 사용하여 RA (Random Access) 구성하에 양자화 계수 37 로 압축된 영상을 입력으로 하여 수행되었다. 제안하는 네트워크의 성능 확인을 위해 학습 시와 동일한 조건으로 압축된 다른 영상을 입력으로 하여 실험하였다. 실험 결과 영상의 평균 PSNR 은 34.20dB 에서 34.24dB 로 0.04dB 의 품질 향상이 이루어진 것을 확인할 수 있었다.

  • PDF

Depth Image-based Ground Detection and Altitude Measurement Method (깊이영상을 이용한 지면 검출 및 고도 측정 방법)

  • Cheon, Muho;Jeon, Byeungwoo
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • fall
    • /
    • pp.180-182
    • /
    • 2021
  • 본 논문에서는 드론의 비행 장소와 온도 및 습도에 영향을 받지 않는 적외선 기반 깊이 카메라로부터 얻어진 깊이영상을 분석하여 지면 영역을 찾고 AGL(Above Ground Level) 단위의 고도를 측정하는 방법을 제안한다. Decimation filter 와 Median filter 를 적용하여 잡음 및 빈 데이터들을 제거한 깊이영상으로부터 RANSAC (RANdom Sample Consensus) 기반 평면 모델 추정 방법을 이용하여 지면 영역과 이에 대한 평면의 방정식을 유추하고 현재 위치와의 거리를 계산한다. 성능 평가를 위해 Lidar 센서와 비교한 결과, 제안 방법이 지면에 위치한 장애물에 영향을 더 적게 받으며, 자세 정보와 독립적으로 고도를 측정할 수 있었다.

  • PDF

A multi-label Classification of Attributes on Face Images

  • Le, Giang H.;Lee, Yeejin
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2021.06a
    • /
    • pp.105-108
    • /
    • 2021
  • Generative adversarial networks (GANs) have reached a great result at creating the synthesis image, especially in the face generation task. Unlike other deep learning tasks, the input of GANs is usually the random vector sampled by a probability distribution, which leads to unstable training and unpredictable output. One way to solve those problems is to employ the label condition in both the generator and discriminator. CelebA and FFHQ are the two most famous datasets for face image generation. While CelebA contains attribute annotations for more than 200,000 images, FFHQ does not have attribute annotations. Thus, in this work, we introduce a method to learn the attributes from CelebA then predict both soft and hard labels for FFHQ. The evaluated result from our model achieves 0.7611 points of the metric is the area under the receiver operating characteristic curve.

  • PDF

Hologram Generation with Random Phase based on 3D Volumetric Model (3D 체적형 모델 기반의 랜덤 위상을 갖는 홀로그램 생성)

  • Lee, Sol;Kim, Kyung-Jin;Kim, Jin-Kyum;Park, Byung-Seo;Kim, Dong-Wook;Seo, Young-Ho
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2020.07a
    • /
    • pp.323-324
    • /
    • 2020
  • 논문에서는 3D 체적형 모델을 이용하여 홀로그램에 랜덤 위상 효과를 주는 방법을 제안한다. CGH(Computer Generated Hologram)에서 랜덤 위상의 추가는 실제 촬영하여 획득한 홀로그램에서 물체 표면의 난반사에 대한 영향을 고려한 것이다. 이 랜덤 위상은 생성한 홀로그램의 광 시야각 확장 효과가 있다. 하지만 이것은 랜덤으로 발생하기 때문에 홀로그램 시퀀스를 생성할 때 같은 객체 표면에 대해서 고정된 효과를 줄 수 없다. 본 논문에서는 CGH를 진행할 때 물체의 고유한 랜덤 위상 추가를 위해 3D 체적형 모델을 사용하는 방법을 제안한다.

  • PDF

Transmission of Continuous Media by Send-rate Control and Packet Drop over a Packer Network (패킷망에서 전송율 제어와 패킷 폐기에 의한 연속 미디어 전송방안)

  • 배시규
    • Proceedings of the Korea Society for Industrial Systems Conference
    • /
    • 1999.12a
    • /
    • pp.121-129
    • /
    • 1999
  • When continuous media are transmitted over the communication networks, asynchrony which can not maintain temporal relationships among packets may occur due to a random transit delay. There exist two types of synchronization schemes ; for guaranteed or non-guaranteed resource networks. The former which applies a resource reservation technique maintains delay characteristics, however, the latter supply a best-effort service. In this paper, I propose a intra-media synchronization scheme to transmit continuous media on general networks not guaranteeing a bounded delay tome. The scheme controls transmission times of the packets by estimating next delay time with the delay distribution. So, the arriving packets may be maintained within a limited delay boundary, and playout will be performed after buffering to smoothen small delay variations. The continually increasing delay due to network overload causes buffer underflow at the receiver. To solve it, the transmitter is required to speed up instantaneously. Too much increase of transmission-rate may cause network congestion. At that time, the transmitter drops the current packet when informed excessive delay from the receiver.

  • PDF

Voting-based Intra Mode Bit Skip Using Pixel Information in Neighbor Blocks (이웃한 블록 내 화소 정보를 이용한 투표 결정 기반의 인트라 예측 모드 부호화 생략 방법)

  • Kim, Ji-Eon;Cho, Hye-Jeong;Jeong, Se-Yoon;Lee, Jin-Ho;Oh, Seoung-Jun
    • Journal of Broadcast Engineering
    • /
    • v.15 no.4
    • /
    • pp.498-512
    • /
    • 2010
  • Intra coding is an indispensable coding tool since it can provide random accessibility as well as error resiliency. However, it is the problem that intra coding has relatively low coding efficiency compared with inter coding in the area of video coding. Even though H.264/AVC has significantly improved the intra coding performance compared with previous video standards, H.264/AVC encoder complexity is significantly increased, which is not suitable for low bit rate interactive services. In this paper, a Voting-based Intra Mode Bit Skip (V-IMBS) scheme is proposed to improve coding efficiency as well as to reduce encoding time complexity using decoder-side prediction. In case that the decoder can determine the same prediction mode as what is chosen by the encoder, the encoder does not send that intra prediction mode; otherwise, the conventional H.264/AVC intra coding is performed. Simulation results reveal a performance increase up to 4.44% overall rate savings and 0.24 dB in peak signal-to-noise ratio while the frame encoding speed of proposed method is about 42.8% better than that of H.264/AVC.

A Fast Decision Method of Quadtree plus Binary Tree (QTBT) Depth in JEM (차세대 비디오 코덱(JEM)의 고속 QTBT 분할 깊이 결정 기법)

  • Yoon, Yong-Uk;Park, Do-Hyun;Kim, Jae-Gon
    • Journal of Broadcast Engineering
    • /
    • v.22 no.5
    • /
    • pp.541-547
    • /
    • 2017
  • The Joint Exploration Model (JEM), which is a reference SW codec of the Joint Video Exploration Team (JVET) exploring the future video standard technology, provides a recursive Quadtree plus Binary Tree (QTBT) block structure. QTBT can achieve enhanced coding efficiency by adding new block structures at the expense of largely increased computational complexity. In this paper, we propose a fast decision algorithm of QTBT block partitioning depth that uses the rate-distortion (RD) cost of the upper and current depth to reduce the complexity of the JEM encoder. Experimental results showed that the computational complexity of JEM 5.0 can be reduced up to 21.6% and 11.0% with BD-rate increase of 0.7% and 1.2% in AI (All Intra) and RA (Random Access), respectively.