• Title/Summary/Keyword: MPEG-4 AVC

Search Result 123, Processing Time 0.019 seconds

Conditional Probability Based Early Termination of Recursive Coding Unit Structures in HEVC (HEVC의 재귀적 CU 구조에 대한 조건부 확률 기반 고속 탐색 알고리즘)

  • Han, Woo-Jin
    • Journal of Broadcast Engineering
    • /
    • v.17 no.2
    • /
    • pp.354-362
    • /
    • 2012
  • Recently, High Efficiency Video Coding (HEVC) is under development jointly by MPEG and ITU-T for the next international video coding standard. Compared to the previous standards, HEVC supports variety of splitting units, such as coding unit (CU), prediction unit (PU), and transform unit (TU). Among them, it has been known that the recursive quadtree structure of CU can improve the coding efficiency while the encoding complexity is increased significantly. In this paper, a simple conditional probability to predict the early termination condition of recursive unit structure is introduced. The proposed conditional probability is estimated based on Bayes' formula from local statistics of rate-distortion costs in encoder. Experimental results show that the proposed method can reduce the total encoding time by about 32% according to the test configuration while the coding efficiency loss is 0.4%-0.5%. In addition, the encoding time can be reduced by 50% with 0.9% coding efficiency loss when the proposed method was used jointly with HM4.0 early CU termination algorithm.

Dynamic Full-Scalability-Conversion in SVC (스케일러블 비디오 코딩에서의 실시간 스케일러빌리티 변환)

  • Lee, Dong-Su;Bae, Tae-Meon;Ro, Yong-Man
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.43 no.6 s.312
    • /
    • pp.60-70
    • /
    • 2006
  • Currently, Scalable Video Coding (SVC) is being standardized. By using scalability of SVC, QoS managed video streaming service is enabled in heterogeneous networks even with only one original bitstream. But current SVC is insufficient to dynamic video conversion for the scalability, thereby the adaptation of bitrate to meet a fluctuating network condition is limited. In this paper, we propose dynamic full-scalability conversion method for QoS adaptive video streaming in H.264/AVC SVC. To accomplish full scalability dynamic conversion, we propose corresponding bitstream extraction, encoding and decoding schemes. On the encoder, we newly insert the IDR NAL to solve the problems of spatial scalability conversion. On the extractor, we analyze the SVC bitstream to get the information which enable dynamic extraction. By using this information, real time extraction is achieved. Finally, we develop the decoder so that it can manage changing bitrate to support real time full-scalability. The experimental results showed that dynamic full-scalability conversion was verified and it was necessary for time varying network condition.

Rate-Distortion Model for HEVC Quadtree Coding (HEVC 쿼드트리 부호화를 위한 율-왜곡 모델)

  • Lee, Bumshik;Kim, Munchurl
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2011.07a
    • /
    • pp.169-172
    • /
    • 2011
  • 최근 ISO/IEC의 MPEG과 ITU-T의 VCEG이 JCT-VC (Joint Collaborative Team for Video Coding)를 구성하여 HEVC (High Efficiency Video Coding) 차세대 비디오 압축 표준 제정을 위한 작업을 진행 중이다. 과거 압축률이 가장 좋은 것으로 알려진 H.264/AVC 보다 최대 50%까지 부호화 효율 향상을 목표로 하고 있다. HEVC는 H.264/AVC와는 상이한 부호화 구조를 채택하고 있고 작은 크기의 영상뿐만 아니라 크기가 큰 영상까지도 효율적으로 부호화할 수 있도록 설계되고 있다. 예측 및 변환 부호화 과정이 계층적 쿼드트리 구조를 가지며, 특히 변환 부호화는 작은 크기의 변환 블록으로부터 $32{\times}32$ 크기의 변환 블록까지 크게 확장되어 계층적 변환 구조를 이루며 부호화하도록 되어 있다. 본 논문에서는 기존 코덱과는 상이한 부호화 구조를 갖는 쿼드트리 부호화 기반 HEVC 코덱 표준을 위한 율-왜곡 (Rate-Distortion) 모델을 제안한다. 기존의 코덱에서는 부호화되는 기본 단위가 $16{\times}16$로 일정하고, 변환 및 양자화되는 블록의 크기 역시 $4{\times}4$또는 $8{\times}8$ 크기 단위로 그 블록의 크기가 작을 뿐만 아니라 고정된 크기를 사용한다. 따라서 단일 확률 모형을 사용하여 율-왜곡 모델을 만들었으며, 그 정확도 역시 비교적 정확한 결과를 얻었다. 그러나 HEVC에서는 계층적 가변 블록 크기를 갖는 기본 부호화, 예측 및 변환/양자화 기법을 사용하기 때문에 기존의 단일 모델로는 정확한 율-왜곡 모델을 만들어 내기 어렵다. 제안하는 방법은 HEVC의 기본 단위인 CU (Coding Unit)별로 독립적인 확률 모형을 사용하여 율-왜곡모델을 사용하는 것으로 CU의 크기가 가변적이고 CU 내의 텍스처 역시 크기에 따라 매우 다른 특성을 가지고 있기 때문에 단일 모델을 사용하는 것보다 매우 효율적인 것을 실험을 통하여 확인하였다.

  • PDF

Motion Estimation Specific Instructions and Their Hardware Architecture for ASIP (ASIP을 위한 움직임 추정 전용 연산기 구조 및 명령어 설계)

  • Hwang, Sung-Jo;SunWoo, Myung-Hoon
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.48 no.3
    • /
    • pp.106-111
    • /
    • 2011
  • This paper presents an ASIP (Application-specific Instruction Processor) for motion estimation that employs specific IME instructions and its programmable and reconfigurable hardware architecture for various video codecs, such as H.264/AVC, MPEG4, etc. With the proposed specific instructions and hardware accelerator, it can handle the real-time processing requirement of High Definition (HD) video. With the parallel operations and SAD unit control using pattern information, the proposed IME instruction supports not only full search algorithm but also other fast search algorithms. The hardware size is 77K gates for each Processing Element Group (PEG) which has 256 SAD PEs. The proposed ASIP runs at 160MHz with sixteen PEGs and it can handle 1080p@30 frame in real time.

Adaptive Model-Based Quantization Parameter Decision for Video Rate Control (비디오 비트율 제어를 위한 적응적 모델 기반의 양자화 변수 결정 방법)

  • Kim, Seon-Ki;Ho, Yo-Sung
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.32 no.4C
    • /
    • pp.411-417
    • /
    • 2007
  • The rate control is an essential component in video coding to provide better quality under given coding constraints, such as channel capacity, frame rates, etc. In general, source data cannot be described as a single distribution in a video coding, hence it can cause an exhaustive approximation problem. It drops a coding efficiency under weak channel environments, such as mobile communications. In this paper, we design a new quantization parameter decision model that is based on a rate-distortion function of generalized Gaussian distribution. In order to adaptively express various source data distribution, we decide a shape parameter by observing a ratio of samples, which have a small value. For experiment, the proposed algorithm is implemented into H.264/AVC video codec, and its performance is compared with that of MPEG-2 TM5, H.263 TMN8 rate control algorithm. As shown in simulation results, the proposed algorithm provides an improved quality rather than previous algorithms and generates the number of bits closed to the target bits.

Hardware Implementation of Integer Transform and Quantization for H.264 (하드웨어 기반의 H.264 정수 변환 및 양자화 구현)

  • 임영훈;정용진
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.28 no.12C
    • /
    • pp.1182-1191
    • /
    • 2003
  • In this paper, we propose a new hardware architecture for integer transform, quantizer, inverse quantizer, and inverse integer transform of a new video coding standard H.264/JVT. We describe the algorithm and derive hardware architecture emphasizing the importance of area for low cost and low power consumption. The proposed architecture has been verified by PCI-interfaced emulation board using APEX-II Alters FPGA and also by ASIC synthesis using Samsung 0.18 um CMOS cell library. The ASIC synthesis result shows that the proposed hardware can operate at 100 MHz, processing more than 1,300 QCIF video frames per second. The hardware is going to be used as a core module when implementing a complete H.264 video encoder/decoder ASIC for real-time multimedia application.

A Study on the Method of Minimizing the Bit-Rate Overhead of H.264 Video when Encrypting the Region of Interest (관심영역 암호화 시 발생하는 H.264 영상의 비트레이트 오버헤드 최소화 방법 연구)

  • Son, Dongyeol;Kim, Jimin;Ji, Cheongmin;Kim, Kangseok;Kim, Kihyung;Hong, Manpyo
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.28 no.2
    • /
    • pp.311-326
    • /
    • 2018
  • This paper has experimented using News sample video with QCIF ($176{\times}144$) resolution in JM v10.2 code of H.264/AVC-MPEG. The region of interest (ROI) to be encrypted occurred the drift by unnecessarily referring to each frame continuously in accordance with the characteristics of the motion prediction and compensation of the H.264 standard. In order to mitigate the drift, the latest related research method of re-inserting encrypted I-picture into a certain period leads to an increase in the amount of additional computation that becomes the factor increasing the bit-rate overhead of the entire video. Therefore, the reference search range of the block and the frame in the ROI to be encrypted is restricted in the motion prediction and compensation for each frame, and the reference search range in the non-ROI not to be encrypted is not restricted to maintain the normal encoding efficiency. In this way, after encoding the video with restricted reference search range, this article proposes a method of RC4 bit-stream encryption for the ROI such as the face to be able to identify in order to protect personal information in the video. Also, it is compared and analyzed the experimental results after implementing the unencrypted original video, the latest related research method, and the proposed method in the condition of the same environment. In contrast to the latest related research method, the bit-rate overhead of the proposed method is 2.35% higher than that of the original video and 14.93% lower than that of the latest related method, while mitigating temporal drift through the proposed method. These improved results have verified by experiments of this study.

Development of ATSC3.0 based UHDTV Broadcasting System providing Ultra-high-quality Service that supports HDR/WCG Video and 3D Audio, and a Fixed UHD/Mobile HD Service (HDR/WCG 비디오와 3D 오디오를 지원하는 초고품질 방송서비스와 고정 UHD/이동 HD 방송 서비스를 제공하는 ATSC 3.0 기반 UHDTV 방송 시스템 개발)

  • Ki, Myungseok;Seok, Jinwuk;Beack, Seungkwon;Jang, Daeyoung;Lee, Taejin;Kim, Hui Yong;Oh, Hyeju;Lim, Bo-mi;Bae, Byungjun;Kim, Heung Mook;Choi, Jin Soo
    • Journal of Broadcast Engineering
    • /
    • v.22 no.6
    • /
    • pp.829-849
    • /
    • 2017
  • Due to the large-scale TV display, the convergence of broadcasting and broadband, and the advancement of signal compression and transmission technology, terrestrial digital broadcasting has evolved into UHD broadcasting capable of providing simultaneous broadcasting of fixed UHD and mobile HD. The Korean standard for terrestrial UHDTV broadcasting is based on ATSC 3.0, the broadcasting standard of North America. The terrestrial UHDTV broadcasting standard chose that as a new AV codec standard, HEVC video codec which can compress with higher efficiency compared to AVC, and MPEG-H 3D audio codec for realistic audio. Also, DASH and MMT are adopted as transmission format instead of MPEG-2 TS to support broadband as well as broadcasting network, and in order to provide 4K UHD/mobile HD service simultaneously ROUTE multiplexing technology is applied. In this paper, we propose an audio/video encoder, which is required to provide HDR/WCG supported high quality video service, 10.2 channel/4 object supporting stereo sound service, fixed UHD and mobile HD simultaneous broadcasting service based on ATSC3.0, also we implemented the ATSC 3.0 LDM system for ROUTE/DASH packager, multiplexing system and physical layer transmission/reception, and verified the service ability by applying it to real time broadcast environment.

Fast Motion Estimation with Adaptive Search Range Adjustment using Motion Activities of Temporal and Spatial Neighbor Blocks (시·공간적 주변 블록들의 움직임을 이용하여 적응적으로 탐색 범위 조절을 하는 고속 움직임 추정)

  • Lee, Sang-Hak
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.5 no.4
    • /
    • pp.372-378
    • /
    • 2010
  • This paper propose the fast motion estimation algorithm with adaptive search range adjustment using motion activities of temporal and spatial neighbor blocks. The existing fast motion estimation algorithms with adaptive search range adjustment use the maximum motion vector of all blocks in the reference frame. So these algorithms may not control a optimum search range for slow moving block in current frame. The proposed algorithm use the maximum motion vector of neighbor blocks in the reference frame to control a optimum search range for slow moving block. So the proposed algorithm can reduce computation time for motion estimation. The experiment results show that the proposed algorithm can reduce the number of search points about 15% more than Simple Dynamic Search Range(SDSR) algorithm while maintaining almost the same bit-rate and motion estimation error.

A Study on Adaptive GOP Structure for SVC (스케일러블 동영상 부호화를 위한 적응적 GOP 구조에 관한 연구)

  • Jeong, Se-Yoon;Park, Min-Woo;Park, Gwang-Hoon;Kim, Kyeu-Heon;Hong, Jin-Woo
    • Journal of Broadcast Engineering
    • /
    • v.10 no.4 s.29
    • /
    • pp.463-473
    • /
    • 2005
  • In this paper, we propose Adaptive GOP Structure to enhance the coding performance of scalable video coding in JVT. Adaptive GOP Structure considers temporal variances of video sequence. In general, SVC encodes a video sequence with fixed GOP size. The coding performance is varying according to the temporal variance of a video sequence. Thus, Adaptive GOP Structure method is proposed. It selects GOP size adaptively by considering temporal variance of video sequence. In the experiments, the propose method showed the enhanced coding performance in most sequences. The PSNR gain of Crew sequence is up to 0.63 dB.