• Title/Summary/Keyword: H.264/AVC Video Coding

Search Result 326, Processing Time 0.016 seconds

Selective Inter-layer Residual Prediction Coding and Fast Mode Decision for Spatial Enhancement Layers in Scalable Video Coding (스케일러블 비디오 부호화에서 선택적 계층간 차분 신호 부호화 및 공간적 향상 계층에서의 모드 결정)

  • Lee, Bum-Shik;Hahm, Sang-Jin;Park, Chang-Seob;Park, Keun-Soo;Kim, Mun-Churl
    • Journal of Broadcast Engineering
    • /
    • v.12 no.6
    • /
    • pp.596-610
    • /
    • 2007
  • In order to reduce the complexity of SVC encoding, we introduce a fast mode decision method in the enhancement layers of spatial scalability by selectively performing the inter-layer residual prediction of SVC. The Inter-layer residual prediction coding in Scalable Video Coding has a large advantage of enhancing the coding efficiency since it utilizes the correlation between two residuals from a lower spatial layer and its next higher spatial layer. However, this entails the dramatical increase in the complexity of SVC encoders. The proposed method is to analyze the characteristics of integer transform coefficients for the subtracted signal for two residuals from lower and upper spatial layers. Then it selectively performs the inter-layer residual prediction coding and rate-distortion optimizations in the upper spatial enhancement layer if the SAD values of residuals exceed adaptive threshold values. Therefore, by classifying the residuals according to the properties of integer-transform coefficients only with SAD of residuals between two layers, the SVC encoder can perform the inter-layer residual coding selectively, thus significantly reducing the total required encoding time. The proposed method results in reduction of the total encoding time with 51.5% in average while maintaining the RD performance with negligible amounts of quality degradation.

Frame Complexity-Based Adaptive Bit Rate Normalization (프레임 복잡도를 고려한 적응적 비트율 정규화 방법)

  • Park, Sang-Hyun
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.10 no.12
    • /
    • pp.1329-1336
    • /
    • 2015
  • Due to the advances in hardware technologies for low-power CMOS cameras, there have been various researches on wireless video sensor network(WVSN) applications including agricultural monitoring and environmental tracking. In such a system, its core technologies include video compression and wireless transmission. Since data of video sensors are bigger than those of other sensors, it is particularly necessary to estimate precisely the traffic after video encoding. In this paper, we present an estimation method for the encoded video traffic in WVSN networks. To estimate traffic characteristics accurately, the proposed method first measures complexities of frames and then applies them to the bit rate estimation adaptively. It is shown by experimental results that the proposed method improves the estimation of bit rate characteristics by more than 12% as compared to the existing method.

An Efficient Coding Technique of Holographic Video Signal using 3D Segment Scanning (분할영역의 3차원 스캐닝을 이용한 홀로그래픽 비디오 신호의 효율적인 부호화 기술)

  • Seo, Young-Ho;Choi, Hyun-Jun;Kim, Dong-Wook
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.32 no.2C
    • /
    • pp.132-140
    • /
    • 2007
  • In this paper, we proposed a new technique to encode and decode the digital hologram. Since the digital hologram (or fringe pattern) is generated by interference of light, it has much different property from natural 2D (2 dimensional) images. First, we acquisite optical-sensed or computer-generated hologram by digital type, and then extract a chrominance component. The extracted digital hologram for coding is separated into segments to use multi-view properties. The segmented hologram shows the similar characteristics with picturing an object with 2D cameras in various point of view. Since fringe pattern is visually observed like as noise, we expect that the fringe pattern has poor coding efficiency. To obtain high efficiency, the segment is transformed with DCT (Discrete Cosine Transform) which resembles hologram generation process with high performance. Each transformed segment passes the 3D scanning process according to time and spatial correlation, and is organized into a video stream. Since the segment which correspond to frame of a video stream consists of the transformed coefficients with wide range of value, it is classified and re-normalized. Finally it is compressed with coding tools. The proposed algorithm illustrated that it has better properties for reconstruction of 16 times higher compression rate than the previous researches.

The Analysis of Flickering Artifact of Multimedia Video in DMB Environment (DMB 환경에서 멀티미디어 동영상의 화면 떨림 현상 분석)

  • Lim, Jong-Min;Kang, Dong-Wook;Jung, Kyeong-Hoon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.11a
    • /
    • pp.111-114
    • /
    • 2009
  • 멀티미디어 동영상의 부호화 알고리즘은 기본적으로 손실 부호화(lossy coding)이기 때문에 압축된 동영상에는 불가피하게 화질의 열화가 포함된다. 압축의 정도가 심하지 않거나 압축을 하더라도 영상이 단순하고 움직임이 적은 경우에는 이러한 열화의 영향이 인간의 눈에는 보이지 않지만, 압축률이 너무 높거나 원본 영상에 포함된 움직임이 복잡하거나 공간적으로 세밀한 영역이 많은 경우에는 손실이 심하여 시각적으로 거슬리는 다양한 잡음이 발생한다. 화면 떨림(flickering)은 시간 영역에서의 대표적인 부호화 잡음으로서 프레임율(frame rate)이 너무 낮은 경우에 움직임이 자연스럽지 않고 불연속적으로 튀거나 부호화 비트율(bit rate)을 제어하기 위해서 양자화 파라미터를 변동하는 과정에서 연속되는 프레임들의 화질에 차이를 보이면서 나타나는 현상이다. 본 논문에서는 DMB 환경에서 사용되는 주기적으로 I 프레임이 삽입되는 GOP(Group of Picture) 구조에서 화면 떨림 현상의 원인을 파악하고 멀티미디어 동영상의 시공간적 특성에 따라 특징적으로 나타나는 화면 떨림 현상을 분석한다. 실험 영상으로서 시공간적으로 다양한 특성을 보이며 H.264/AVC방식으로 부호화된 10개의 동영상을 선택하였으며, 양자화 파라미터를 낮은 화질의 영상에서부터 높은 화질의 영상까지 변화시키면서 화면 떨림 현상의 존재 여부 및 정도를 살펴보았다. 실험의 결과로 화면 떨림 현상은 영상의 화질이 너무 좋거나 너무 나쁘지 않은 중간 영역에서 중요하게 고려해야 할 부호화 잡음이며, 프레임 내의 세밀함의 정도, 움직임의 정도, 객체의 크기, 카메라의 움직임 등에 영향을 받음을 알 수 있다.

  • PDF

Initial QP Determination Algorithm using Bit Rate Model (비트율 모델을 이용한 초기 QP 결정 알고리즘)

  • Park, Sang-Hyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.9
    • /
    • pp.1947-1954
    • /
    • 2012
  • The first frame is encoded in intra mode which generates a larger number of bits. In addition, the first frame is used for the inter mode encoding of the following frames. Thus the initial QP for the first frame affects the first frame as well as the following frames. Traditionally, the initial QP is determined among four constant values only depending on the bpp. In the case of low bit rate video coding, the initial QP value is fixed to 40 regardless of the output bandwidth. Although this initialization scheme is simple, yet it is not accurate enough. An accurate initial QP prediction scheme should not only depends on bpp but also on the complexity of the video sequence and the output bandwidth. In the proposed scheme, we determine the initial QP according to the ratio of the first frame to the total bits allocated to a GOP. To estimate the QP of the allocated bits, Rate-QP model is used. It is shown by experimental results that the new algorithm can predict the optimal initial QP more accurately and generate the PSNR performance better than that of the existing JVT algorithm.

Optimal QP Determination Method for Adaptive Intra Frame Encoding (적응적 인트라 프레임 압축을 위한 최적 QP 결정 방법)

  • Park, Sang-Hyun
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.10 no.9
    • /
    • pp.1009-1018
    • /
    • 2015
  • In video coding, the first frame of a GOP is encoded in intra mode which generates a larger number of bits. In addition, the first frame is used for the inter mode encoding of the following frames. Thus the intial QP for the first frame encoding affects the first frame as well as the following frames. To determine the initial QP that maximizes the PSNR of a GOP, several algorithms which uses the ratio of the PSNR of the I frame and the PSNR of P frames of a GOP have been proposed. In this paper, we propose a new traffic model that can be used to determine the optimal initial QP simply and exactly in algorithms that use the PSNR ratio. We first analyze the characteristics of the PSNR ratio of I and P frames and the PSNR of a GOP, and then propose a new traffic model which can represent the characteristics and determine the optimal intial QP. It is shown by experimental results that the initial QP determination method with the proposed model can predict an optimal initial QP whose difference from the optimal value is less than 2. The proposed scheme can also generate the PSNR performance better than that of the existing JVT algorithm.