• Title/Summary/Keyword: Multi-view Video Coding

Search Result 109, Processing Time 0.022 seconds

An Improved Motion/Disparity Vector Prediction for Multi-view Video Coding (다시점 비디오 부호화를 위한 개선된 움직임/변이 벡터 예측)

  • Lim, Sung-Chang;Lee, Yung-Lyul
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.45 no.2
    • /
    • pp.37-48
    • /
    • 2008
  • Generally, a motion vector and a disparity vector represent the motion information of an object in a single-view of camera and the displacement of the same scene between two cameras that located spatially different from each other, respectively. Conventional H.264/AVC does not use the disparity vector in the motion vector prediction because H.264/AVC has been developed for the single-view video. But, multi-view video coding that uses the inter-view prediction structure based on H.264/AVC can make use of the disparity vector instead of the motion vector when the current frame refers to the frame of different view. Therefore, in this paper, we propose an improved motion/disparity vector prediction method that consists of global disparity vector replacement and extended neighboring block prediction. From the experimental results of the proposed method compared with the conventional motion vector prediction of H.264/AVC, we achieved average 1.07% and 1.32% of BD (Bjontegaard delta)-bitrate saving for ${\pm}32$ and ${\pm}64$ of global vector search range, respectively, when the search range of the motion vector prediction is set to ${\pm}16$.

A Study on the Scalable Structure for Motion Picture Coding (동영상 부호화를 위한 scalable 구조에 관한 연구)

  • Shin, Joong-In;Han, Young-Oh;Kim, Hyung-Kon;Park, Sang-Hui
    • Proceedings of the KIEE Conference
    • /
    • 1993.11a
    • /
    • pp.342-345
    • /
    • 1993
  • In this paper, we study the structure of the hierarchical coding method of video signal which can contain the multi resolution video signals. To preserve the compatibility with the conventional coding methods. we accomplished a scalable structure using the subband coding, maintaining enoughly the international coding structure. The proposed scheme showed the low PSNR, a little, when compared with the conventional scheme, but showed a good image quality perceptually and proved to have a advantage in the H/W implementation in a view of processing speed.

  • PDF

Multi-View Color Video and Depth Map Coding based on HEVC (HEVC 기반 다시점 컬러 영상 및 깊이 정보 맵 부호화 방법)

  • Yoo, Sun-Mi;Nam, Jung-Hak;Lim, Woong;Sim, Dong-Gyu;Cheong, Won-Sik;Hur, Nam-Ho
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.49 no.2
    • /
    • pp.83-93
    • /
    • 2012
  • This paper proposes a method to efficiently encode multi-view color videos and depth maps. The proposed coding method for multi-view color videos and depth maps can improve the coding efficiency by additional inter-view prediction, as well as inter-frame prediction. By means of the proposed method, we achieved the coding gain of approximately 55% for 2-view color videos and approximately 12% for 2-view depth maps. For 3-view case, we found that the proposed system yields 54% of coding gain from outer view color videos and 56% of coding gain from center view color videos, respectively. Moreover, for 3-view depth map case, approximately 11% of coding gain from outer view and 13% of coding gain from center view are obtained with the proposed coder, respectively.

Inter-layer Texture and Syntax Prediction for Scalable Video Coding

  • Lim, Woong;Choi, Hyomin;Nam, Junghak;Sim, Donggyu
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.4 no.6
    • /
    • pp.422-433
    • /
    • 2015
  • In this paper, we demonstrate inter-layer prediction tools for scalable video coders. The proposed scalable coder is designed to support not only spatial, quality and temporal scalabilities, but also view scalability. In addition, we propose quad-tree inter-layer prediction tools to improve coding efficiency at enhancement layers. The proposed inter-layer prediction tools generate texture prediction signal with exploiting texture, syntaxes, and residual information from a reference layer. Furthermore, the tools can be used with inter and intra prediction blocks within a large coding unit. The proposed framework guarantees the rate distortion performance for a base layer because it does not have any compulsion such as constraint intra prediction. According to experiments, the framework supports the spatial scalable functionality with about 18.6%, 18.5% and 25.2% overhead bits against to the single layer coding. The proposed inter-layer prediction tool in multi-loop decoding design framework enables to achieve coding gains of 14.0%, 5.1%, and 12.1% in BD-Bitrate at the enhancement layer, compared to a single layer HEVC for all-intra, low-delay, and random access cases, respectively. For the single-loop decoding design, the proposed quad-tree inter-layer prediction can achieve 14.0%, 3.7%, and 9.8% bit saving.

3D 비디오 부호화 표준 기술

  • Park, Si-Nae;Sim, Dong-Gyu
    • The Magazine of the IEIE
    • /
    • v.37 no.9
    • /
    • pp.33-41
    • /
    • 2010
  • 최근 디스플레이 기술의 비약적인 발전과 함께 3D 영화의 흥행으로 인해 국내 뿐 아니라 전 세계적으로 3DTV에 대한 관심이 높아지고 있다. 3DTV는 사람의 눈 사이의 간격 때문에 두 눈에 맺히는 상이 달라지는 양안시차의 원리를 이용하는 기술로, 두 눈에 맺힐 두 영상을 각각 획득하고, 이를 사람의 두 눈에 각각 보여지도록 하는 방식으로 3차원 입체 비디오를 실현하게 된다. 이를 위한 3D 비디오의 부호화 표준 기술로는 기존에 MPEG-2 stereo 및 MPEG-C part 2가 ISO/IEC MPEG을 통하여 제정된바 있으며, 최근에는 ITU-T의 VCEG과 ISO/IEC MPEG이 비디오 부호화 표준을 위하여 Joint Video Coding (JVT)를 구성하여, Multi-view Video Coding (MVC)를 제정하였다. 그리고 현재 진행되는 3D비디오 관련 표준화로는 MPEG에서 Free view-point TV (FTV)등의 응용을 위한 3DV라는 이름으로 차세대 비디오 표준을 준비하고 있다. 본 고에서는 기존에 MPEG을 통해 진행된 3DTV 관련 표준화 기술을 알아보고, 현재 진행되고 있는 3DTV 부호화 표준화 동향을 살펴본다.

  • PDF

Illumination Mismatch Compensation Algorithm based on Layered Histogram Matching by Using Depth Information (깊이 정보에 따른 레이어별 히스토그램 매칭을 이용한 조명 불일치 보상 기법)

  • Lee, Dong-Seok;Yoo, Ji-Sang
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.35 no.8C
    • /
    • pp.651-660
    • /
    • 2010
  • In this paper, we implement an efficient histogram-based prefiltering to compensate the illumination mismatches in regions between neighboring views. In multi-view video, such illumination disharmony can primarily occur on account of different camera location and orientation and an imperfect camera calibration. This discrepancy can cause the performance decrease of multi-view video coding(MVC) algorithm. A histogram matching algorithm can be exploited to make up for these differences in a prefiltering step. Once all camera frames of a multi-view sequence are adjusted to a predefined reference through the histogram matching, the coding efficiency of MVC is improved. However general frames of multi-view video sequence are composed of several regions with different color composition and their histogram distribution which are mutually independent of each other. In addition, the location and depth of these objects from sequeuces captured from different cameras can be different with different frames. Thus we propose a new algorithm which classify a image into several subpartitions by its depth information first and then histogram matching is performed for each region individually. Experimental results show that the compression ratio for the proposed algorithm is improved comparing with the conventional image-based algorithms.

Bit-plane based Lossless Depth Map Coding Method (비트평면 기반 무손실 깊이정보 맵 부호화 방법)

  • Kim, Kyung-Yong;Park, Gwang-Hoon;Suh, Doug-Young
    • Journal of Broadcast Engineering
    • /
    • v.14 no.5
    • /
    • pp.551-560
    • /
    • 2009
  • This paper proposes a method for efficient lossless depth map coding for MPEG 3D-Video coding. In general, the conventional video coding method such as H.264 has been used for depth map coding. However, the conventional video coding methods do not consider the image characteristics of the depth map. Therefore, as a lossless depth map coding method, this paper proposes a bit-plane based lossless depth mar coding method by using the MPEG-4 Part 2 shape coding scheme. Simulation results show that the proposed method achieves the compression ratios of 28.91:1. In intra-only coding, proposed method reduces the bitrate by 24.84% in comparison with the JPEG-LS scheme, by 39.35% in comparison with the JPEG-2000 scheme, by 30.30% in comparison with the H.264(CAVLC mode) scheme, and by 16.65% in comparison with the H.264(CABAC mode) scheme. In addition, in intra and inter coding the proposed method reduces the bitrate by 36.22% in comparison with the H.264(CAVLC mode) scheme, and by 23.71% in comparison with the 0.264(CABAC mode) scheme.

MPEG-2 TS Header Extension for Efficient HTTP Adaptive Stream of SVC/MVC (SVC/MVC의 효율적인 HTTP 적응 스트리밍을 위한 MPEG-2 TS 헤더의 확장)

  • Jang, Euy-Doc;Kim, Jae-Gon;Lee, Jin-Young;Kang, Jung-Won;Bae, Seong-Jun
    • Journal of Broadcast Engineering
    • /
    • v.16 no.3
    • /
    • pp.520-529
    • /
    • 2011
  • In this paper, we propose the extension of the MPEG-2 Transport Stream (TS) header for efficient adaptation of multi-layer coded video such as scalable video coding (SVC) and multiview video coding (MVC) in the HTTP streaming. First of all, the limit of the existing TS in terms of flexible adaptation of multi-layer video is investigated, and the signaling by extending TS header is proposed to provide efficient adaptation in a TS level. The proposed extension utilizes the private data field in the adaptation field of TS header to signal scalability and/or view information, which enable us to support diverse adaptation that suits underlying constraints of client capabilities, network conditions and user preferences. In short, the extension enables adaptation of scalable video with full scalability as well as view selection of multiview video in a TS level while keeping backward compatibility with the existing TS syntax/semantics. The performance of the proposed extension is compared with the existing adaptation using PID (packet ID) in terms of efficiency and complexity of adaptation. Furthermore, the increase of TS overhead caused by proposed extension is analyzed and an extension scheme to minimized the overhead is proposed.

Moving Object Detection and Tracking in Multi-view Compressed Domain (비디오 압축 도메인에서 다시점 카메라 기반 이동체 검출 및 추적)

  • Lee, Bong-Ryul;Shin, Youn-Chul;Park, Joo-Heon;Lee, Myeong-Jin
    • Journal of Advanced Navigation Technology
    • /
    • v.17 no.1
    • /
    • pp.98-106
    • /
    • 2013
  • In this paper, we propose a moving object detection and tracking method for multi-view camera environment. Based on the similarity and characteristics of motion vectors and coding block modes extracted from compressed bitstreams, validation of moving blocks, labeling of the validated blocks, and merging of neighboring blobs are performed. To continuously track objects for temporary stop, crossing, and overlapping events, a window based object updating algorithm is proposed for single- and multi-view environments. Object detection and tracking could be performed with an acceptable level of performance without decoding of video bitstreams for normal, temporary stop, crossing, and overlapping cases. The rates of detection and tracking are over 89% and 84% in multi-view environment, respectively. The rates for multi-view environment are improved by 6% and 7% compared to those of single-view environment.

Multi-view Video Coding based on Grid-type Pyramid GOP Structure (격자 피라미드 GOP 구조 기반의 다시점 비디오 부호화 방법)

  • Oh, Kwan-Jung;Oh, Han;Ho, Yo-Sung;Choi, Byeong-Ho
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2005.11a
    • /
    • pp.25-28
    • /
    • 2005
  • 디지틸 멀티미디어 시대를 맞이하여 영상통신 기술이 급속히 발전함에 따라 보다 사실감과 몰입감을 줄 수 있는 3차원 입체 영상처리에 대한 관심이 커지고 있다. 최근 국내외 연구기관에서 다차원 멀티미디어 서비스 개발을 위한 연구가 활발히 진행되고 있으며, MPEG 표준화 그룹에서도 H.264/AVC 압축 방법을 이용한 다시점 비디오 부호화(multi-view video coding, MVC) 방법들이 제안되었다. 본 논문에서는 격자 피라미드 GOP 구조 기반의 다시점 비디오 부호화 방법에 대해 기술하였다. 이 방법은 현재 MPEG 표준화 그룹에서 권고된 ‘Anchor’ 방법에서 고려치 못한 인접 시점간의 공간적인 상관도를 효과적으로 활용하기 위해 격자 GOP구조를 제안했고, 각 시점에 대한 효율적인 부호화를 위해 계층적 피라미드 GOP 구조를 이용하였다. 또한, 공간적인 예측의 경우에 시점간의 전체 변이 (global disparity)를 고려하여 가변적인 탐색 범위를 이용하였다. 본 논문에서 제안한 방법은 현재 MPEG에서 성능 평가의 기준이 되는 ‘Anchor’ 방법에 비해 동일 비트율에서 0.5${\sim}$0.8 dB 정도의 성능 향상을 보였다.

  • PDF