• Title/Summary/Keyword: 2D Video

Search Result 910, Processing Time 0.03 seconds

Coding Technique using Depth Map in 3D Scalable Video Codec (확장된 스케일러블 비디오 코덱에서 깊이 영상 정보를 활용한 부호화 기법)

  • Lee, Jae-Yung;Lee, Min-Ho;Chae, Jin-Kee;Kim, Jae-Gon;Han, Jong-Ki
    • Journal of Broadcast Engineering
    • /
    • v.21 no.2
    • /
    • pp.237-251
    • /
    • 2016
  • The conventional 3D-HEVC uses the depth data of the other view instead of that of the current view because the texture data has to be encoded before the corresponding depth data of the current view has been encoded, where the depth data of the other view is used as the predicted depth for the current view. Whereas the conventional 3D-HEVC has no other candidate for the predicted depth information except for that of the other view, the scalable 3D-HEVC utilizes the depth data of the lower spatial layer whose view ID is equal to that of the current picture. The depth data of the lower spatial layer is up-scaled to the resolution of the current picture, and then the enlarged depth data is used as the predicted depth information. Because the quality of the enlarged depth is much higher than that of the depth of the other view, the proposed scheme increases the coding efficiency of the scalable 3D-HEVC codec. Computer simulation results show that the scalable 3D-HEVC is useful and the proposed scheme to use the enlarged depth data for the current picture provides the significant coding gain.

A Study for properties of Subdivision to 3D game character education (3D 게임 캐릭터 교육을 위한 Subdivision 특성 연구 (3ds Max의 Open subdivision을 중심으로))

  • Cho, Hyung-ik
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2016.10a
    • /
    • pp.210-212
    • /
    • 2016
  • Today, video games created via 3D softwares become a core part of the essential the video game contents field because their properties that can produce more easier than 2D games and can save budget of contents makings. It is very important that reducing polygon counts of 3D characters and environments for Gaming optimization. We can formulate elaborate 3D game models with low polygon counting in virtue of technological advancements, and these technologies continue to evolve. In 2012, Pixar made public Open subdivision which is the new technology to make high quality 3D models with low polygons and distributed that via Open source verification. This paper will compare and analyze the characteristics, and merits and demerits of these various kinds of these skills(Mesh smooth, Turbo Smooth, Open subdivision) and will inquire which method is the most efficient one to make 3D video games.

  • PDF

Inactive Regions Padding Methods for Rotated Sphere Projection of 360 Video

  • Yoon, Yong-Uk;Kim, Hyun-Ho;Kim, Jae-Gon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2018.06a
    • /
    • pp.200-201
    • /
    • 2018
  • In the workflow of 360 video coding of JVET (Joint Video Experts Team), firstly the 360 videos are projected onto the 2D plane with diverse projection formats, such as Equi-Rectangular Projection (ERP), Cubemap Projection (CMP), Rotated Sphere Projection (RSP), etc. The projection format of RSP has inactive regions in the converted 2D plane. The inactive regions may cause visual artifact as well as the reduction of the coding efficiency due to discontinuity at boundaries between active and inactive regions. In this paper, to overcome these problems, the inactive regions are padded by using two types of adjacent pixels. Then padded regions of RSP are blended with inactive regions padded by proposed method. The experimental results demonstrate that, in terms of end-to-end WS-PSNR-NN, the proposed method achieves 0.1% BD-rate reduction. In addition, the visual artifacts along the borders between discontinuous faces are noticeably reduced.

  • PDF

3D Human Reconstruction from Video using Quantile Regression (분위 회귀 분석을 이용한 비디오로부터의 3차원 인체 복원)

  • Han, Jisoo;Park, In Kyu
    • Journal of Broadcast Engineering
    • /
    • v.24 no.2
    • /
    • pp.264-272
    • /
    • 2019
  • In this paper, we propose a 3D human body reconstruction and refinement method from the frames extracted from a video to obtain natural and smooth motion in temporal domain. Individual frames extracted from the video are fed into convolutional neural network to estimate the location of the joint and the silhouette of the human body. This is done by projecting the parameter-based 3D deformable model to 2D image and by estimating the value of the optimal parameters. If the reconstruction process for each frame is performed independently, temporal consistency of human pose and shape cannot be guaranteed, yielding an inaccurate result. To alleviate this problem, the proposed method analyzes and interpolates the principal component parameters of the 3D morphable model reconstructed from each individual frame. Experimental result shows that the erroneous frames are corrected and refined by utilizing the relation between the previous and the next frames to obtain the improved 3D human reconstruction result.

Stereo Matching Algorithm Based on Fast Guided Image Filtering for 3-Dimensional Video Service (3차원 비디오 서비스를 위한 고속 유도 영상 필터링 기반 스테레오 매칭 알고리즘)

  • Hong, Gwang-Soo;Kim, Byung-Gyu
    • Journal of Digital Contents Society
    • /
    • v.17 no.6
    • /
    • pp.523-529
    • /
    • 2016
  • Stereo matching algorithm is an essential part in computer vision and photography. Accuracy and computational complexity are challenges of stereo matching algorithm. Much research has been devoted to stereo matching based on cost volume filtering of matching costs. Local stereo matching based guided image filtering (GIF) has a computational complexity of O(N), but is still not enough to provide real-time 3-dimensional (3-D) video services. The proposed algorithm concentrates reduction of computational complexity using the concept of fast guided image filter, which increase the speed up to $O(N/\small{s}^2)$ with a sub-sampling ratio $\small{s}$. Experimental results indicated that the proposed algorithm achieves effective local stereo matching as well as a fast execution time for 3-D video service.

Content-based Video Information Retrieval and Streaming System using Viewpoint Invariant Regions

  • Park, Jong-an
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.2 no.1
    • /
    • pp.43-50
    • /
    • 2009
  • This paper caters the need of acquiring the principal objects, characters, and scenes from a video in order to entertain the image based query. The movie frames are divided into frames with 2D representative images called "key frames". Various regions in a key frame are marked as key objects according to their textures and shapes. These key objects serve as a catalogue of regions to be searched and matched from rest of the movie, using viewpoint invariant regions calculation, providing the location, size, and orientation of all the objects occurring in the movie in the form of a set of structures collaborating as video profile. The profile provides information about occurrences of every single key object from every frame of the movie it exists in. This information can further ease streaming of objects over various network-based viewing qualities. Hence, the method provides an effective reduced profiling approach of automatic logging and viewing information through query by example (QBE) procedure, and deals with video streaming issues at the same time.

  • PDF

A Stereo Video Avatar for Supporting Visual Communication in a $CAVE^{TM}$-like System ($CAVE^{TM}$-like 시스템에서 시각 커뮤니케이션 지원을 위한 스테레오 비디오 아바타)

  • Rhee Seon-Min;Park Ji-Young;Kim Myoung-Hee
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.33 no.6
    • /
    • pp.354-362
    • /
    • 2006
  • This paper suggests a method for generating high qualify stereo video avatar to support visual communication in a CAVE$^{TM}$-like system. In such a system because of frequent change of light projected onto screens around user, it is not easy to extract user silhouette robustly, which is an essential step to generate a video avatar. In this study, we use an infrared reflective image acquired by a grayscale camera with a longpass filter so that the change of visible light on a screen is blocked to extract robust user silhouette. In addition, using two color cameras positioned at a distance of a binocular disparity of human eyes, we acquire two stereo images of the user for fast generation and stereoscopic display of a high quality video avatar without 3D reconstruction. We also suggest a fitting algorithm of a silhouette mask on an infrared reflective image into an acquired color image to remove background. Generated stereo images of a video avatar are texture mapped into a plane in virtual world and can be displayed in stereoscopic using frame sequential stereo method. Suggested method have advantages that it generates high quality video avatar taster than 3D approach and it gives stereoscopic feeling to a user 2D based approach can not provide.

Multi-view Video Coding using View Interpolation (영상 보간을 이용한 다시점 비디오 부호화 방법)

  • Lee, Cheon;Oh, Kwan-Jung;Ho, Yo-Sung
    • Journal of Broadcast Engineering
    • /
    • v.12 no.2
    • /
    • pp.128-136
    • /
    • 2007
  • Since the multi-view video is a set of video sequences captured by multiple array cameras for the same three-dimensional scene, it can provide multiple viewpoint images using geometrical manipulation and intermediate view generation. Although multi-view video allows us to experience more realistic feeling with a wide range of images, the amount of data to be processed increases in proportion to the number of cameras. Therefore, we need to develop efficient coding methods. One of the possible approaches to multi-view video coding is to generate an intermediate image using view interpolation method and to use the interpolated image as an additional reference frame. The previous view interpolation method for multi-view video coding employs fixed size block matching over the pre-determined disparity search range. However, if the disparity search range is not proper, disparity error may occur. In this paper, we propose an efficient view interpolation method using initial disparity estimation, variable block-based estimation, and pixel-level estimation using adjusted search ranges. In addition, we propose a multi-view video coding method based on H.264/AVC to exploit the intermediate image. Intermediate images have been improved about $1{\sim}4dB$ using the proposed method compared to the previous view interpolation method, and the coding efficiency have been improved about 0.5 dB compared to the reference model.

Visual Problems and Refractive Error at Video Display Termianls (VDT사용자의 시기능 불편과 굴절이상)

  • Seo, Y.W.;Choe, Y.J.
    • Journal of Korean Ophthalmic Optics Society
    • /
    • v.3 no.1
    • /
    • pp.75-86
    • /
    • 1998
  • The purpose of this study is to evaluate the effects of continuing work on VDT(video display terminal), therefore this study examined visual fatigue, unaided visual acuity, refractive error, accommodation and horizontal phoria of 152 subjects who did two hour long VDT work. For the ocular symptoms, the greatest number was tired eyes accounting for 45.71%. In the visual symptoms, blurred vision was the hightest rate of 80.39% and in case of systemic symptoms shoulder pain was 33.33% marked top ranking. The average of near visual acuity decresed almost 10% from 0.47 to 0.42, but refractive error increased about 0.10D to the direction of myopic shift. The amplitude of accommodation decreased approximately 0.72D from 7.46D to 6.74D. Accommodation facility was delayed from 2.27 second to 2.50 second, the amplitude of positive relative accommodation was decreased from 4.76D to 4.16D and the amplitude of negative relative accommodation was decreased from 2.46D to 2.33D. The horizontal phoria shifted to the direction of esophoria from $1.82{\Delta}$ to $3.24{\Delta}$.

  • PDF

MPEG Video-based Point Cloud Compression 표준 소개

  • Jang, Ui-Seon
    • Broadcasting and Media Magazine
    • /
    • v.26 no.2
    • /
    • pp.18-30
    • /
    • 2021
  • 본 고에서는 최근 국제표준으로 완성된 MPEG Video-based Point Cloud Compression(V-PCC) 표준 기술에 대해 소개하고자 한다. AR/VR 등 새로운 미디어 응용의 출현과 함께 그 관심이 3D 그래픽 데이터에 더 많이 모아지는 가운데, 지금까지는 효율적인 압축에 관심이 높지 않았던 포인트 클라우드 데이터의 표준 압축 기술로 만들어진 V-PCC 표준의 표준화 현황과 주요 응용분야, 그리고 주요 압축 기술에 대하여 살펴보고자 한다.