• Title/Abstract/Keywords: three-dimensional video

229 search results, processing time 0.021 s

An Embedded Video Compression Scheme Using a Three-Dimensional Rate-Distortion Optimization Based Block Coder

  • 양창모;정광수
    • 한국통신학회논문지
    • /
    • Vol. 41, No. 10
    • /
    • pp.1155-1166
    • /
    • 2016
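  • In this paper, we propose a new embedded video compression method that uses block coding based on three-dimensional rate-distortion optimization. The proposed method applies motion-compensated temporal filtering (MCTF) to the input video frames to remove temporal redundancy, and then applies a two-dimensional discrete wavelet transform to the frames to remove spatial redundancy. The resulting three-dimensional wavelet coefficients are sorted by their expected rate-distortion ratio and coded with a three-dimensional block-partitioning coder. The proposed method also uses a technique for coding color video effectively while preserving the embedded property, together with an efficient rate-control method. Experimental results show that the proposed method generates an embedded bitstream while providing better performance than existing video compression methods.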
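
The temporal filtering step named in the abstract can be illustrated with a minimal sketch. It assumes zero motion (no motion compensation) and a Haar filter implemented by lifting; the function names are hypothetical and this is not the authors' implementation.

```python
def haar_temporal_split(frame_a, frame_b):
    """One lifting step of a temporal Haar filter on two equally sized frames.

    Assumption: zero motion between the frames (no motion compensation).
    Returns (low, high): the temporal average and the temporal difference.
    """
    # Predict step: the high band is the difference between the two frames.
    high = [[b - a for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(frame_a, frame_b)]
    # Update step: the low band is the average of the two frames.
    low = [[a + h / 2 for a, h in zip(row_a, row_h)]
           for row_a, row_h in zip(frame_a, high)]
    return low, high


def haar_temporal_merge(low, high):
    """Invert haar_temporal_split exactly by undoing the lifting steps."""
    frame_a = [[l - h / 2 for l, h in zip(row_l, row_h)]
               for row_l, row_h in zip(low, high)]
    frame_b = [[a + h for a, h in zip(row_a, row_h)]
               for row_a, row_h in zip(frame_a, high)]
    return frame_a, frame_b


frame_a = [[10, 12], [14, 16]]
frame_b = [[10, 13], [14, 20]]
low, high = haar_temporal_split(frame_a, frame_b)
restored = haar_temporal_merge(low, high)
```

Where the two frames are similar, the high band is mostly zeros, which is the temporal redundancy that the later coding stages exploit.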

The fast DCT algorithm based on the new prime factor and common factor decomposition

  • Choi, Byeong-Ho;Kim, Jong-Uk;Suh, Ki-Bum;Chong, Jong-Wha;Bang, Gyo-Yoon
    • 제어로봇시스템학회: Conference Proceedings
    • /
    • 제어로봇시스템학회, Proceedings of the 1992 Korean Automatic Control Conference (International Session); KOEX, Seoul; 19-21 Oct. 1992
    • /
    • pp.245-250
    • /
    • 1992
  • In this paper, we present a new algorithm for the fast computation of the discrete cosine transform (DCT). This algorithm consists of the three-dimensional prime factor-decomposed algorithm (PFA) and the three-dimensional common factor-decomposed algorithm (CFA). We can compute an N-point DCT for a number N decomposable into three relatively prime numbers using the PFA, and into three common factors using the CFA. We also show the input and output index mappings for the three-factor decomposition. The algorithm requires fewer multiplications than previous algorithms. In particular, for large N, it is more powerful in reducing the number of multiplications.
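
The abstract does not give the paper's DCT-specific index mappings. As a hedged illustration of the general idea, the sketch below uses the Good-Thomas style mapping familiar from prime-factor DFT algorithms, which turns a one-dimensional index into three coordinates when the factors are pairwise relatively prime. The function name is hypothetical.

```python
from itertools import product
from math import gcd


def prime_factor_index_map(n1, n2, n3):
    """Map (i1, i2, i3) to a 1-D index for N = n1 * n2 * n3.

    Uses the Good-Thomas style mapping
        n = (i1 * n2 * n3 + i2 * n1 * n3 + i3 * n1 * n2) mod N,
    which is one-to-one only when the factors are pairwise coprime.
    Assumption: this is the generic prime-factor mapping, not the
    DCT-specific mapping derived in the paper.
    """
    if gcd(n1, n2) != 1 or gcd(n1, n3) != 1 or gcd(n2, n3) != 1:
        raise ValueError("factors must be pairwise relatively prime")
    total = n1 * n2 * n3
    return {
        (i1, i2, i3): (i1 * n2 * n3 + i2 * n1 * n3 + i3 * n1 * n2) % total
        for i1, i2, i3 in product(range(n1), range(n2), range(n3))
    }


# N = 30 = 2 * 3 * 5: every index 0..29 is hit exactly once.
mapping = prime_factor_index_map(2, 3, 5)
```

Because the mapping is a bijection, a length-30 transform can be reorganized as a 2 x 3 x 5 array and processed along each axis with short transforms, which is where the savings in multiplications come from.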

Graphical Video Representation for Scalability

  • Jinzenji, Kumi;Kasahara, Hisashi
    • 한국방송∙미디어공학회: Conference Proceedings
    • /
    • 한국방송공학회, 1996 Proceedings International Workshop on New Video Media Technology
    • /
    • pp.29-34
    • /
    • 1996
  • This paper proposes a new concept in video called Graphical Video. Graphical Video is a content-based and scalable video representation. A video consists of several elements such as moving images, still images, graphics, characters and charts. All of these elements can be represented graphically except moving images. It is desirable to transform these moving images into graphical elements so that they can be treated in the same way as other graphical elements. To achieve this, we propose a new graphical representation of moving images using spatio-temporal clusters, which consist of texture and contours. The texture is described by three-dimensional fractal coefficients, while the contours are described by polygons. We propose a method that gives domain pool location and size as a means to describe cluster texture within or near a region of clusters. Results of an experiment on texture quality confirm that the method provides sufficiently high SNR as compared to that in the original three-dimensional fractal approximation.

Technical Improvement Using a Three-Dimensional Video System for Laparoscopic Partial Nephrectomy

  • Komatsuda, Akari;Matsumoto, Kazuhiro;Miyajima, Akira;Kaneko, Gou;Mizuno, Ryuichi;Kikuchi, Eiji;Oya, Mototsugu
    • Asian Pacific Journal of Cancer Prevention
    • /
    • Vol. 17, No. 5
    • /
    • pp.2475-2478
    • /
    • 2016
  • Background: Laparoscopic partial nephrectomy is one of the major surgical techniques for small renal masses. However, it is difficult to manage cutting and suturing procedures within acceptable time periods. To overcome this difficulty, we applied a three-dimensional (3D) video system to laparoscopic partial nephrectomy and evaluated its utility. Materials and Methods: We retrospectively enrolled 31 patients who underwent laparoscopic partial nephrectomy between November 2009 and June 2014. A conventional two-dimensional (2D) video system was used in 20 patients, and a 3D video system in 11. Patient characteristics and video system type (2D or 3D) were recorded, and correlations with perioperative outcomes were analyzed. Results: Mean age of the patients was 55.8 ± 12.4, mean body mass index was 25.7 ± 3.9 kg/m², mean tumor size was 2.0 ± 0.8 cm, mean R.E.N.A.L. nephrometry score was 6.9 ± 1.9, and clinical stage was T1a in all patients. There were no significant differences between the two groups in operative time (p=0.348), pneumoperitoneum time (p=0.322), cutting time (p=0.493), estimated blood loss (p=0.335), or the rate of complications of Clavien grade >II (p=0.719). However, warm ischemic time was significantly shorter in the 3D group than in the 2D group (16.1 min vs. 21.2 min, p=0.021), which resulted from a shorter suturing time (9.1 min vs. 15.2 min, p=0.008). No open conversion occurred in either group. Conclusions: A 3D video system allows the shortening of warm ischemic time in laparoscopic partial nephrectomy and thus may be useful in improving the procedure.

Development of Three Dimensional Vision Using a Color T.V. Set

  • 김철중;정상수
    • 비파괴검사학회지
    • /
    • Vol. 5, No. 1
    • /
    • pp.3-8
    • /
    • 1985
  • Three-dimensional vision is obtained through stereoscopic viewing with a modified commercial TV set and matching color-filter glasses. Two video signals from two CCTV cameras are connected to the RGB (red, green, blue) inputs of the picture tube, with a different color selected for each video signal. A synchronizing signal drives one CCTV camera and the color TV set, while a delayed synchronizing signal drives the other CCTV camera, shifting its image on the display. This shift is used in correcting image distortion.
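
The color-multiplexing idea in the abstract can be sketched in software. The example below is a rough analogue only: it assumes a red/cyan channel assignment (the abstract does not say which colors were used) and models the delayed synchronizing signal as a horizontal pixel shift. The function name and parameters are hypothetical.

```python
def compose_anaglyph(left, right, shift=0):
    """Combine two grayscale images into one RGB image.

    left and right are lists of rows of 0-255 intensities with equal shape.
    Assumption: the left view drives the red channel and the right view,
    shifted horizontally by `shift` pixels, drives green and blue (cyan).
    Pixels shifted in from outside the image are set to 0.
    """
    height, width = len(left), len(left[0])
    image = []
    for y in range(height):
        row = []
        for x in range(width):
            src = x - shift  # column of the right view shown at column x
            r_val = right[y][src] if 0 <= src < width else 0
            row.append((left[y][x], r_val, r_val))
        image.append(row)
    return image


left = [[100, 110, 120]]
right = [[10, 20, 30]]
anaglyph = compose_anaglyph(left, right, shift=1)
# anaglyph == [[(100, 0, 0), (110, 10, 10), (120, 20, 20)]]
```

Viewed through matching color filters, each eye sees only one of the two source images, which is what produces the stereoscopic effect.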

Wavelet based Embedded Video Coding with 3-D Block Partition

  • 양창모;임태범;이석필
    • 대한전자공학회: Conference Proceedings
    • /
    • 대한전자공학회, Proceedings of the 2003 Signal Processing Society Fall Conference
    • /
    • pp.133-136
    • /
    • 2003
  • In this paper, we propose a low bit-rate embedded video coding scheme with 3-D block partition in the wavelet domain. The proposed video coding scheme includes multi-level three-dimensional dyadic wavelet decomposition, raster scanning within each subband, partitioning of blocks, and adaptive arithmetic entropy coding. Although the proposed video coding scheme is quite simple, it produces bit-streams with good features, including SNR scalability from the embedded nature. Experimental results demonstrate that the proposed video coding scheme is competitive with other good wavelet-based video coders in the literature.
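
The block-partitioning step can be illustrated with a small sketch. It assumes a simple octant split driven by a magnitude threshold, in the spirit of set-partitioning coders; the abstract does not specify the actual partitioning rule, so the function name and splitting strategy are hypothetical.

```python
def find_significant(coeffs, threshold, origin=(0, 0, 0), size=None):
    """Recursively split a cubic block into octants to locate coefficients
    whose magnitude is at least `threshold`.

    coeffs is a cube indexed as coeffs[t][y][x] with a power-of-two side.
    Returns (positions, tests): the significant positions and the number of
    block significance tests performed.
    """
    if size is None:
        size = len(coeffs)
    t0, y0, x0 = origin
    significant = any(
        abs(coeffs[t][y][x]) >= threshold
        for t in range(t0, t0 + size)
        for y in range(y0, y0 + size)
        for x in range(x0, x0 + size)
    )
    if not significant:
        return [], 1  # the whole block is dismissed with a single test
    if size == 1:
        return [origin], 1
    half = size // 2
    positions, tests = [], 1
    for dt in (0, half):
        for dy in (0, half):
            for dx in (0, half):
                sub_pos, sub_tests = find_significant(
                    coeffs, threshold, (t0 + dt, y0 + dy, x0 + dx), half)
                positions += sub_pos
                tests += sub_tests
    return positions, tests


# A 4x4x4 block of wavelet coefficients with a single large value.
block = [[[0] * 4 for _ in range(4)] for _ in range(4)]
block[1][2][3] = 40
positions, tests = find_significant(block, threshold=32)
```

In this example the one significant coefficient is located with 17 block tests instead of 64 per-coefficient tests, which shows why partitioning pays off when most coefficients are insignificant at a given threshold.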

The Implementation of Information Providing Method System for Indoor Area by using the Immersive Media's Video Information

  • 이상윤;안희학
    • 디지털산업정보학회논문지
    • /
    • Vol. 12, No. 3
    • /
    • pp.157-166
    • /
    • 2016
  • This paper presents interior space information using 6D-360 degree immersive media video information. We implement augmented reality that includes a variety of information, such as position information and movement information for specific locations in interior spaces that GPS signals do not reach. Augmented reality containing the 6D-360 degree immersive media video information provides position information and three-dimensional spatial image information, so that a user's exact location can be identified in the interior of a moving object as well as in a fixed interior space. The paper builds a three-dimensional image database based on the 6D-360 degree immersive media video information and provides an augmented reality service. By mapping various information onto the 6D-360 degree immersive media video information, the user can inspect the plant in an environment that matches the actual one. The paper also proposes an augmented reality service for emergency escape and repair, aimed at passengers and employees.

A Novel Selective Frame Discard Method for 3D Video over IP Networks

  • Chung, Young-Uk
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • Vol. 4, No. 6
    • /
    • pp.1209-1221
    • /
    • 2010
  • Three dimensional (3D) video is expected to be an important application for broadcast and IP streaming services. One of the main limitations for the transmission of 3D video over IP networks is network bandwidth mismatch due to the large size of 3D data, which causes fatal decoding errors and mosaic-like damage. This paper presents a novel selective frame discard method to address the problem. The main idea of the proposed method is the symmetrical discard of the two dimensional (2D) video frame and the depth map frame. Also, the frames to be discarded are selected after additional consideration of the playback deadline, the network bandwidth, and the inter-frame dependency relationship within a group of pictures (GOP). It enables the efficient utilization of the network bandwidth and high quality 3D IPTV service. The simulation results demonstrate that the proposed method enhances the media quality of 3D video streaming even in the case of bad network conditions.
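
A minimal sketch of the selection logic described in the abstract follows. It assumes a greedy policy (drop late frames, then B frames, then P frames from the end of the GOP) and a hypothetical frame record layout; the paper's actual selection rule is not given in the abstract. It keeps the one property the abstract states explicitly: a 2D frame and its depth map frame are discarded together.

```python
def select_frames(gop, budget_bytes, now_ms):
    """Choose which frame pairs of one GOP to send.

    Each entry of `gop` is a dict with keys 'type' ('I', 'P' or 'B'),
    'color' and 'depth' (sizes in bytes) and 'deadline' (ms).
    A color frame and its depth frame are always kept or dropped together.
    Returns the indices of the frame pairs to send, in display order.
    Simplification: a frame dropped for lateness is not propagated to the
    frames that depend on it.
    """
    def pair_size(i):
        return gop[i]["color"] + gop[i]["depth"]

    # Frames that already missed their playback deadline are not sent.
    keep = [i for i, f in enumerate(gop) if f["deadline"] >= now_ms]

    # Drop order: B frames first (nothing depends on them), then P frames
    # from the end of the GOP (later P frames have fewer dependants).
    droppable = [i for i in reversed(keep) if gop[i]["type"] == "B"]
    droppable += [i for i in reversed(keep) if gop[i]["type"] == "P"]

    total = sum(pair_size(i) for i in keep)
    for i in droppable:
        if total <= budget_bytes:
            break
        keep.remove(i)
        total -= pair_size(i)
    return keep


gop = [
    {"type": "I", "color": 50, "depth": 20, "deadline": 100},
    {"type": "B", "color": 10, "depth": 5, "deadline": 140},
    {"type": "B", "color": 10, "depth": 5, "deadline": 180},
    {"type": "P", "color": 30, "depth": 10, "deadline": 220},
]
sent = select_frames(gop, budget_bytes=120, now_ms=0)
```

With a 120-byte budget the sketch drops both B-frame pairs and sends the I and P pairs, so the remaining stream stays decodable and the color and depth streams stay aligned.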

Human Activity Recognition Based on 3D Residual Dense Network

  • Park, Jin-Ho;Lee, Eung-Joo
    • 한국멀티미디어학회논문지
    • /
    • Vol. 23, No. 12
    • /
    • pp.1540-1551
    • /
    • 2020
  • To address the problem that existing human behavior recognition algorithms cannot fully utilize the multi-level spatio-temporal information of the network, a human behavior recognition algorithm based on a dense three-dimensional residual network is proposed. First, the proposed algorithm uses a three-dimensional residual dense block as the basic module of the network; the module extracts hierarchical features of human behavior through densely connected convolutional layers. Second, a local feature aggregation adaptive method is used to learn the local dense features of human behavior. Third, a residual connection module is applied to promote the flow of feature information and reduce the difficulty of training. Finally, multi-layer local feature extraction is realized by cascading multiple three-dimensional residual dense blocks, and a global feature aggregation adaptive method is used to learn the features of all network layers to realize human behavior recognition. Extensive experiments on the KTH benchmark dataset show that the recognition rate (top-1 accuracy) of the proposed algorithm reaches 93.52%, an improvement of 3.93 percentage points over the three-dimensional convolutional neural network (C3D) algorithm. The proposed framework has good robustness and transfer learning ability, and can effectively handle a variety of video behavior recognition tasks.

Development of noncontact velocity tracking algorithm for 3-dimensional high speed flows using digital image processing technique

  • 도덕희
    • Journal of Advanced Marine Engineering and Technology
    • /
    • Vol. 23, No. 2
    • /
    • pp.259-269
    • /
    • 1999
  • A new algorithm for measuring the 3-D velocity components of high-speed flows was developed using a digital image processing technique. The measuring system consists of three CCD cameras, an optical instrument called an AOM, a digital image grabber, and a host computer. The images of moving particles arranged spatially on a rotating plate are taken by two or three CCD cameras and are recorded onto the image grabber or a video tape recorder. The three-dimensional velocity components of the particles are obtained automatically by the developed algorithm. To verify the validity of this technique, three-dimensional velocity data sets obtained from a computer simulation of a backward-facing step flow were used as test data for the algorithm. An uncertainty analysis associated with the present algorithm is systematically evaluated. The present technique is shown to be usable as a tool for the measurement of unsteady three-dimensional fluid flows.
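
The abstract does not describe the tracking algorithm itself. As a hedged illustration of how velocity vectors follow from particle positions, the sketch below assumes the 3-D positions have already been reconstructed from the camera images and matches particles between two instants by nearest neighbour. The function name and the `max_step` limit are hypothetical.

```python
from math import dist


def track_velocities(positions_t0, positions_t1, dt, max_step):
    """Estimate 3-D velocities by nearest-neighbour matching.

    positions_t0 / positions_t1: lists of (x, y, z) particle positions
    reconstructed at two consecutive instants, dt seconds apart.
    A particle is matched to the closest unused position at the next
    instant if it lies within max_step; otherwise it is left unmatched.
    Returns a list of (start_position, velocity_vector) tuples.
    """
    unused = list(positions_t1)
    result = []
    for p0 in positions_t0:
        if not unused:
            break
        p1 = min(unused, key=lambda q: dist(p0, q))
        if dist(p0, p1) <= max_step:
            unused.remove(p1)
            velocity = tuple((b - a) / dt for a, b in zip(p0, p1))
            result.append((p0, velocity))
    return result


before = [(0.0, 0.0, 0.0), (10.0, 0.0, 0.0)]
after = [(10.0, 1.0, 0.0), (1.0, 0.0, 0.0)]
vectors = track_velocities(before, after, dt=0.5, max_step=2.0)
```

Nearest-neighbour matching is only reliable when particle displacements between frames are small compared with the particle spacing, which is one reason high-speed flows need short, controlled exposure intervals.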
