• Title/Summary/Keyword: three-dimensional video


Human Gait Recognition Based on Spatio-Temporal Deep Convolutional Neural Network for Identification

  • Zhang, Ning;Park, Jin-ho;Lee, Eung-Joo
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.8
    • /
    • pp.927-939
    • /
    • 2020
  • Gait recognition can identify people from a long distance, which is very important for improving the intelligence of monitoring systems. Among many human features, gait features have the advantages of being remotely available, robust, and secure. Traditional gait feature extraction, affected by the development of behavior recognition, can only rely on manual feature extraction, which cannot meet the needs of fine-grained gait recognition. The emergence of deep convolutional neural networks has freed researchers from complex feature design engineering, because such networks can automatically learn usable features from data, and they have been widely adopted. In this paper, we conduct feature metric learning in three-dimensional space by combining the three-dimensional convolution features of the gait sequence with the Siamese structure. This method can capture information in both the spatial and temporal dimensions from the continuous periodic gait sequence, and further improves the accuracy and practicability of gait recognition.

Three Dimensional Networked Virtual Reality Architecture Enabling Flexible Configuration Based on Function Distribution

  • Yasuyuki-KIYOSUE;Shohei-SUGAWARA;Shigeki-MASAKI;Susumu-ICHINOSE
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1999.06a
    • /
    • pp.23.1-28
    • /
    • 1999
  • InterSpace™ is an advanced networked virtual reality system that presents shared three-dimensional computer graphics (CG) worlds via the Internet where multiple users can enjoy synchronous communications with voice, video and text. Users can control their avatars as a surrogate interface. In InterSpace, users can walk around and interact with other people and with contents. In this paper, we describe the function-distributed architecture used in InterSpace. The architecture enables flexible configuration of server functions and load distribution. It also allows users to select media and client PCs to switch servers dynamically.

CROSS FLOW EFFECTS ON THE FLAME HEIGHT OF AN INTERMEDIATE SCALE DIFFUSION FLAME

  • Kolb, Gilles;Torero, Jose L.;Most, Jean-Michel;Joulain, Pierre
    • Proceedings of the Korea Institute of Fire Science and Engineering Conference
    • /
    • 1997.11a
    • /
    • pp.169-177
    • /
    • 1997
  • An experimental study has been conducted at an intermediate scale to study the effect of a cross flow on a purely buoyant fire. Video taping of the flame and post processing of the images by means of a novel technique provide a contour of a mean flame for all cases studied. This flame contour allows the determination of a mean flame length and a mean flame height. The mean flame length and height are recorded as functions of the forced flow velocity. Three dimensional flow patterns are formed in the flame trailing edge affecting both the mean flame length and height. The three dimensional patterns are studied systematically as functions of the cross flow velocity to quantify the effect of confinement on the flame geometry.


Fast Hierarchical Search Method for Multi-view Video Coding (다시점 비디오 부호화를 위한 고속 계층적 탐색 기법)

  • Yoon, Hyo-Sun;Kim, Mi-Young
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.2 no.7
    • /
    • pp.495-502
    • /
    • 2013
  • Motion estimation (ME), which affects both image quality and encoding speed, was developed to reduce temporal redundancy in video sequences and plays an important role in digital video compression. However, it is a computationally demanding part of the encoder. Multi-view video is obtained by capturing one three-dimensional scene with many cameras at different positions. ME for multi-view video requires high computational complexity. To reduce computational complexity while maintaining image quality, a fast motion estimation method is proposed in this paper. The proposed method uses a hierarchical search strategy that consists of a modified diamond search pattern, a multi-grid diamond search pattern, and a raster search pattern. These search patterns either place search points symmetrically and evenly so that they cover the overall search area and avoid falling into a local minimum, or exploit the distribution characteristics of motion vectors to place the search points. Experimental results show that the proposed method is up to 1.2 to 3 times faster than the TZ search method (JMVC) while maintaining similar video quality and bit rates.
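As a rough illustration of pattern-based block matching, the sketch below implements the classic two-stage diamond search (a large diamond pattern repeated until the best point is the centre, then one small diamond refinement). It is not the modified diamond, multi-grid diamond, or raster pattern of the paper; the function names, the 4x4 block size, and the sum-of-absolute-differences cost are assumptions chosen for the example.

```python
# Classic diamond search for one block; frames are lists of rows of integers.
LARGE_DIAMOND = [(0, 0), (0, -2), (0, 2), (-2, 0), (2, 0),
                 (-1, -1), (1, -1), (-1, 1), (1, 1)]
SMALL_DIAMOND = [(0, 0), (0, -1), (0, 1), (-1, 0), (1, 0)]


def sad(cur, ref, bx, by, dx, dy, n):
    """Sum of absolute differences between the n x n block of `cur` at
    (bx, by) and the block of `ref` displaced by (dx, dy).
    Returns None when the displaced block leaves the reference frame."""
    h, w = len(ref), len(ref[0])
    x0, y0 = bx + dx, by + dy
    if x0 < 0 or y0 < 0 or x0 + n > w or y0 + n > h:
        return None
    total = 0
    for j in range(n):
        for i in range(n):
            total += abs(cur[by + j][bx + i] - ref[y0 + j][x0 + i])
    return total


def best_offset(cur, ref, bx, by, n, centre, pattern):
    """Evaluate every pattern point around `centre`; ties keep the earlier
    point, so the centre (listed first) wins unless something is strictly better."""
    best, best_cost = centre, None
    for ox, oy in pattern:
        cand = (centre[0] + ox, centre[1] + oy)
        cost = sad(cur, ref, bx, by, cand[0], cand[1], n)
        if cost is not None and (best_cost is None or cost < best_cost):
            best, best_cost = cand, cost
    return best, best_cost


def diamond_search(cur, ref, bx, by, n=4, max_steps=16):
    """Return ((dx, dy), cost) for the block of `cur` at (bx, by)."""
    centre = (0, 0)
    for _ in range(max_steps):
        best, _cost = best_offset(cur, ref, bx, by, n, centre, LARGE_DIAMOND)
        if best == centre:  # minimum sits at the centre: stop coarse search
            break
        centre = best
    return best_offset(cur, ref, bx, by, n, centre, SMALL_DIAMOND)
```

Because each step only compares a handful of candidate points, the search visits far fewer positions than a full search, at the risk of stopping in a local minimum; that trade-off is what the hierarchical patterns in the abstract are meant to manage.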

3D Conversion of 2D H.264 Video (2D H.264 동영상의 3D 입체 변환)

  • Hong, Ho-Ki;Baek, Yun-Ki;Lee, Seung-Hyun;Kim, Dong-Wook;Yoo, Ji-Sang
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.12C
    • /
    • pp.1208-1215
    • /
    • 2006
  • In this paper, we propose an algorithm that creates three-dimensional (3D) stereoscopic video from two-dimensional (2D) video encoded by H.264 instead of using the conventional stereo-camera process. In most videos encoded by MPEG standards, the motion information of each frame can be obtained from the given motion vectors. In particular, H.264 streams provide accurate motion vectors because a variety of block sizes is available. The 2D/3D video conversion algorithm proposed in this paper creates the left and right images that correspond to the original image by using a cut detection method, delay factors, motion types, and image types. The motion type and direction are usually consistent within a given cut because the frames in the same cut are highly correlated. We show the improved performance of the proposed algorithm through experimental results.

Fractal Depth Map Sequence Coding Algorithm with Motion-vector-field-based Motion Estimation

  • Zhu, Shiping;Zhao, Dongyu
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.1
    • /
    • pp.242-259
    • /
    • 2015
  • Three-dimensional video coding is one of the main challenges restricting the widespread application of 3D video and free viewpoint video. In this paper, a novel fractal coding algorithm with motion-vector-field-based motion estimation for depth map sequences is proposed. We first add a pre-search restriction that rules improper domain blocks out of the matching search process, so that fewer blocks are involved in the search. Several improvements to motion estimation, including initial search point prediction, a threshold transition condition, and an early termination condition, are made based on the features of fractal coding. A motion-vector-field-based adaptive hexagon search algorithm, built on the center-biased distribution characteristics of depth motion vectors, is proposed to accelerate the search. Experimental results show that the proposed algorithm can reach optimum levels of quality and save coding time. Compared with H.264 Full Search, the PSNR of the synthesized view is increased by 0.56 dB with a 36.97% bit rate decrease on average, and the depth encoding time is reduced by up to 66.47%. Moreover, the proposed fractal depth map sequence codec outperforms recent alternative codecs that improve on H.264/AVC, especially in bit rate saving and encoding time reduction.

Review of Video Imaging Technology in Coastal Wave Observations and Suggestion for Its Applications (비디오 영상 자료를 이용한 연안 국지파랑 관측기술과 그 활용에 대한 고찰)

  • Lee, Dong-Young;Yoo, Je-Seon;Park, Kwang-Soon
    • Ocean and Polar Research
    • /
    • v.31 no.4
    • /
    • pp.415-422
    • /
    • 2009
  • The wave observation system in Korea has been established with an emphasis on point measurement based on in situ instrumentation. However, the system cannot fully investigate coastal wave-related problems that are significantly localized and intensified by three-dimensional regional geometries. An observation technique that can cover local processes with large temporal and spatial variation needs to be established. Video imaging techniques that can provide continuous monitoring of coastal waves and related phenomena with high spatial and temporal resolution at minimum cost and instrumentation risk are reviewed, together with the present status of their implementation in Korea. Practical applications of the video imaging techniques are suggested to tackle various coastal issues of public concern in Korea, including real-time monitoring of wave runup and overtopping of swells on the east coast of Korea, longshore and rip currents, morphological and bathymetric changes, storm surge and tsunami inundation, and abnormal extreme waves on the west coast of Korea.

Depth Video Post-processing for Immersive Teleconference (원격 영상회의 시스템을 위한 깊이 영상 후처리 기술)

  • Lee, Sang-Beom;Yang, Seung-Jun;Ho, Yo-Sung
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.37 no.6A
    • /
    • pp.497-502
    • /
    • 2012
  • In this paper, we present an immersive videoconferencing system that enables gaze correction between users in the internet protocol TV (IPTV) environment. The proposed system synthesizes gaze-corrected images using depth estimation and virtual view synthesis algorithms, which are among the most important techniques of a 3D video system. The conventional processes, however, cause several problems, especially temporal inconsistency of the depth video. This problem leads to flickering artifacts that discomfort viewers. Therefore, in order to reduce the temporal inconsistency problem, we exploit a joint bilateral filter that is extended to the temporal domain. In addition, we apply an outlier reduction operation in the temporal domain. From experimental results, we have verified that the proposed system is sufficient to generate natural gaze-corrected images and realize immersive videoconferencing.
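To make the idea of temporally extended joint bilateral filtering concrete, here is a heavily simplified sketch that filters the depth of a single pixel over time only. Each past frame is weighted by how recent it is and by how similar its colour (the guidance signal) is to the current colour. The function name, the Gaussian weights, and the default sigmas are assumptions for illustration; the paper's filter also operates spatially and adds an outlier reduction step that is not modelled here.

```python
import math


def temporal_bilateral(depths, colours, sigma_t=1.0, sigma_c=10.0):
    """Smooth the depth of the newest frame (last list element) at one pixel.

    depths  -- depth values of the pixel in consecutive frames
    colours -- colour/intensity values of the same pixel (guidance signal)
    """
    cur = len(depths) - 1
    num = 0.0
    den = 0.0
    for t, (d, c) in enumerate(zip(depths, colours)):
        # Frames further in the past contribute less.
        w_time = math.exp(-((cur - t) ** 2) / (2.0 * sigma_t ** 2))
        # Frames whose colour differs from the current colour contribute less,
        # so depth is not smoothed across real scene changes.
        w_colour = math.exp(-((colours[cur] - c) ** 2) / (2.0 * sigma_c ** 2))
        w = w_time * w_colour
        num += w * d
        den += w
    return num / den  # den >= 1 because the current frame has weight 1
```

When the colour is stable, noisy depth values are pulled toward their recent history, which reduces flicker; when the colour changes sharply, the past frames receive almost no weight and the current depth is kept.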

A Bit Allocation Method Based on Proportional-Integral-Derivative Algorithm for 3DTV

  • Yan, Tao;Ra, In-Ho;Liu, Deyang;Zhang, Qian
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.5
    • /
    • pp.1728-1743
    • /
    • 2021
  • Three-dimensional (3D) video scenes are complex and difficult to control, especially when scene switching occurs. In this paper, we propose two algorithms based on an incremental proportional-integral-derivative (PID) algorithm and a similarity analysis between views to improve the method of bit allocation for multi-view high efficiency video coding (MV-HEVC). First, an incremental PID algorithm is introduced to control the buffer "liquid level", reducing the negative impact that fluctuations of the buffer "liquid level" have on the target bit allocation of the view layer and frame layer. Then, the image similarity between views is used to establish a bit allocation calculation model for the main and non-main viewpoints of the multi-view video. Finally, a bit allocation calculation method based on hierarchical B frames is proposed. Experimental simulation results verify that the algorithm ensures a smooth transition of image quality while increasing the coding efficiency, and the PSNR increases by 0.03 to 0.82 dB without significantly increasing the computational complexity.
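For readers unfamiliar with the incremental (velocity) form of PID, the sketch below shows the textbook update, in which the controller outputs a change in the control value computed from the last three errors: delta = Kp*(e_k - e_{k-1}) + Ki*e_k + Kd*(e_k - 2*e_{k-1} + e_{k-2}). The class name, the gains, and the reading of `target` and `measured` as buffer fullness levels are assumptions for illustration; the abstract does not state how the paper maps the output onto bit allocation.

```python
class IncrementalPID:
    """Textbook incremental PID: each step returns the *change* in the
    control output rather than the absolute output."""

    def __init__(self, kp, ki, kd):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.e1 = 0.0  # error at step k-1
        self.e2 = 0.0  # error at step k-2

    def step(self, target, measured):
        e = target - measured
        delta = (self.kp * (e - self.e1)
                 + self.ki * e
                 + self.kd * (e - 2.0 * self.e1 + self.e2))
        # Shift the error history for the next call.
        self.e2, self.e1 = self.e1, e
        return delta


# Hypothetical usage: nudge a control value toward a target buffer level.
pid = IncrementalPID(kp=1.0, ki=0.5, kd=0.25)
adjustment = pid.step(target=10.0, measured=6.0)
```

The incremental form needs no running integral, so a sudden disturbance such as a scene switch affects only the next few increments, which is one common reason for preferring it in buffer control.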

Design and implementation of a three-dimensional broadcasting system based on DTV (디지털 TV기반 3차원 방송 시스템 설계 및 구현)

  • Yun, Kug-Jin;Cho, Suk-Hee;Hur, Nam-Ho;Kim, Jin-Woong;Lee, Soo-In;Ahn, Chi-Eteuk
    • Journal of Broadcast Engineering
    • /
    • v.11 no.4 s.33
    • /
    • pp.483-494
    • /
    • 2006
  • In this paper, we propose a novel program specific information (PSI) structure in the MPEG-2 system for a three-dimensional broadcasting system that guarantees compatibility with the conventional digital broadcasting system. The proposed PSI structure includes a descriptor of the content type, 2D or 3D, which lets service providers combine 2D and 3D contents freely. However, the descriptor is ignored by conventional digital broadcasting set-top boxes. With the proposed PSI structure, we designed and implemented a three-dimensional broadcasting system. The proposed system offers a 2D or 3D display mode to accommodate user preference in a three-dimensional broadcasting environment.