• Title/Summary/Keyword: 3D video


An Objective No-Reference Perceptual Quality Assessment Metric based on Temporal Complexity and Disparity for Stereoscopic Video

  • Ha, Kwangsung;Bae, Sung-Ho;Kim, Munchurl
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.2 no.5
    • /
    • pp.255-265
    • /
    • 2013
  • 3DTV is expected to be a promising next-generation broadcasting service. However, the visual discomfort and fatigue caused by viewing 3D video have become an important issue. This paper proposes a perceptual quality assessment metric for stereoscopic video (SV-PQAM). The SV-PQAM is built on four features that affect human perception of depth and visual discomfort in stereoscopic views: temporal variance, intra-frame disparity variation, inter-frame disparity variation, and the disparity distribution of frame boundary areas. The four features are combined into the SV-PQAM, which serves as a no-reference objective quality assessment metric for stereoscopic video. The proposed SV-PQAM does not require a depth map; instead, it uses disparity information obtained by a simple estimation. The model parameters were estimated by linear regression against the mean opinion score (MOS) values obtained from subjective perceptual quality assessments. The experimental results showed that the proposed SV-PQAM is highly consistent with the subjective assessment results, with a Pearson correlation coefficient of 0.808, and that its predictions were consistent, with an outlier ratio of zero.
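
The regression and evaluation steps described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the four feature values have already been extracted per video, and the function names `fit_linear`, `predict`, and `pearson` are hypothetical. It fits an intercept plus four weights by ordinary least squares and measures agreement with MOS using the Pearson correlation coefficient.

```python
import math


def fit_linear(features, scores):
    """Ordinary least squares via the normal equations; returns [bias, w1, ...]."""
    rows = [[1.0] + list(f) for f in features]  # prepend the intercept term
    n = len(rows[0])
    # Augmented normal-equation matrix [A^T A | A^T y].
    m = [
        [sum(r[i] * r[j] for r in rows) for j in range(n)]
        + [sum(r[i] * s for r, s in zip(rows, scores))]
        for i in range(n)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        for r in range(n):
            if r != c:
                factor = m[r][c] / m[c][c]
                m[r] = [a - factor * b for a, b in zip(m[r], m[c])]
    return [m[i][n] / m[i][i] for i in range(n)]


def predict(weights, feature_vector):
    """Linear combination of the features: bias + sum(w_i * f_i)."""
    return weights[0] + sum(w * f for w, f in zip(weights[1:], feature_vector))


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    norm_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    norm_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    if norm_x == 0 or norm_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (norm_x * norm_y)
```

In use, `fit_linear` would be called with one four-element feature tuple per test video and the matching MOS values, and `pearson` would compare the predictions with the MOS values. The abstract does not say how the features are scaled or how the outlier ratio is computed, so neither is shown here.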


Fast Transcoding from H.264 to MPEG-4 (H.264에서 MPEG-4로 빠른 트랜스코딩)

  • 권혁균;이영렬
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.41 no.6
    • /
    • pp.91-99
    • /
    • 2004
  • This paper proposes two transcoding methods for converting an H.264 video bitstream into an MPEG-4 video bitstream: one maintains the same spatio-temporal resolution and the other reduces the spatial resolution. When the H.264 bitstream is converted into the MPEG-4 bitstream, H.264 block types are mapped to MPEG-4 block types by minimizing distortion, and ${\times}4$ block-based motion vector mapping is performed. The two proposed transcoding methods run 4.14 to 5.1 times as fast as cascaded transcoding on the MPEG-4 encoder side, while the PSNR (peak signal-to-noise ratio) degrades slightly, by at most 0.3 dB.
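
For reference, the PSNR figure quoted above follows the standard definition $10\log_{10}(\mathrm{MAX}^2/\mathrm{MSE})$. The sketch below assumes 8-bit samples (peak value 255) passed as flat sequences; the abstract does not state which colour planes or frames were measured, so the function name `psnr` and its arguments are illustrative only.

```python
import math


def psnr(reference, decoded, peak=255):
    """PSNR in dB between two equally sized sequences of sample values."""
    if len(reference) != len(decoded) or not reference:
        raise ValueError("inputs must be non-empty and the same length")
    # Mean squared error between the reference and the decoded samples.
    mse = sum((r - d) ** 2 for r, d in zip(reference, decoded)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals: PSNR is unbounded
    return 10 * math.log10(peak ** 2 / mse)
```

A degradation of 0.3 dB then means that the PSNR of the fast-transcoded output is at most 0.3 lower than that of the cascaded transcoder output when both are measured against the same reference.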

Design and Implementation of Scalable Multi-view Video Coding Based on Integration of SHVC and MVC (SHVC 및 MVC 통합 기반의 스케일러블 다시점 비디오 부호화 설계 및 구현)

  • Jung, Tae-jun;Seo, Kwang-deok
    • Journal of Broadcast Engineering
    • /
    • v.22 no.3
    • /
    • pp.405-408
    • /
    • 2017
  • Because the viewpoints of multi-view images are highly similar, MV-HEVC achieves high coding efficiency by performing inter-view prediction between viewpoints in addition to conventional temporal prediction within a single viewpoint. In this paper, we propose integrating SHVC and MVC (Multi-view Video Coding) to implement a scalable multi-view video encoder that uses HEVC as the base layer. Experimental results verify that the BD-PSNR improvement reaches up to 1.5 dB while the BD-Bitrate is reduced by around 50 to 60%.

FFT Based Information Concealing Method for Video Copyright Protection (동영상 저작권보호를 위한 FFT 기반 정보 은닉 기법)

  • Choi, Il-Mok;Hwang, Seon-Cheol
    • The Transactions of the Korean Institute of Electrical Engineers P
    • /
    • v.62 no.4
    • /
    • pp.204-209
    • /
    • 2013
  • An FFT-based fingerprinting method that conceals more information has been developed for video copyright protection. Proving ownership and legal distribution requires more complex information to be embedded in the video in invisible form. This paper describes a method for inserting such information and detecting it. A $3{\times}3$ point structure is used to represent the information: one of the nine points is always turned on, and the remaining eight points carry 8 bits, giving $2^8 = 256$ possible patterns. The points are marked in the frequency domain by modifying both the real and imaginary parts of the coefficients. Five successive frames of the same scene are marked, because frames of the same scene have very similar shapes in the FFT result. However, the detailed coefficient values are completely different from one another, which allows the marked points to be recognized. This paper also describes a method for detecting the marked points using averaging and a correlation algorithm. The PSNRs of images marked by our method ranged from 51.138 dB to 51.143 dB, and the correlation values ranged from 0.79 to 0.87.
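
The $3{\times}3$ point structure can be illustrated with a small encoder and decoder. This sketch covers only the mapping between an 8-bit value and a nine-point pattern; the frequency-domain embedding and the averaging and correlation detector are not shown. The position of the always-on point (the centre, `REF_INDEX = 4`) and the bit order are assumptions, since the abstract does not specify them.

```python
REF_INDEX = 4  # assumption: the always-on reference point is the grid centre


def byte_to_pattern(value):
    """Map an 8-bit value to a 3x3 grid of 0/1 points with one fixed 'on' point."""
    if not 0 <= value <= 255:
        raise ValueError("value must fit in 8 bits")
    bits = [(value >> i) & 1 for i in range(8)]  # least significant bit first
    cells = bits[:REF_INDEX] + [1] + bits[REF_INDEX:]
    return [cells[row * 3:(row + 1) * 3] for row in range(3)]


def pattern_to_byte(pattern):
    """Recover the 8-bit value from a 3x3 grid produced by byte_to_pattern."""
    cells = [cell for row in pattern for cell in row]
    if len(cells) != 9 or cells[REF_INDEX] != 1:
        raise ValueError("not a valid pattern: reference point is missing")
    bits = cells[:REF_INDEX] + cells[REF_INDEX + 1:]
    return sum(bit << i for i, bit in enumerate(bits))
```

Because the reference point is fixed, the eight free points yield exactly 256 distinct patterns, which matches the $2^8$ count given in the abstract.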

A Bit Allocation Method Based on Proportional-Integral-Derivative Algorithm for 3DTV

  • Yan, Tao;Ra, In-Ho;Liu, Deyang;Zhang, Qian
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.5
    • /
    • pp.1728-1743
    • /
    • 2021
  • Three-dimensional (3D) video scenes are complex and difficult to control, especially when scene switching occurs. In this paper, we propose two algorithms based on an incremental proportional-integral-derivative (PID) algorithm and a similarity analysis between views to improve the method of bit allocation for multi-view high efficiency video coding (MV-HEVC). First, an incremental PID algorithm is introduced to control the buffer "liquid level", reducing the negative impact of fluctuations in that level on the target bit allocation of the view layer and frame layer. Then, the image similarity between views is used to establish a bit allocation calculation model for the main and non-main viewpoints of the multi-view video. Finally, a bit allocation calculation method based on hierarchical B frames is proposed. Experimental simulation results verify that the algorithm ensures a smooth transition of image quality while increasing the coding efficiency, and the PSNR increases by 0.03 to 0.82 dB without significantly increasing the computational complexity.
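
An incremental PID controller outputs a change in the control value rather than the value itself, using the textbook update $\Delta u_k = K_p(e_k - e_{k-1}) + K_i e_k + K_d(e_k - 2e_{k-1} + e_{k-2})$. The sketch below shows that generic form only. How the paper maps the buffer "liquid level" to an error signal, and how the increment adjusts the target bits, is not given in the abstract, so the class name, the gains, and the `setpoint` interpretation (a target buffer fullness) are assumptions.

```python
class IncrementalPID:
    """Generic incremental (velocity-form) PID controller."""

    def __init__(self, kp, ki, kd, setpoint):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.setpoint = setpoint  # assumed: target buffer fullness
        self.prev_error = 0.0     # e[k-1]
        self.prev_error2 = 0.0    # e[k-2]

    def step(self, measurement):
        """Return the control increment for the current measurement."""
        error = self.setpoint - measurement
        delta = (
            self.kp * (error - self.prev_error)
            + self.ki * error
            + self.kd * (error - 2 * self.prev_error + self.prev_error2)
        )
        # Shift the error history for the next call.
        self.prev_error2, self.prev_error = self.prev_error, error
        return delta
```

A rate controller could add each returned increment to its current bit budget after every coded frame. Because only the last two errors are stored, no running integral has to be kept between frames.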

A Method for 3D Human Pose Estimation based on 2D Keypoint Detection using RGB-D information (RGB-D 정보를 이용한 2차원 키포인트 탐지 기반 3차원 인간 자세 추정 방법)

  • Park, Seohee;Ji, Myunggeun;Chun, Junchul
    • Journal of Internet Computing and Services
    • /
    • v.19 no.6
    • /
    • pp.41-51
    • /
    • 2018
  • Recently, deep learning-based methods have been applied to intelligent video surveillance systems, making it possible to robustly detect various events such as crimes, fires, and abnormal phenomena. However, because projecting the 3D real world onto a 2D image loses 3D information and causes occlusion, the occlusion problem must be considered in order to accurately detect objects and estimate poses. Therefore, in this paper, we detect moving objects by adding depth information to the existing RGB information, which addresses the occlusion problem in the object detection process. Then, a convolutional neural network applied to the detected region predicts the positions of 14 keypoints of the human joint regions. Finally, to solve the self-occlusion problem that occurs in the pose estimation process, we describe a method for 3D human pose estimation that extends the estimation range to 3D space using the predicted 2D keypoints and a deep neural network. The 2D and 3D pose estimation results of this research can serve as readily usable data for future human behavior recognition and contribute to the development of industrial technology.

The depth quality enhancement algorithm for Autostereoscopic 3D Monitor (무안경 3D 모니터를 위한 Depth 화질 향상 Algorithm)

  • Song, Sung-Ho;Lee, Kyoung-Il;Lee, Dong-Ha;Park, Jong-Cheol;Lee, Jea-Jun;Kim, Young-Kil
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2012.05a
    • /
    • pp.133-136
    • /
    • 2012
  • In this paper, we identify and apply several effective ways to improve the 3D quality of autostereoscopic 3D display products. Compared with traditional glasses-based 3D, autostereoscopic products have the disadvantage of poor 3D depth quality, and the 3D image can be seen only from a fixed distance and position. To compensate for this disadvantage, we use head tracking technology, video placement algorithms, and several other techniques. This paper reports on how to improve the 3D quality of a parallax barrier autostereoscopic display through head tracking with user identification, video replacement algorithms, and a crosstalk improvement method.


2D/3D conversion algorithm on broadcast and mobile environment and the platform (방송 및 모바일 실감형 2D/3D 컨텐츠 변환 방법 및 플랫폼)

  • Song, Hyok;Bae, Jin-Woo;Yoo, Ji-Sang;Choi, Byeoung-Ho
    • 한국정보통신설비학회:학술대회논문집
    • /
    • 2007.08a
    • /
    • pp.386-389
    • /
    • 2007
  • TV technology started with black-and-white TV. After color TV was invented, users began to demand more realistic TV technology, and the next step is 3DTV. 3DTV requires 3D display technology, 3D coding technology, digital mux/demux technology for broadcasting, and 3D video acquisition. Moreover, almost all existing content is 2D, which makes 2D-to-3D conversion necessary. This article describes a 2D/3D conversion algorithm and a hardware platform on an FPGA board. A time difference creates the 3D effect, and a convolution filter enhances it. The distorted image and the original image together give the 3D effect. The algorithm is demonstrated on a 3D display; the display device produces the 3D effect by the parallax barrier method and includes the FPGA board.


Effective Broadcasting and Caching Technique for Video on Demand over Wireless Network

  • Alomari, Saleh Ali;Sumari, Putra
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.6 no.3
    • /
    • pp.919-940
    • /
    • 2012
  • Video on Demand (VOD) is a multimedia service that allows a remote user to select and view a video at any convenient time, which makes VOD an important technology for many applications. Numerous periodic VOD broadcasting protocols have been proposed to support a large number of receivers. Broadcasting is an efficient transmission scheme for providing on-demand service for very popular movies. This paper proposes a new broadcasting scheme called Popularity Cushion Staggered Broadcasting (PCSB). The proposed scheme improves on the Periodic Broadcasting (PB) protocols in the latest mobile VOD system, called the MobiVoD system. It also reduces the maximum waiting time of a mobile node by partitioning the $1^{st}$ segment of the whole video, storing it in the Local Media Forwarder (LMF), specifically in the Pool of RAM (PoR), and transmitting it when mobile nodes miss the $1^{st}$ broadcast segment. The results show that PCSB is more efficient than all other types of broadcasting and caching techniques in the MobiVoD system. Furthermore, the results show that system performance is stable under highly dynamic conditions and that the viewer's waiting time is shorter than in the previous system.

Clinical Analysis of Video-assisted Thoracoscopic Spinal Surgery in the Thoracic or Thoracolumbar Spinal Pathologies

  • Kim, Sung-Jin;Sohn, Moon-Jun;Ryoo, Ji-Yoon;Kim, Yeon-Soo;Whang, Choong-Jin
    • Journal of Korean Neurosurgical Society
    • /
    • v.42 no.4
    • /
    • pp.293-299
    • /
    • 2007
  • Objective: Thoracoscopic spinal surgery provides minimally invasive approaches for effective vertebral decompression and reconstruction of the thoracic and thoracolumbar spine, while significantly lowering surgery-related morbidity. This study analyzes the clinical results of thoracoscopic spinal surgery performed at our institute. Methods: Twenty consecutive patients underwent video-assisted thoracoscopic surgery (VATS) to treat various thoracic and thoracolumbar pathologies from April 2000 to July 2006. The lesions consisted of spinal trauma (13 cases), thoracic disc herniation (4 cases), tuberculous spondylitis (1 case), post-operative thoracolumbar kyphosis (1 case) and thoracic tumor (1 case). The levels of operation included upper thoracic lesions (3 cases), midthoracic lesions (6 cases) and thoracolumbar lesions (11 cases). We classified the procedures into three groups: stand-alone thoracoscopic discectomy (3 cases), thoracoscopic fusion (11 cases) and video-assisted mini-thoracotomy (6 cases). Results: Analysis on the Frankel performance scale of the spinal trauma patients (13 cases) showed that 7 patients had neurological impairment preoperatively: Grade D (2 cases), Grade C (2 cases), Grade B (1 case), and Grade A (2 cases). Four patients improved neurologically after surgery: two improved from grade C to E, one from grade D to E, and one from grade B to D. The preoperative Cobb's and kyphotic angles measured in the spinal trauma patients were $18.9{\pm}4.4^{\circ}$ and $18.8{\pm}4.6^{\circ}$, respectively. Postoperatively, the angles showed statistically significant improvement, to $15.1{\pm}3.7^{\circ}$ and $11.3{\pm}2.4^{\circ}$, respectively (P<0.001).
Conclusion: Although VATS has a steep learning curve, it is an effective and minimally invasive procedure that provides biomechanical stability in terms of anterior column decompression and reconstruction for anterior load bearing, and preserves the intercostal muscles and diaphragm.