• Title/Summary/Keyword: 3D video

검색결과 1,152건 처리시간 0.026초

An Objective No-Reference Perceptual Quality Assessment Metric based on Temporal Complexity and Disparity for Stereoscopic Video

  • Ha, Kwangsung;Bae, Sung-Ho;Kim, Munchurl
    • IEIE Transactions on Smart Processing and Computing
    • /
    • 제2권5호
    • /
    • pp.255-265
    • /
    • 2013
  • 3DTV is expected to be a promising next-generation broadcasting service. On the other hand, the visual discomfort/fatigue problems caused by viewing 3D videos have become an important issue. This paper proposes a perceptual quality assessment metric for a stereoscopic video (SV-PQAM). To model the SV-PQAM, this paper presents the following features: temporal variance, disparity variation in intra-frames, disparity variation in inter-frames and disparity distribution of frame boundary areas, which affect the human perception of depth and visual discomfort for stereoscopic views. The four features were combined into the SV-PQAM, which then becomes a no-reference stereoscopic video quality perception model, as an objective quality assessment metric. The proposed SV-PQAM does not require a depth map but instead uses the disparity information by a simple estimation. The model parameters were estimated based on linear regression from the mean score opinion values obtained from the subjective perception quality assessments. The experimental results showed that the proposed SV-PQAM exhibits high consistency with subjective perception quality assessment results in terms of the Pearson correlation coefficient value of 0.808, and the prediction performance exhibited good consistency with a zero outlier ratio value.

  • PDF

H.264에서 MPEG-4로 빠른 트랜스코딩 (Fast Transcoding from H.264 to MPEG-4)

  • 권혁균;이영렬
    • 전자공학회논문지CI
    • /
    • 제41권6호
    • /
    • pp.91-99
    • /
    • 2004
  • 본 논문은 H.264와 MPEG-4 간의 원활한 통신을 하기위한 두 가지 트랜스코딩 방법을 제안한다. 같은 공간적 시간적 해상도(spatio-temporal resolution)를 유지하는 트랜스코팅 방법과 공간적 해상도(temporal resolution)를 줄이는 트랜스코팅 방법을 제안한다. H.264 비트스트림(bitstream)이 MPEG-4 비트스트림으로 변환 시 H.264 블록형태를 MPEG-4에서 사용 할 수 있는 블록형태로 변환 시켜야 하며, 4×4 블록단위의 움직임 벡터도 8×8 블록단위의 움직임 벡터로 조정하여야 한다. 두 가지 제안된 트랜스코딩 방법은 직렬 화소영역 트랜스코팅 방법(cascade pixel-domain transcoding) 보다 MPEG-4 부호화기 측에서 4.1~5.1배 부호화 속도가 빠를 뿐만 아니라 영상의 화질 저하는 최고 0.3dB정도 밖에 떨어 지지 않는다.

SHVC 및 MVC 통합 기반의 스케일러블 다시점 비디오 부호화 설계 및 구현 (Design and Implementation of Scalable Multi-view Video Coding Based on Integration of SHVC and MVC)

  • 정태준;서광덕
    • 방송공학회논문지
    • /
    • 제22권3호
    • /
    • pp.405-408
    • /
    • 2017
  • 다시점 이미지의 뷰포인트 간에 높은 유사도가 존재함을 바탕으로 MV-HEVC는 뷰포인트 내에서 전통적인 시간적 방향 예측 뿐만 아니라 뷰포인트 간에 예측을 수행함으로써 높은 부호화 효율을 얻는다. 본 논문에서는 HEVC를 기본 계층으로 사용하는 스케일러블다시점 비디오 부호화를 구현하기 위해 SHVC와 MVC를 통합 구현함을 제안한다. 실험결과에 의해 BD-PSNR 개선이 1.5dB에 이르고 동시에 BD-Bitrate를 50~60% 가량 줄일 수 있음을 확인하였다.

동영상 저작권보호를 위한 FFT 기반 정보 은닉 기법 (FFT Based Information Concealing Method for Video Copyright Protection)

  • 최일목;황선철
    • 전기학회논문지P
    • /
    • 제62권4호
    • /
    • pp.204-209
    • /
    • 2013
  • FFT based fingerprinting to conceal more information has developed for video copyright protection. More complex information of video is necessary to prove an ownership and legal distributions in invisible form. This paper describes a method to insert more information and to detect them. $3{\times}3$ points structure is used to present information. The possible ways to show are 8bit, $2^8$ = 256 where one point of 9 is always turn on. The points are marked in frequency domain that both real and imaginary party numbers are modified. The five successive frames of same scenes are used to mark because the same scene has very similar shape in FFT result. However, the detail values of coefficients are totally different each other to recognize the marked points. This paper also describes a method to detect the marked points by averaging and correlation algorithm. The PSNRs of marked images by our method had 51.138[dB] to 51.143[dB]. And we could get the correlation values from 0.79 to 0.87.

A Bit Allocation Method Based on Proportional-Integral-Derivative Algorithm for 3DTV

  • Yan, Tao;Ra, In-Ho;Liu, Deyang;Zhang, Qian
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제15권5호
    • /
    • pp.1728-1743
    • /
    • 2021
  • Three-dimensional (3D) video scenes are complex and difficult to control, especially when scene switching occurs. In this paper, we propose two algorithms based on an incremental proportional-integral-derivative (PID) algorithm and a similarity analysis between views to improve the method of bit allocation for multi-view high efficiency video coding (MV-HEVC). Firstly, an incremental PID algorithm is introduced to control the buffer "liquid level" to reduce the negative impact on the target bit allocation of the view layer and frame layer owing to the fluctuation of the buffer "liquid level". Then, using the image similarity between views is used to establish, a bit allocation calculation model for the multi-view video main viewpoint and non-main viewpoint is established. Then, a bit allocation calculation method based on hierarchical B frames is proposed. Experimental simulation results verify that the algorithm ensures a smooth transition of image quality while increasing the coding efficiency, and the PSNR increases by 0.03 to 0.82dB while not significantly increasing the calculation complexity.

RGB-D 정보를 이용한 2차원 키포인트 탐지 기반 3차원 인간 자세 추정 방법 (A Method for 3D Human Pose Estimation based on 2D Keypoint Detection using RGB-D information)

  • 박서희;지명근;전준철
    • 인터넷정보학회논문지
    • /
    • 제19권6호
    • /
    • pp.41-51
    • /
    • 2018
  • 최근 영상 감시 분야에서는 지능형 영상 감시 시스템에 딥 러닝 기반 학습 방법이 적용되어 범죄, 화재, 이상 현상과 같은 다양한 이벤트들을 강건하게 탐지 할 수 있게 되었다. 그러나 3차원 실세계를 2차원 영상으로 투영시키면서 발생하는 3차원 정보의 손실로 인하여 폐색 문제가 발생하기 때문에 올바르게 객체를 탐지하고, 자세를 추정하기 위해서는 폐색 문제를 고려하는 것이 필요하다. 따라서 본 연구에서는 기존 RGB 정보에 깊이 정보를 추가하여 객체 탐지 과정에서 나타나는 폐색 문제를 해결하여 움직이는 객체를 탐지하고, 탐지된 영역에서 컨볼루션 신경망을 이용하여 인간의 관절 부위인 14개의 키포인트의 위치를 예측한다. 그 다음 자세 추정 과정에서 발생하는 자가 폐색 문제를 해결하기 위하여 2차원 키포인트 예측 결과와 심층 신경망을 이용하여 자세 추정의 범위를 3차원 공간상으로 확장함으로써 3차원 인간 자세 추정 방법을 설명한다. 향후, 본 연구의 2차원 및 3차원 자세 추정 결과는 인간 행위 인식을 위한 용이한 데이터로 사용되어 산업 기술 발달에 기여 할 수 있다.

무안경 3D 모니터를 위한 Depth 화질 향상 Algorithm (The depth quality enhancement algorithm for Autostereoscopic 3D Monitor)

  • 송성호;이경일;이동하;박종철;이재준;김영길
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2012년도 춘계학술대회
    • /
    • pp.133-136
    • /
    • 2012
  • 본 논문은 무안경 3D 디스플레이 제품의 품질을 향상시키기 위한 여러 가지 효과적인 방법을 구축하는데 목적을 두었다. 무안경 제품은 기존의 안경 방식에 비하여 3D의 depth 화질 품질이 떨어졌고, 특정 거리, 위치에서만 볼 수 있는 단점이 있어, 이를 보완하고자 Head Tracking 기술 및 영상 배치알고리즘 등 여러 가지 기술을 적용하여 기존 system의 단점을 보완하였다. 본 논문은 3D 무안경 구현 방식 중 Parallax Barrier의 3D 품질 향상을 위한 Head Tracking을 통한 사용자의 위치 파악, 영상의 재배치 기술 및 Crosstalk 개선 방법에 대해 보고합니다.

  • PDF

방송 및 모바일 실감형 2D/3D 컨텐츠 변환 방법 및 플랫폼 (2D/3D conversion algorithm on broadcast and mobile environment and the platform)

  • 송혁;배진우;유지상;최병호
    • 한국정보통신설비학회:학술대회논문집
    • /
    • 한국정보통신설비학회 2007년도 학술대회
    • /
    • pp.386-389
    • /
    • 2007
  • TV technology started from black and white TV. Color TV invented and users request more realistic TV technology. The next technology is 3DTV. For 3DTV, 3D display technology, 3D coding technology, digital mux/demux technology in broadcast and 3D video acquisition are needed. Moreover, Almost every contents now exist are 2D contents. It causes necessity to convert from 2D to 3D. This article describes 2D/3D conversion algorithm and H/W platform on FPGA board. Time difference makes 3D effect and convolution filter increased the effect. Distorted image and original image give 3D effect. The algorithm is shown on 3D display. The display device shows 3D effect by parallax barrier method and has FPGA board.

  • PDF

Effective Broadcasting and Caching Technique for Video on Demand over Wireless Network

  • Alomari, Saleh Ali;Sumari, Putra
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제6권3호
    • /
    • pp.919-940
    • /
    • 2012
  • Video on Demand (VOD) is a multimedia service which allows a remote user to select and then view video at his convenience at any time he wants, which makes the VOD become an important technology for many applications. Numerous periodic VOD broadcasting protocols have been proposed to support a large number of receivers. Broadcasting is an efficient transmission scheme to provide on-demand service for very popular movies. This paper proposes a new broadcasting scheme called Popularity Cushion Staggered Broadcasting (PCSB). The proposed scheme improves the Periodic Broadcasting (PB) protocols in the latest mobile VOD system, which is called MobiVoD system. It also, reduces the maximum waiting time of the mobile node, by partitioning the $1^{st}$ segment of the whole video and storing it in the Local Media Forwarder (LMF) exactly in the Pool of RAM (PoR), and then transmitting them when the mobile nodes miss the $1^{st}$ broadcasted segment. The results show that the PCSB is more efficient and better than the all types of broadcasting and caching techniques in the MobiVoD system. Furthermore, these results exhibits that system performance is stable under high dynamics of the system and the viewer's waiting time are less than the previous system.

Clinical Analysis of Video-assisted Thoracoscopic Spinal Surgery in the Thoracic or Thoracolumbar Spinal Pathologies

  • Kim, Sung-Jin;Sohn, Moon-Jun;Ryoo, Ji-Yoon;Kim, Yeon-Soo;Whang, Choong-Jin
    • Journal of Korean Neurosurgical Society
    • /
    • 제42권4호
    • /
    • pp.293-299
    • /
    • 2007
  • Objective : Thoracoscopic spinal surgery provides minimally invasive approaches for effective vertebral decompression and reconstruction of the thoracic and thoracolumbar spine, while surgery related morbidity can be significantly lowered. This study analyzes clinical results of thoracoscopic spinal surgery performed at our institute. Methods : Twenty consecutive patients underwent video-assisted thoracosopic surgery (VATS) to treat various thoracic and thoracolumbar pathologies from April 2000 to July 2006. The lesions consisted of spinal trauma (13 cases), thoracic disc herniation (4 cases), tuberculous spondylitis (1 case), post-operative thoracolumbar kyphosis (1 case) and thoracic tumor (1 case). The level of operation included upper thoracic lesions (3 cases), midthoracic lesions (6 cases) and thoracolumbar lesions (11 cases). We classified the procedure into three groups: stand-alone thoracoscopic discectomy (3 cases), thoracoscopic fusion (11 cases) and video assisted mini-thoracotomy (6 cases). Results : Analysis on the Frankel performance scale in spinal trauma patients (13 cases), showed a total of 7 patients who had neurological impairment preoperatively : Grade D (2 cases), Grade C (2 cases), Grade B (1 case), and Grade A (2 cases). Four patients were neurologically improved postoperatively, two patients were improved from C to E, one improved from grade D to E and one improved from grade B to grade D. The preoperative Cobb's and kyphotic angle were measured in spinal trauma patients and were $18.9{\pm}4.4^{\circ}$ and $18.8{\pm}4.6^{\circ}$, respectively. Postoperatively, the angles showed statistically significant improvement, $15.1{\pm}3.7^{\circ}$ and $11.3{\pm}2.4^{\circ}$, respectively(P<0.001). Conclusion : Although VATS requires a steep learning curve, it is an effective and minimally invasive procedure which provides biomechanical stability in terms of anterior column decompression and reconstruction for anterior load bearing, and preservation of intercostal muscles and diaphragm.