• Title/Abstract/Keywords: perceptual video quality

Search results: 52 items; processing time: 0.021 s

Perceptual Quality-based Video Coding with Foveated Contrast Sensitivity

  • 유지우;심동규
    • 방송공학회논문지 / Vol. 19, No. 4 / pp. 468-477 / 2014
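  • This paper proposes a perceptual quality-based video coding method using foveated contrast sensitivity (FCS). Conventional perceptual video coding methods based on contrast sensitivity (CS) exploit the property of the human visual system (HVS) that visual sensitivity varies with spatial frequency, minimizing the loss of perceptual quality during video compression, while methods based on foveated masking (FM) exploit the difference between the central vision and peripheral vision of the HVS. In this work, through psychophysical experiments, we propose a new FCS model in which the conventional discrete cosine transform (DCT)-based just-noticeable difference (JND) model and FM are considered jointly, with their mutual dependence taken into account, and we apply it to the HM10.0 encoder to perform perceptual quality-based coding. Video encoded with the proposed method showed an average bitrate reduction of 10% while maintaining the same quality from a perceptual standpoint.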

No-Referenced Video-Quality Assessment for H.264 SVC with Packet Loss

  • 김현태;김요한;신지태;원석호
    • 한국통신학회논문지 / Vol. 36, No. 11C / pp. 655-661 / 2011
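  • Research on H.264 SVC transmission, which can provide adaptive quality of service across diverse network environments, is active. This paper proposes a quality assessment metric that exploits the layered structure of H.264 SVC as a no-reference objective quality assessment method for H.264 SVC. Depending on the location of the packet loss, the proposed metric predicts perceptual quality by reflecting error-related factors such as motion vectors, the error propagation pattern caused by the hierarchical prediction structure, the quantization parameter, and the number of affected frames. The proposed metric is an objective metric that reflects human-perceived video quality, and its performance was verified through its correlation with DMOS, the result of subjective quality assessment.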

A Multi-category Task for Bitrate Interval Prediction with the Target Perceptual Quality

  • Yang, Zhenwei;Shen, Liquan
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 15, No. 12 / pp. 4476-4491 / 2021
  • Video service providers tend to face user network problems in the process of transmitting video streams. They strive to provide users with superior video quality in a limited-bitrate environment. It is therefore necessary to accurately determine the target bitrate range of a video under different quality requirements. Recently, several schemes have been proposed to meet this requirement. However, they do not take visual influence into account. In this paper, we propose a new multi-category model that uses machine learning to accurately predict the target bitrate range for a target visual quality. Firstly, a dataset is constructed to generate multi-category models by machine learning. The quality score ladders and the corresponding bitrate-interval categories are defined in the dataset. Secondly, several types of spatial-temporal features related to VMAF evaluation metrics and visual factors are extracted and processed statistically for classification. Finally, bitrate prediction models trained on the dataset with a RandomForest classifier can be used to accurately predict the target bitrate of input videos for a target video quality. The classification prediction accuracy of the model reaches 0.705, and video encoded at the bitrate predicted by the model can achieve the target perceptual quality.

SPIHT Video Coder Using Perceptual Weight in Wavelet transform

  • 정용재;강경원;문광석
    • 융합신호처리학회논문지 / Vol. 3, No. 1 / pp. 15-20 / 2002
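  • In a video coder, intra-frame coding has an important influence on the quality of all frames. Standardized video coders use the DCT, but blocking artifacts at low bitrates can degrade quality. This paper proposes a video coding method that reduces this degradation and improves quality from the standpoint of human vision. The proposed method applies perceptual weights to intra frames in the wavelet transform domain and encodes them using SPIHT and VLC; taking human visual characteristics into account, it removes visual noise and thereby improves subjective quality.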

Analysis of the JND-Suppression Effect in Quantization Perspective for HEVC-based Perceptual Video Coding

  • Kim, Jaeil;Kim, Munchurl
    • IEIE Transactions on Smart Processing and Computing / Vol. 4, No. 1 / pp. 22-27 / 2015
  • Transform-domain JND (Just Noticeable Difference)-based suppression for PVC (Perceptual Video Coding) is often performed in the quantization process to effectively remove perceptual redundancy. This study examined the JND-suppression effects on quantized transform coefficients in HEVC (High Efficiency Video Coding). To reveal the JND-suppression effect in quantization, the properties of floor functions were used to model the quantized coefficients, and a JND-adjustment process in an HEVC-compliant PVC scheme was used to tune the JND values by analyzing the JND-suppression effect. In the experimental results, the bitrate reduction decreases slightly, but the PSNR and perceptual quality are improved significantly when the proposed JND-adjustment process is applied.
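
The floor-function view of quantization mentioned in this abstract can be illustrated with a minimal sketch. It is not the paper's model: it assumes a plain scalar quantizer of the form `floor(|c| / qstep + offset)` and a hypothetical suppression rule that subtracts the JND value from the coefficient magnitude before quantization. The names `quantize`, `jnd_suppress`, `qstep`, and `offset`, as well as the sample coefficients, are illustrative only.

```python
import math

def quantize(coeff, qstep, offset=0.5):
    """Scalar quantizer written with a floor function.

    level = sign(c) * floor(|c| / qstep + offset); the offset value is an assumption.
    """
    level = math.floor(abs(coeff) / qstep + offset)
    return int(math.copysign(level, coeff)) if level else 0

def jnd_suppress(coeff, jnd):
    """Hypothetical JND suppression: shrink the magnitude by the JND value, clamped at zero."""
    magnitude = max(abs(coeff) - jnd, 0.0)
    return math.copysign(magnitude, coeff)

def quantize_with_jnd(coeff, qstep, jnd, offset=0.5):
    """Suppress first, then quantize, so sub-threshold detail costs no bits."""
    return quantize(jnd_suppress(coeff, jnd), qstep, offset)

# Made-up transform coefficients, quantized with and without suppression.
coeffs = [37.0, -12.0, 5.0, -3.0]
plain = [quantize(c, qstep=8.0) for c in coeffs]
suppressed = [quantize_with_jnd(c, qstep=8.0, jnd=4.0) for c in coeffs]
print(plain)       # [5, -2, 1, 0]
print(suppressed)  # [4, -1, 0, 0]
```

Under these assumptions, suppression lowers every nonzero level by one, which hints at why a fixed JND value can remove more than intended once the floor operation is taken into account.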

Visual-Attention-Aware Progressive RoI Trick Mode Streaming in Interactive Panoramic Video Service

  • Seok, Joo Myoung;Lee, Yonghun
    • ETRI Journal / Vol. 36, No. 2 / pp. 253-263 / 2014
  • In the near future, traditional narrow and fixed-viewpoint video services will be replaced by high-quality panoramic video services. This paper proposes a visual-attention-aware progressive region of interest (RoI) trick mode streaming service (VA-PRTS) that prioritizes the video data to transmit according to visual attention and transmits the prioritized video data progressively. VA-PRTS enables the receiver to shorten the time to display without degrading the perceptual quality. For the proposed VA-PRTS, this paper defines a cutoff visual attention metric algorithm, which determines the quality of an encoded video slice based on the capability of visual attention, and a progressive streaming method based on the priority of RoI video data. Compared to conventional methods, VA-PRTS increases the bitrate saving by over 57% and decreases the interactive delay by over 66%, while maintaining a level of perceptual video quality. The experimental results show that the proposed VA-PRTS improves the quality of the viewer experience for interactive panoramic video streaming services. The development results show that VA-PRTS is highly feasible in practical, real-field use.

Technology Trends on Image/Video Perceptual Quality Assessment

  • 이대열;김종호;정세윤;조승현;김휘용;최진수
    • 전자통신동향분석 / Vol. 33, No. 3 / pp. 11-21 / 2018
  • Assessment technologies regarding the perceptual quality of images and videos have been receiving significant attention, as they serve as essential tools for monitoring and improving the quality of various media services. In this paper, we review the technology trends of recent studies on the perceptual quality assessment of images and videos, and discuss the future direction of this research field.

Coding Unit-level Multi-loop Encoding Method based on JND for Perceptual Coding

  • 임웅;심동규
    • 전자공학회논문지 / Vol. 52, No. 5 / pp. 147-154 / 2015
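  • This paper proposes a method that applies a JND (Just Noticeable Difference) model, which models the sensitivity of the HVS to surrounding luminance, to video coding: using the threshold given by the JND model as the criterion, it determines the maximum quantization parameter applicable to the current coding unit and thereby reduces the bitrate at similar subjective quality. For the current input coding unit, when the reconstruction obtained with a higher quantization parameter is perceived as similar, in the JND sense, to the reconstruction obtained with the reference quantization parameter, the proposed method selects the higher quantization parameter and thus reduces the bitrate. To verify its performance, the algorithm was applied to HM16.0, the reference software of the latest video compression standard, HEVC (High Efficiency Video Coding); compared with video compressed by HM16.0, it achieved a bitrate reduction of up to 20.21%, and about 6.18% on average, at similar quality.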
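
The selection rule described in this abstract (raise the quantization parameter while the reconstruction stays within the JND threshold of the reference reconstruction) can be sketched as follows. This is only an illustration under stated assumptions, not the HM16.0 implementation: `reconstruct` is a toy stand-in for a real encode/decode loop, the per-sample `jnd` thresholds are taken as given, the search stops at the first failing QP, and `max_delta` is a hypothetical search limit.

```python
def reconstruct(block, qp):
    """Toy stand-in for encode/decode: uniform quantization of the samples.

    The step size follows the H.264/HEVC convention of doubling every 6 QP.
    """
    qstep = 2 ** ((qp - 4) / 6)
    return [round(x / qstep) * qstep for x in block]

def select_max_qp(block, base_qp, jnd, max_delta=6):
    """Return the highest QP whose reconstruction stays within the JND
    threshold of the base-QP reconstruction for every sample."""
    base_rec = reconstruct(block, base_qp)
    best_qp = base_qp
    for qp in range(base_qp + 1, base_qp + max_delta + 1):
        rec = reconstruct(block, qp)
        if all(abs(r - b) <= t for r, b, t in zip(rec, base_rec, jnd)):
            best_qp = qp  # difference is below the visibility threshold
        else:
            break  # assumption: stop at the first visible difference
    return best_qp

# Made-up samples and thresholds, for illustration only.
block = [100, 53, 7, 200]
strict = select_max_qp(block, base_qp=22, jnd=[0.0] * 4)
loose = select_max_qp(block, base_qp=22, jnd=[1000.0] * 4)
print(strict, loose)
```

With zero thresholds the base QP is kept, and with very large thresholds the search runs to its limit, so the JND model alone controls how much extra quantization each coding unit receives.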

Lightweight Quality Metric Based on No-Reference Bitstream for H.264/AVC Video

  • Kim, Yo-Han;Shin, Ji-Tae;Kim, Ho-Kyom
    • KSII Transactions on Internet and Information Systems (TIIS) / Vol. 6, No. 5 / pp. 1388-1399 / 2012
  • This paper proposes a No-Reference Bitstream (NR-B) quality metric with minimal computational complexity for assessing the human-perceived quality of H.264-encoded video. The proposed NR-B method models encoding distortion using three kinds of bitstream information (i.e., frame rate, motion vectors, and quantization parameter) that can be extracted directly from the encoded bitstream, so it does not require additional complex processing of the final pictures. In a performance evaluation using 165 compressed video sequences, the experimental results show that the proposed metric has a higher correlation with subjective quality than other comparable methods.

An Objective No-Reference Perceptual Quality Assessment Metric based on Temporal Complexity and Disparity for Stereoscopic Video

  • Ha, Kwangsung;Bae, Sung-Ho;Kim, Munchurl
    • IEIE Transactions on Smart Processing and Computing / Vol. 2, No. 5 / pp. 255-265 / 2013
  • 3DTV is expected to be a promising next-generation broadcasting service. On the other hand, the visual discomfort and fatigue caused by viewing 3D videos have become an important issue. This paper proposes a perceptual quality assessment metric for stereoscopic video (SV-PQAM). To model the SV-PQAM, this paper presents the following features: temporal variance, disparity variation in intra-frames, disparity variation in inter-frames, and disparity distribution of frame boundary areas, all of which affect the human perception of depth and visual discomfort for stereoscopic views. The four features were combined into the SV-PQAM, which thus becomes a no-reference stereoscopic video quality perception model that serves as an objective quality assessment metric. The proposed SV-PQAM does not require a depth map but instead uses disparity information obtained by a simple estimation. The model parameters were estimated by linear regression from the mean opinion score (MOS) values obtained in subjective perceptual quality assessments. The experimental results showed that the proposed SV-PQAM is highly consistent with the subjective perceptual quality assessment results, with a Pearson correlation coefficient of 0.808, and that its prediction performance exhibited good consistency, with an outlier ratio of zero.
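
Several entries in this list validate an objective metric by its correlation with subjective scores (MOS or DMOS). As a minimal sketch of that evaluation step, the Pearson correlation coefficient can be computed as below. The function name `pearson` and the sample scores are illustrative and are not data from any of the listed papers.

```python
import math

def pearson(xs, ys):
    """Pearson linear correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)

# Made-up objective metric outputs and subjective scores for five sequences.
predicted = [2.1, 3.4, 3.9, 4.6, 1.5]
subjective_mos = [2.0, 3.0, 4.2, 4.5, 1.8]
print(round(pearson(predicted, subjective_mos), 3))
```

A value close to 1 means the metric ranks and scales sequences much as human viewers do; studies often report it alongside rank-order correlation and an outlier ratio.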