• Title/Summary/Keyword: Perceptual Quantization

Hardware Implementation of GA HDTV Video Encoder Using Hierarchical Motion Estimation and Adaptive Quantization (계층적 움직임 추정 및 적응 양자화 기법을 사용한 GA HDTV 동영상 부호화기 개발에 관한 연구)

  • 임경원;최병선;조현덕;최정필;유한주;송병철;김성득;박현상;나종범
    • Journal of Broadcast Engineering / v.1 no.2 / pp.152-164 / 1996
  • This paper describes the hardware architecture and implementation trade-offs of the Grand Alliance (GA) HDTV video encoder system. The implemented video encoder accepts video in 1125-line (30 Hz) interlaced format and produces a bitstream compliant with the MPEG-2 (Moving Picture Experts Group) standard. The encoder processing includes large-area motion estimation and an advanced rate control mechanism. To keep the system complexity realizable, we adopt a fast hierarchical motion estimation method and develop its hardware architecture. Furthermore, an adaptive perceptual quantization method is adopted to improve the perceptual quality. The developed system is based on a 4-way parallel processing architecture and is implemented using programmable ICs, memory ICs, and special-purpose processors such as DCT and motion estimation processors.

Perceptual Decomposition and Sequential Principal Edge Vector Quantization of DCT Coefficients for Image Coding (영상 부호화를 위한 DCT 계수의 시각적 분석 및 순차적 규에지 벡터 양자화)

  • 강동욱;송준석;이충웅
    • Journal of the Korean Institute of Telematics and Electronics B / v.32B no.1 / pp.64-72 / 1995
  • We propose a new image coding method that takes into account both the statistical redundancy and the perceptual irrelevancy of the DCT coefficients, so as to provide high-quality reconstructed images at a reduced transmission bit rate. First, a block of DCT coefficients is decomposed into 16 subvectors so that each subvector conveys key information about one of the low-pass or directional filtered images. Then, the most significant subvector is selected as the principal edge of the block and vector quantized. After that, the residuals of the block are computed and sequentially quantized through the same procedure until the quantization distortion is smaller than the target distortion. The proposed scheme encodes images well over a variety of transmission bit rates, especially at very low bit rates. Another benefit of the proposed scheme is that an image can be quantized over a wide range of transmission bit rates simply by adapting the stopping criterion of the sequential vector quantizer to the target distortion of the reconstructed image.

Perceptual and Adaptive Quantization of Line Spectral Frequency Parameters (선 스펙트럼 주파수의 청각 적응 부호화)

  • 한우진;김은경;오영환
    • The Journal of the Acoustical Society of Korea / v.19 no.8 / pp.68-77 / 2000
  • Line spectral frequency (LSF) parameters have been widely used in low bit-rate speech coding because they represent the short-time speech spectrum efficiently. In this paper, a new distance measure based on the masking properties of the human ear is proposed for quantizing LSF parameters, whereas most conventional quantization methods are based on the weighted Euclidean distance measure. The proposed method derives the perceptual distance measure from the definition of the noise-to-mask ratio (NMR), which corresponds closely to the distortion actually perceived by the human ear, and uses it for quantizing LSF parameters. In addition, we propose an adaptive bit allocation scheme that reduces the average bit rate by allocating the minimum number of bits to the LSF parameters that maintains the perceptual transparency of a given speech frame. For the performance evaluation, we report the ratio of perceptually transparent frames and the corresponding average bit rates for the conventional and proposed methods. By combining the proposed distance measure and the adaptive bit allocation scheme, the proposed system requires only 770 bps to obtain 95.5% perceptually transparent frames, whereas the conventional systems yield 89.9% even at 1800 bps.
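
For orientation, the conventional baseline mentioned in this abstract, a weighted Euclidean distance used to pick the nearest codeword, can be sketched as follows. This is only an illustrative sketch: the function names, the toy codebook, and the weight values are hypothetical, and the paper's NMR-based perceptual distance is not reproduced here.

```python
def weighted_distance(x, y, w):
    """Weighted squared Euclidean distance between two LSF vectors."""
    return sum(wi * (xi - yi) ** 2 for xi, yi, wi in zip(x, y, w))


def nearest_codeword(x, codebook, w):
    """Index of the codeword closest to x under the weighted distance."""
    return min(range(len(codebook)),
               key=lambda k: weighted_distance(x, codebook[k], w))


# Hypothetical 3-dimensional LSF vector and two-entry codebook (radians).
lsf = [0.30, 0.62, 1.10]
codebook = [[0.25, 0.60, 1.20],
            [0.35, 0.70, 1.10]]

uniform_choice = nearest_codeword(lsf, codebook, [1.0, 1.0, 1.0])
weighted_choice = nearest_codeword(lsf, codebook, [1.0, 10.0, 0.1])
```

With these toy values the selected codeword changes when the weights change, which is the property a perceptually motivated distance measure relies on.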

Adaptive Watermarking Using Successive Subband Quantization and Perceptual Model Based on Multiwavelet Transform Domain (멀티웨이브릿 변환 영역 기반의 연속 부대역 양자화 및 지각 모델을 이용한 적응 워터마킹)

  • 권기룡;이준재
    • Journal of Korea Multimedia Society / v.6 no.7 / pp.1149-1158 / 2003
  • A content-adaptive watermark embedding algorithm using a stochastic image model in the multiwavelet transform domain is proposed in this paper. A watermark is embedded into the perceptually significant coefficients (PSCs) of each subband using the multiwavelet transform. The PSCs in the high-frequency subbands are selected by successive subband quantization (SSQ), that is, by setting the threshold to one half of the largest coefficient in each subband. The perceptual model is applied with a stochastic approach based on the noise visibility function (NVF), which captures local image properties, for watermark embedding. This model uses the characteristics of a stationary generalized Gaussian model because the watermark has noise-like properties. The watermark estimation uses the shape parameter and variance of each subband region, from which content-adaptive criteria are derived for edge, texture, and flat regions. Experimental results show that the proposed multiwavelet-based watermark embedding method achieves excellent invisibility and robustness.
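
The selection rule stated in this abstract (threshold set to one half of the largest coefficient in each subband) can be illustrated with a minimal sketch. The function name, the use of coefficient magnitudes, the strict `>` comparison, and the sample values are assumptions for illustration; the abstract does not specify them, nor how the "successive" passes proceed.

```python
def select_significant(subband):
    """Return (threshold, indices) of perceptually significant coefficients.

    The threshold is half of the largest coefficient magnitude in the
    subband. Using magnitudes and a strict comparison is an assumption.
    """
    threshold = max(abs(c) for c in subband) / 2.0
    indices = [i for i, c in enumerate(subband) if abs(c) > threshold]
    return threshold, indices


# Hypothetical high-frequency subband coefficients, flattened to a list.
subband = [3.0, -8.0, 1.5, 5.0, -4.0]
threshold, indices = select_significant(subband)
```

Here the largest magnitude is 8.0, so the threshold is 4.0 and only the coefficients at positions 1 and 3 are kept as embedding candidates.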

Adaptive Image Watermarking Using a Stochastic Multiresolution Modeling

  • Kim, Hyun-Chun;Kwon, Ki-Ryong;Kim, Jong-Jin
    • Proceedings of the IEEK Conference / 2002.07a / pp.172-175 / 2002
  • This paper presents a perceptual model with a stochastic multiresolution characteristic that can be applied to watermark embedding in the biorthogonal wavelet domain. The adaptive watermarking algorithm with this perceptual model embeds the watermark more strongly in texture and edge regions by means of SSQ (successive subband quantization). The watermark embedding is based on the computation of an NVF (noise visibility function) that captures local image properties. This method uses a non-stationary Gaussian model and a stationary generalized Gaussian model because the watermark has noise-like properties. To determine the optimal NVF, we consider the watermark as noise. Embedding with the stationary generalized Gaussian model uses the shape parameter and variance of each subband region in the multiresolution decomposition. To estimate the shape parameter, we use a moment matching method. The non-stationary Gaussian model uses the local mean and variance of each subband. Simulation results show excellent invisibility and robustness. The distortion experiments are carried out with the StirMark benchmark test.
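
As a rough illustration of an NVF driven by the local mean and variance, the sketch below uses the form NVF = 1 / (1 + local variance), which is commonly associated with the non-stationary Gaussian model. That formula, the 3x3 window, and all names are assumptions; the abstract does not give the exact expression the authors use.

```python
def local_variance(img, r, c, radius=1):
    """Variance of the pixels in a (2*radius+1)^2 window, clipped at borders."""
    rows, cols = len(img), len(img[0])
    window = [img[i][j]
              for i in range(max(0, r - radius), min(rows, r + radius + 1))
              for j in range(max(0, c - radius), min(cols, c + radius + 1))]
    mean = sum(window) / len(window)
    return sum((v - mean) ** 2 for v in window) / len(window)


def nvf_map(img, radius=1):
    """Assumed NVF: 1 / (1 + local variance), evaluated at every position."""
    return [[1.0 / (1.0 + local_variance(img, r, c, radius))
             for c in range(len(img[0]))]
            for r in range(len(img))]


# Hypothetical 4x4 inputs: a flat region and a region with a vertical edge.
flat = [[10.0] * 4 for _ in range(4)]
edge = [[0.0, 0.0, 100.0, 100.0] for _ in range(4)]

nvf_flat = nvf_map(flat)
nvf_edge = nvf_map(edge)
```

Under this assumed form, values near 1 mark flat regions and values near 0 mark edge or texture regions, which matches the abstract's statement that the watermark is embedded more strongly in texture and edge regions.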

Lightweight Quality Metric Based on No-Reference Bitstream for H.264/AVC Video

  • Kim, Yo-Han;Shin, Ji-Tae;Kim, Ho-Kyom
    • KSII Transactions on Internet and Information Systems (TIIS) / v.6 no.5 / pp.1388-1399 / 2012
  • This paper proposes a No-Reference Bitstream (NR-B) quality metric with very low computational complexity for assessing the human-perceptual quality of H.264-encoded video. The proposed NR-B method models the encoding distortion using three items of bitstream information (frame rate, motion vectors, and quantization parameter) that can be extracted directly from the encoded bitstream, so it does not require additional complex processing of the decoded pictures. In a performance evaluation using 165 compressed video sequences, the experimental results show that the proposed metric correlates more highly with subjective quality than other comparable methods.

Perceptual Quality Improvement of KLT based Entropy-Constrained Quantizer using a SAW Filter (SAW 필터를 이용한 KLT 기반 Entropy-Constrained Quantizer 성능 향상)

  • Lim, Dong-Seok;Kim, Moo Young
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2013.06a / pp.1-2 / 2013
  • Code Excited Linear Prediction (CELP) coders are a representative approach to compressing human speech signals. To improve the rate-distortion performance of CELP, Karhunen-Loeve Transform (KLT) based Classified Vector Quantization (KLT-CVQ) was proposed, and it was later extended to KLT-based Adaptive Entropy-Constrained Quantization (KLT-AECQ). The conventional KLT-AECQ uses a formant weighting filter to improve perceptual performance. In this paper, we apply a Spectral Amplitude Warping (SAW) filter in place of that filter, thereby improving the perceptual performance of the KLT-AECQ coder.

JND based Video Pre-processing Adaptive to Quantization Step sizes for Perceptual Redundancy Reduction (시각적 인지 중복성 제거를 위해 양자화 크기값에 적응적인 최소 인지 왜곡 기반 전처리 방법)

  • Ki, Sehwan;Kim, Munchurl
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2016.11a / pp.100-102 / 2016
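  • In this paper, we present a Just Noticeable Quantization Distortion (JNQD) model, which is better suited to compression than the Just Noticeable Distortion (JND) models used in conventional perceptual video coding, and propose a perceptual video compression method based on it. The proposed method is a pre-processing step that removes unnecessary information from the input video in advance, without modifying the Rate-Distortion Optimization (RDO) inside the video codec; by using the JNQD model, it is simpler and can greatly increase compression efficiency. Existing pre-processing methods for video compression could not take the encoder's quantization values into account during pre-processing, which led to inaccurate removal of perceptual redundancy. The proposed method considers not only the characteristics of the video but also the quantization step size, and can therefore adaptively remove subjective perceptual redundancy in the pre-processing stage without introducing perceptible distortion. It improves compression efficiency by about 15% over the HEVC reference software while maintaining nearly the same subjective quality.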

No-Referenced Video-Quality Assessment for H.264 SVC with Packet Loss (패킷 손실시 H.264 SVC의 무기준법 영상 화질 평가 방법)

  • Kim, Hyun-Tae;Kim, Yo-Han;Shin, Ji-Tae;Won, Seok-Ho
    • The Journal of Korean Institute of Communications and Information Sciences / v.36 no.11C / pp.655-661 / 2011
  • The transmission issues for video coded with the scalable video coding extension of H.264/AVC (H.264 SVC) have been widely studied. In this paper, we propose an objective no-reference video-quality assessment metric for H.264 SVC that uses scalability information. The proposed metric estimates the perceptual video quality under error conditions by considering the motion vectors, the error propagation patterns arising from the hierarchical prediction structure, the quantization parameters, and the number of frames damaged by packet loss. The proposed metric reflects the human perceptual quality of video, and we evaluate its performance using the correlation between the differential mean opinion score (DMOS), taken as the subjective quality, and the proposed metric.

Enhanced Adjustment Strategy of Masking Threshold for Speech Signals in Low Bit-Rate Audio Coding (저전송률 오디오 부호화에서 음성 신호의 성능 개선을 위한 마스킹 임계값 적응기법 향상)

  • Lee, Chang-Heon;Kang, Hong-Goo
    • The Journal of the Acoustical Society of Korea / v.29 no.1 / pp.62-68 / 2010
  • This paper proposes a new masking threshold adjustment strategy to improve the performance for speech signals in low bit-rate audio coding. After the formant regions are determined, the masking threshold is adjusted using the ratio of the energy of each sub-band to the average energy of each formant. More quantization noise is added to the bands that have relatively large energy, whereas less distortion is allowed in spectral valley regions by allocating more bits to them, which reflects the concept of perceptual weighting widely used in speech coding. The results of an objective speech quality measure verify that the proposed method improves quality for speech input signals compared with the conventional one.