• Title/Summary/Keyword: Rate Distortion

Search Result 818, Processing Time 0.023 seconds

Conditional Probability Based Early Termination of Recursive Coding Unit Structures in HEVC (HEVC의 재귀적 CU 구조에 대한 조건부 확률 기반 고속 탐색 알고리즘)

  • Han, Woo-Jin
    • Journal of Broadcast Engineering
    • /
    • v.17 no.2
    • /
    • pp.354-362
    • /
    • 2012
  • Recently, High Efficiency Video Coding (HEVC) is under development jointly by MPEG and ITU-T for the next international video coding standard. Compared to the previous standards, HEVC supports variety of splitting units, such as coding unit (CU), prediction unit (PU), and transform unit (TU). Among them, it has been known that the recursive quadtree structure of CU can improve the coding efficiency while the encoding complexity is increased significantly. In this paper, a simple conditional probability to predict the early termination condition of recursive unit structure is introduced. The proposed conditional probability is estimated based on Bayes' formula from local statistics of rate-distortion costs in encoder. Experimental results show that the proposed method can reduce the total encoding time by about 32% according to the test configuration while the coding efficiency loss is 0.4%-0.5%. In addition, the encoding time can be reduced by 50% with 0.9% coding efficiency loss when the proposed method was used jointly with HM4.0 early CU termination algorithm.

Space-Time Quantization and Motion-Aligned Reconstruction for Block-Based Compressive Video Sensing

  • Li, Ran;Liu, Hongbing;He, Wei;Ma, Xingpo
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.1
    • /
    • pp.321-340
    • /
    • 2016
  • The Compressive Video Sensing (CVS) is a useful technology for wireless systems requiring simple encoders but handling more complex decoders, and its rate-distortion performance is highly affected by the quantization of measurements and reconstruction of video frame, which motivates us to presents the Space-Time Quantization (ST-Q) and Motion-Aligned Reconstruction (MA-R) in this paper to both improve the performance of CVS system. The ST-Q removes the space-time redundancy in the measurement vector to reduce the amount of bits required to encode the video frame, and it also guarantees a low quantization error due to the fact that the high frequency of small values close to zero in the predictive residuals limits the intensity of quantizing noise. The MA-R constructs the Multi-Hypothesis (MH) matrix by selecting the temporal neighbors along the motion trajectory of current to-be-reconstructed block to improve the accuracy of prediction, and besides it reduces the computational complexity of motion estimation by the extraction of static area and 3-D Recursive Search (3DRS). Extensive experiments validate that the significant improvements is achieved by ST-Q in the rate-distortion as compared with the existing quantization methods, and the MA-R improves both the objective and the subjective quality of the reconstructed video frame. Combined with ST-Q and MA-R, the CVS system obtains a significant rate-distortion performance gain when compared with the existing CS-based video codecs.

Korean Word Recognition Using Vector Quantization Speaker Adaptation (벡터 양자화 화자적응기법을 사용한 한국어 단어 인식)

  • Choi, Kap-Seok
    • The Journal of the Acoustical Society of Korea
    • /
    • v.10 no.4
    • /
    • pp.27-37
    • /
    • 1991
  • This paper proposes the ESFVQ(energy subspace fuzzy vector quantization) that employs energy subspaces to reduce the quantizing distortion which is less than that of a fuzzy vector quatization. The ESFVQ is applied to a speaker adaptation method by which Korean words spoken by unknown speakers are recognized. By generating mapped codebooks with fuzzy histogram according to each energy subspace in the training procedure and by decoding a spoken word through the ESFVQ in the recognition proecedure, we attempt to improve the recognition rate. The performance of the ESFVQ is evaluated by measuring the quantizing distortion and the speaker adaptive recognition rate for DDD telephone area names uttered by 2 males and 1 female. The quatizing distortion of the ESFVQ is reduced by 22% than that of a vector quantization and by 5% than that of a fuzzy vector quantization, and the speaker adaptive recognition rate of the ESFVQ is increased by 26% than that without a speaker adaptation and by 11% than that of a vector quantization.

  • PDF

A Fast Inter Prediction Encoding Technique for Real-time Compression of H.264/AVC (H.264/AVC의 실시간 압축을 위한 고속 인터 예측 부호화 기술)

  • Kim, Young-Hyun;Choi, Hyun-Jun;Seo, Young-Ho;Kim, Dong-Wook
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.11C
    • /
    • pp.1077-1084
    • /
    • 2006
  • This paper proposed a fast algorithm to reduce the amount of calculation for inter prediction which takes a great deal of the operational time in H.264/AVC. This algorithm decides a search range according to the direction of predicted motion vector, and then performs an adaptive spiral search for the candidates with JM(Joint Model) FME(Fast Motion Estimation) which employs the rate-distortion optimization(RDO) method. Simultaneously, it decides a threshold cost value for each of the variable block sizes and performs the motion estimation for the variable search ranges with the threshold. These activities reduce the great amount of the complexity in inter prediction encoding. Experimental results by applying the proposed method .to various video sequences showed that the process time was decreased up to 80% comparing to the previous prediction methods. The degradation of video quality was only from 0.05dB to 0.19dB and the compression ratio decreased as small as 0.58% in average. Therefore, we are sure that the proposed method is an efficient method for the fast inter prediction.

Complexity Reduction Method Using Inter-layer CU Depth Information for Scalable Video Coding Base on HEVC (계층 간 CU 깊이 예측을 이용한 HEVC SVC 고속 부호화 방법)

  • Jang, Hyeong-Moon;Nam, Jung-Hak;Sim, Dong-Gyu
    • Journal of Broadcast Engineering
    • /
    • v.17 no.5
    • /
    • pp.765-780
    • /
    • 2012
  • In this paper, we propose a fast mode decision method that determines the coding unit depth for enhancement layers to improve an encoding speed of a scalable video encoder based on HEVC. To decide the coding unit depth of the enhancement layer, firstly, the coding unit depth of the corresponded coding unit in the basement layer is employed. At this stage, the final CU depth is decided by calculating the rate-distortion costs of one lower depth to one upper depth of the referenced depth. The proposed method can reduce a computational load since it does not calculate the rate-distortion costs for all the depths of a target CU. We found that the proposed algorithm decreases encoding complexity of 26% with approximately 1.4% bit increment, compared with the simulcast encoder of the HM 4.0.

Equal Bit Rate Control for Low Bit-rate Coder based on Frame Statistics (저 전송률 부호화기를 위한 프레임 특성에 근간한 균등 비트 할당 기법)

  • Seo Dong-Wan;Choe Yoon-Sik
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.6 no.4
    • /
    • pp.176-181
    • /
    • 2005
  • This paper presents an equal bit rate control algorithm utilizing the statistical change between the previous frame and the current frame. The previous studies on the model-based rate control have focused on the models of bit rate and distortion in types of coders, in terms of the quantization parameter. The proposed algorithm improves the typical model-based rate control by updating a model parameter instead of modeling a better model of the rate and distortion. The proposed algorithm updates this model parameter by recognizing the change in statistics between the previous frame and the current frame. We implement the proposed algorithm in MPEG-4 coders and verify its performance while comparing it to the TMN8's approach (up to 0.6dB of improvement).

  • PDF

Prosody Control of the Synthetic Speech using Sampling Rate Conversion (표본화율 변환을 이용한 합성음의 운율제어)

  • 이현구;홍광석
    • Proceedings of the IEEK Conference
    • /
    • 1999.11a
    • /
    • pp.676-679
    • /
    • 1999
  • In this paper, we presents a method to control prosody of the synthetic speech using sampling rate conversion technique. In prosody control, the conventional methods perform overlap and add. So the synthetic speech has a distortion and the voice quality is not satisfied. Using sampling rate conversion technique, we can get high Qualify of the synthetic speech. Also we can control various talking speeds according to speaker's patterns.

  • PDF

FRF Distortion Caused by Exponential Window Function on Impact Hammer Testing and Its Solution (지수창함수를 사용한 임팩트햄머 실험에서 주파수응답함수의 왜곡과 개선책)

  • 안세진;정의봉
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.13 no.5
    • /
    • pp.334-340
    • /
    • 2003
  • Exponential window function Is widely used In impact hammer testing to reduce leakage error as well as to get a good S/N ratio. The larger its decaying rate is, the more effectively the leakage errors are reduced. But if the decay rate of the exponential window is too large, the FRF is distorted. And the modal parameters of the system can not be exactly identified by modal analysis technique. Therefore, it is a difficult problem to determine proper decay rate in impact hammer testing. In this paper, amount of the FRF distortion caused by exponential window is theoretically uncovered. A new circle fitting method is also proposed so that the modal parameters are directly extracted from impulse response spectrum distorted by the exponential-windowed impulse response data. The results by the conventional and proposed circle fitting method are compared through a numerical example.

Adaptive rate control for video communication (동영상 통신을 위한 적응 비트율 제어)

  • 김학수;정연식
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.24 no.9A
    • /
    • pp.1383-1391
    • /
    • 1999
  • This paper presents a rate control method that minimizes global distortion under given target bit rates for video communication. This method makes the quality of reconstructed images better than that of the conventional ones based on R-D model at the same bit rates. Given a set of quantizers, a sequence of macroblocks to be quantized selects the optimal quantizer for each macroblock so that the total cost measure is minimized and the finite buffer is never in overflow. To solve this problem we provide a heuristic algorithm based on Lagrangian optimization using an operational rate-distortion framework and a quantization method follows H.263recommendation.

  • PDF

Adaptive Coding Mode Decision Algorithm using Motion Vector Map in H.264/AVC Video Coding (H.264/AVC 부호기에서 움직임 벡터 맵을 이용한 적응적인 부호화 모드 결정 방법)

  • Kim, Tae-Jung;Ko, Man-Geun;Suh, Jae-Won
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.46 no.2
    • /
    • pp.48-56
    • /
    • 2009
  • We propose a fast intra mode skip decision algorithm for H.264/AVC video encoding. Although newly added MB encoding algorithms based on various prediction methods increase compression ratio, they require a significant increase in the computational complexity because we calculate rate-distortion(RD) cost for all possible MB coding modes and then choose the best one. In this paper, we propose a fast mode decision algorithm based on an adaptive motion vector map(AMVM) method for H.264/AVC video encoding to reduce the processing time for the inter frame. We verify that the proposed algorithm generates generally good performances in PSNR, bit rates, and processing time.