• Title/Summary/Keyword: Vector Quantization(VQ)

Search Result 129, Processing Time 0.024 seconds

Comparison of Vector Quantization for Image Coding (영상 코딩을 위한 벡터 양자화 방법의 성능 비교)

  • 박광훈;박용철;차일환;윤대희
    • Proceedings of the Korean Institute of Communication Sciences Conference
    • /
    • 1987.04a
    • /
    • pp.35-38
    • /
    • 1987
  • The purpose of this paper is to compare a class of vector quantization techniques which include GVQ(Genera VQ) MSVQ(Mean separated VQ) and DCT_VQ The VQ techniques are applied to six images and both subjective and objective performance comparison are made The results indicate that the transform domain approach(DCT_VQ) yields more syable results than the spatial domain method (GVQ, MSVQ)

  • PDF

Fast Codebook Search for Vector Quantization in Image Coding (영상 부호화를 위한 벡터 양자화기에서의 고속 탐색 기법)

  • 고종석;김재균
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.13 no.4
    • /
    • pp.302-308
    • /
    • 1988
  • The paper describes a very simple algorithm for reducing the encoding complexity of vector quantization(VQ), exploiting the feature of a vector currently being encoded. A proposed VQ of 16(=4x4) vector dimension shows a slight performance degradation of about 0.1-1.9dB, however, with only 16-32 among 256 codeword searches, i.e., with just 1/16-1/8 search complexity compared to a full-search VQ. And the proposed VQ scheme is also compared to outperform tree-search VQ with regard to their SNR performance and memory requirement.

  • PDF

Sample-Adaptive Product Quantization and Design Algorithm (표본 적응 프러덕트 양자화와 설계 알고리즘)

  • 김동식;박섭형
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.24 no.12B
    • /
    • pp.2391-2400
    • /
    • 1999
  • Vector quantizer (VQ) is an efficient data compression technique for low bit rate applications. However, the major disadvantage of VQ is its encoding complexity which increases dramatically as the vector dimension and bit rate increase. Even though one can use a modified VQ to reduce the encoding complexity, it is nearly impossible to implement such a VQ at a high bit rate or for a large vector dimension because of the enormously large memory requirement for the codebook and the very large training sequence (TS) size. To overcome this difficulty, in this paper we propose a novel structurally constrained VQ for the high bit rate and the large vector dimension cases in order to obtain VQ-level performance. Furthermore, this VQ can be extended to the low bit rate applications. The proposed quantization scheme has a form of feed-forward adaptive quantizer with a short adaptation period. Hence, we call this quantization scheme sample-adaptive product quantizer (SAPQ). SAPQ can provide a 2 ~3dB improvement over the Lloyd-Max scalar quantizers.

  • PDF

A Simple Algorithm for Fast Codebook Search in Image Vector Quantization (벡터 양자화에서 벡터의 특성을 이용한 단축 탐색방법)

  • Koh, Jong-Seog;Kim, Jae-Kyoon;Kim, Seong-Dae
    • Proceedings of the KIEE Conference
    • /
    • 1987.07b
    • /
    • pp.1434-1437
    • /
    • 1987
  • We present a very simple algorithm for reducing the encoding (codebook search) complexity of vector quantization (VQ), exploiting some features of a vector currently being encoded. A proposed VQ of 16 (=$4{\times}4$) vector dimension and 256 codewords shows a slight performance degradation of about 0.1-0.9 dB, however, with only 16 or 32 among 256 codeword searches, i.e., with just 1/16 or 1/8 search complexity compared to a full-search VQ. And the proposed VQ scheme is also compared to and shown to be a bit superior to tree-search VQ with regard to their SNR performance and memory requirement.

  • PDF

STRUCTURED CODEWORD SEARCH FOR VECTOR QUANTIZATION (백터양자화가의 구조적 코더 찾기)

  • 우홍체
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2000.11a
    • /
    • pp.467-470
    • /
    • 2000
  • Vector quantization (VQ) is widely used in many high-quality and high-rate data compression applications such as speech coding, audio coding, image coding and video coding. When the size of a VQ codebook is large, the computational complexity for the full codeword search method is a significant problem for many applications. A number of complexity reduction algorithms have been proposed and investigated using such properties of the codebook as the triangle inequality. This paper proposes a new structured VQ search algorithm that is based on a multi-stage structure for searching for the best codeword. Even using only two stages, a significant complexity reduction can be obtained without any loss of quality.

  • PDF

Vector Quantization by N-ary Search of a Codebook (코우드북의 절충탐색에 의한 벡터양자화)

  • Lee, Chang-Young
    • Speech Sciences
    • /
    • v.8 no.3
    • /
    • pp.143-148
    • /
    • 2001
  • We propose a new scheme for VQ codebook search. The procedure is in between the binary-tree-search and full-search and thus might be called N-ary search of a codebook. Through the experiment performed on 7200 frames spoken by 25 speakers, we confirmed that the best codewords as good as by the full-search were obtained at moderate time consumption comparable to the binary-tree-search. In application to speech recognition by HMM/VQ with Bakis model, where appearance of a specific codeword is essential in the parameter training phase, the method proposed here is expected to provide an efficient training procedure.

  • PDF

Design of the Vector-Scalar Quantizer of LSP Parameters for Wideband Speech Coder (광대역 음성부호화기를 위한 백터-스칼라 LSP 파라미터 양자화기 설계)

  • 신재현;이인성;지덕구;윤병식;최송인
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.40 no.4
    • /
    • pp.286-291
    • /
    • 2003
  • In this Paper, we designed an LSP(Line Spectral Pairs) parameter quantizer with cascaded structure of vector quantizer and scalar quantizer for the wideband speech coder. We have chosen the 16th-order of the LP coefficients. These coefficients are then transformed into the LSP parameters which have the excellent properties for quantization and easy stability checking condition of synthesis filter. In the first stage of quantization, input LSP parameters are split-vector-quantized using two 8-th order codebooks. In the second stage, the components of residual vector are individually quantized by the scalar quantizer utilizing the ordering property of LSP parameters. The designed adaptive VQ-SQ quantizer using 35 bits/frame shows the wideband transparency that the average spectral distortion should be less than 1.6 ㏈ and less than 4% of the frames should have SD above 3 ㏈. The simulation results show that the designed quantizer provides a 2-3 bits/frame saving over the typical vector-scalar quantizer.

A study on the application of residual vector quantization for vector quantized-variational autoencoder-based foley sound generation model (벡터 양자화 변분 오토인코더 기반의 폴리 음향 생성 모델을 위한 잔여 벡터 양자화 적용 연구)

  • Seokjin Lee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.43 no.2
    • /
    • pp.243-252
    • /
    • 2024
  • Among the Foley sound generation models that have recently begun to be studied, a sound generation technique using the Vector Quantized-Variational AutoEncoder (VQ-VAE) structure and generation model such as Pixelsnail are one of the important research subjects. On the other hand, in the field of deep learning-based acoustic signal compression, residual vector quantization technology is reported to be more suitable than the conventional VQ-VAE structure. Therefore, in this paper, we aim to study whether residual vector quantization technology can be effectively applied to the Foley sound generation. In order to tackle the problem, this paper applies the residual vector quantization technique to the conventional VQ-VAE-based Foley sound generation model, and in particular, derives a model that is compatible with the existing models such as Pixelsnail and does not increase computational resource consumption. In order to evaluate the model, an experiment was conducted using DCASE2023 Task7 data. The results show that the proposed model enhances about 0.3 of the Fréchet audio distance. Unfortunately, the performance enhancement was limited, which is believed to be due to the decrease in the resolution of time-frequency domains in order to do not increase consumption of the computational resources.

Quantization of LPC Coefficients Using a Multi-frame AR-model (Multi-frame AR model을 이용한 LPC 계수 양자화)

  • Jung, Won-Jin;Kim, Moo-Young
    • The Journal of the Acoustical Society of Korea
    • /
    • v.31 no.2
    • /
    • pp.93-99
    • /
    • 2012
  • For speech coding, a vocal tract is modeled using Linear Predictive Coding (LPC) coefficients. The LPC coefficients are typically transformed to Line Spectral Frequency (LSF) parameters which are advantageous for linear interpolation and quantization. If multidimensional LSF data are quantized directly using Vector-Quantization (VQ), high rate-distortion performance can be obtained by fully utilizing intra-frame correlation. In practice, since this direct VQ system cannot be used due to high computational complexity and memory requirement, Split VQ (SVQ) is used where a multidimensional vector is split into multilple sub-vectors for quantization. The LSF parameters also have high inter-frame correlation, and thus Predictive SVQ (PSVQ) is utilized. PSVQ provides better rate-distortion performance than SVQ. In this paper, to implement the optimal predictors in PSVQ for voice storage devices, we propose Multi-Frame AR-model based SVQ (MF-AR-SVQ) that considers the inter-frame correlations with multiple previous frames. Compared with conventional PSVQ, the proposed MF-AR-SVQ provides 1 bit gain in terms of spectral distortion without significant increase in complexity and memory requirement.

Bitrate Reduction in Vector Quantization System Using a Dynamic Index Mapping (동적 인텍스 매핑을 이용한 벡터 양자화 시스템에서의 비트율 감축)

  • 이승준;양경호;김철우;이충웅
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.32B no.8
    • /
    • pp.1091-1098
    • /
    • 1995
  • This paper proposes an efficient noiseless encoding method of vector quantization(VQ) index using a dynamic index mapping. Using high interblock correlation, the proposed index mapper transforms an index into a new one with lower entropy. In order to achieve good performance with low computational complexity, we adopt 'the sum of differences in pixel values on the block boundaries' as the cost function for index mapping. Simulation results show that the proposed scheme reduces the average bitrate by 40 - 50 % in ordinary VQ system for image compression. In addition, it is shown that the proposed index mapping method can be also applied to mean-residual VQ system, which allows the reduction of bitrate for VQ index by 20 - 30 %(10 - 20 % reduction in total bitrate). Since the proposed scheme is one for noiseless encoding of VQ index, it provides the same quality of the reconstructed image as the conventional VQ system.

  • PDF