• Title/Summary/Keyword: Enhanced aacPlus

Search Result 5, Processing Time 0.018 seconds

Pilot-Based Coding Scheme for Parametric Stereo in Enhanced aacPlus

  • Pang, Hee-Suk
    • ETRI Journal
    • /
    • v.31 no.5
    • /
    • pp.613-615
    • /
    • 2009
  • We propose a pilot-based coding (PBC) scheme for lossless bit rate reduction of parametric stereo (PS) in enhanced aacPlus. It uses PBC in addition to the existing frequency and time differential coding to encode and decode PS parameter indexes. We also design optimal Huffman codebooks (HCBs) for PBC in the proposed scheme. Experiments show that the proposed scheme is superior to the original coding scheme, where both the new coding structure and the optimal HCBs contribute to the bit rate reduction.

Bit Rate Reduction of Enhanced aacPlus by Arithmetic Coding (Arithmetic Coding을 통한 Enhanced aacPlus의 비트율 감소)

  • Ku, Ja-Seong;Ham, Woo-Gyu;Kim, Ki-Jun;Kang, Kyeongok;Park, Hochong
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2013.06a
    • /
    • pp.3-5
    • /
    • 2013
  • 본 논문에서는 enhanced aacPlus 부호화기의 스펙트럼 계수 무손실 부호화에 arithmetic coding을 적용하여 비트율을 감소시키는 방법을 연구하였다. USAC의 arithmetic coding을 enhanced aacPlus 구조에 맞게 변경하여 적용하였다. 기존 방법과 arithmetic coding 방법에 의한 부호화 비트 수를 비교하여 성능을 평가하였고, 모노 신호에서 최대 9.3%, 스테레오 신호에서 최대 6.6%의 비트 감소율을 확인하였다.

  • PDF

Search of Optimal Contexts for Context-adaptive Coding of Stereo Parameters in Parametric Stereo of Enhanced aacPlus (Enhanced aacPlus의 Parametric Stereo에서 스테레오 파라미터의 컨텍스트 적응 코딩을 위한 최적 컨텍스트 탐색)

  • Pang, Hee-Suk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.31 no.7
    • /
    • pp.435-440
    • /
    • 2012
  • We propose optimal contexts for context-adaptive coding of stereo parameters in parametric stereo (PS) of enhanced aacPlus. For the quantized indexes of stereo parameters, 8 context candidates were proposed based on the index values and their combinations adjacent to a source index in the time-stereo band domain, where the time-stereo band region was further divided into 4 regions based on refresh/non-refresh frames and stereo bands. The optimal contexts for each region were proposed by experiments, which are expected to be used for context-adaptive coding of PS for improved performance.

Enhanced Spectral Hole Substitution for Improving Speech Quality in Low Bit-Rate Audio Coding

  • Lee, Chang-Heon;Kang, Hong-Goo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.3E
    • /
    • pp.131-139
    • /
    • 2010
  • This paper proposes a novel spectral hole substitution technique for low bit-rate audio coding. The spectral holes frequently occurring in relatively weak energy bands due to zero bit quantization result in severe quality degradation, especially for harmonic signals such as speech vowels. The enhanced aacPlus (EAAC) audio codec artificially adjusts the minimum signal-to-mask ratio (SMR) to reduce the number of spectral holes, but it still produces noisy sound. The proposed method selectively predicts the spectral shapes of hole bands using either intra-band correlation, i.e. harmonically related coefficients nearby or inter-band correlation, i.e. previous frames. For the bands that have low prediction gain, only the energy term is quantized and spectral shapes are replaced by pseudo random values in the decoding stage. To minimize perceptual distortion caused by spectral mismatching, the criterion of the just noticeable level difference (JNLD) and spectral similarity between original and predicted shapes are adopted for quantizing the energy term. Simulation results show that the proposed method implemented into the EAAC baseline coder significantly improves speech quality at low bit-rates while keeping equivalent quality for mixed and music contents.

Enhanced Adjustment Strategy of Masking Threshold for Speech Signals in Low Bit-Rate Audio Coding (저전송률 오디오 부호화에서 음성 신호의 성능 개선을 위한 마스킹 임계값 적응기법 향상)

  • Lee, Chang-Heon;Kang, Hong-Goo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.1
    • /
    • pp.62-68
    • /
    • 2010
  • This paper proposes a new masking threshold adjustment strategy to improve the performance for speech signals in low bit-rate audio coding. After determining formant regions, the masking threshold is adjusted by using the energy ratio of each sub-band to the average energy of each formant. More quantization noises are added to the bands that have relatively large energy, but less distortion is allowed in spectral valley regions by allocating more bits, which reflects the concept of perceptual weighting widely used in speech coding. From the results of objective speech quality measure, we verified that the proposed method improves quality for the speech input signals compared to the conventional one.