• Title/Summary/Keyword: Parametric stereo

Study on novel hierarchical parametric stereo coding method for Multichannel audio signal (멀티채널 오디오 신호의 계층적 코딩이 가능한 파라메트릭 스테레오 코딩 방법에 대한 연구)

  • Moon, Han-Gil
    • Proceedings of the IEEK Conference / 2008.06a / pp.875-876 / 2008
  • Parametric stereo coding is a technique for efficiently coding a stereo audio signal as a monaural signal plus a small amount of parametric overhead that describes the stereo image. The stereo properties are analyzed, encoded, and reinstated in a decoder according to spatial psychoacoustic principles. However, coding a multichannel audio signal with parametric stereo still requires a considerable bit rate. In this paper, an enhanced parametric stereo coding method for multichannel audio signals is proposed.

Search of Optimal Contexts for Context-adaptive Coding of Stereo Parameters in Parametric Stereo of Enhanced aacPlus (Enhanced aacPlus의 Parametric Stereo에서 스테레오 파라미터의 컨텍스트 적응 코딩을 위한 최적 컨텍스트 탐색)

  • Pang, Hee-Suk
    • The Journal of the Acoustical Society of Korea / v.31 no.7 / pp.435-440 / 2012
  • We propose optimal contexts for context-adaptive coding of stereo parameters in parametric stereo (PS) of enhanced aacPlus. For the quantized indexes of stereo parameters, 8 context candidates were proposed based on the index values, and their combinations, adjacent to a source index in the time-stereo band domain, where the time-stereo band region was further divided into 4 regions based on refresh/non-refresh frames and stereo bands. The optimal contexts for each region were selected through experiments and are expected to improve the performance of context-adaptive coding of PS.

Pilot-Based Coding Scheme for Parametric Stereo in Enhanced aacPlus

  • Pang, Hee-Suk
    • ETRI Journal / v.31 no.5 / pp.613-615 / 2009
  • We propose a pilot-based coding (PBC) scheme for lossless bit rate reduction of parametric stereo (PS) in enhanced aacPlus. It uses PBC in addition to the existing frequency and time differential coding to encode and decode PS parameter indexes. We also design optimal Huffman codebooks (HCBs) for PBC in the proposed scheme. Experiments show that the proposed scheme is superior to the original coding scheme, where both the new coding structure and the optimal HCBs contribute to the bit rate reduction.

Audio Object Coding Standard Technology - MPEG SAOC (오디오 객체 부호화 표준 - MPEG SAOC)

  • Jung, Yang-Won;Oh, Hyen-O
    • The Journal of the Acoustical Society of Korea / v.28 no.7 / pp.630-639 / 2009
  • This paper introduces MPEG SAOC (Spatial Audio Object Coding), which has been standardized by the MPEG audio subgroup. MPEG SAOC is a parametric coding technology conceptually similar to parametric stereo (PS) and MPEG Surround. In particular, SAOC parameterizes and codes the spatial information of the object signals that make up a downmixed audio scene, and thus lets users render their preferred scene interactively.

Improved Phase Synthesis for Parametric Stereo Audio Coding (파라메트릭 스테레오 오디오 부호화를 위한 향상된 위상 합성 기법)

  • Hyun, Dong-Il;Park, Young-Cheol;Youn, Dae Hee
    • Journal of the Institute of Electronics and Information Engineers / v.50 no.12 / pp.184-190 / 2013
  • Parametric stereo (PS) audio coding is a specific version of spatial audio coding. This paper addresses a problem caused by the conventional synthesis of phase differences. In the conventional upmix matrix, phase differences are synthesized not only on the downmix signal but also on the ambient signal, which violates the assumption that the ambient signals are anti-phased. The deterioration caused by this phase synthesis is analyzed, especially for low interchannel correlation. To solve this problem, a new upmix matrix is proposed that synthesizes phase differences only on the downmix signal. The performance of the proposed upmix matrix is verified by subjective listening tests.

Channel Expansion Technology in MPEG Audio (MPEG 오디오의 채널 확장 기술)

  • Pang, Hee-Suk
    • Journal of Broadcast Engineering / v.16 no.5 / pp.714-721 / 2011
  • MPEG audio uses the masking effect, high-frequency component synthesis based on spectral band replication, and channel expansion based on parametric stereo for efficient compression of audio signals. In this paper, we present an overview of the state-of-the-art channel expansion technology in MPEG audio. We also present technical overviews of, and examples of application to broadcasting services for, HE-AAC v2, MPEG Surround, spatial audio object coding (SAOC), and unified speech and audio coding (USAC), which are MPEG audio codecs based on the channel expansion technology.

Robust Primary-ambient Signal Decomposition Method using Principal Component Analysis with Phase Alignment (위상 정렬을 이용한 주성분 분석법의 강인한 스테레오 음원 분리 성능유지 기법)

  • Baek, Yong-Hyun;Hyun, Dong-Il;Park, Young-Cheol
    • Journal of Broadcast Engineering / v.19 no.1 / pp.64-74 / 2014
  • Primary-ambient signal decomposition of a stereo sound is a key step in stereo upmixing. Principal component analysis (PCA) is one of the most widely used methods of primary-ambient signal decomposition. However, previous PCA-based decomposition algorithms assume that stereo sound sources are only amplitude-panned, without any consideration of phase difference. As a result, performance degrades for live-recorded stereo sound. In this paper, we propose a new PCA-based stereo decomposition algorithm that takes into account the phase difference between the channel signals. The proposed algorithm overcomes the limitation of the conventional signal model by using PCA with phase alignment. The phase alignment is realized using the inter-channel phase difference (IPD), which is widely used in parametric stereo coding. Moreover, Enhanced Modified PCA (EMPCA) is combined with it to solve the problems of conventional PCA caused by its dependency on the primary-to-ambient energy ratio (PAR) and the panning angle. Simulation results are presented to show the improvements of the proposed algorithm.

An Improved Synthesis Method of Parametric Stereo Coding Based on Tonality Information (토널리티 정보를 기반으로 한 파라메트릭 스테레오 부호화의 개선된 합성 기법)

  • Lee, Tung chin;Park, Young-Cheol;Youn, Dae Hee
    • Journal of the Institute of Electronics and Information Engineers / v.51 no.6 / pp.221-227 / 2014
  • In this paper, we propose a synthesis method that can effectively suppress the ambience that affects tonal components in the PS decoder. The ambience component is obtained using a decorrelation filter, and its weighting in the decoder is determined by the IC parameter. However, since the parameters are extracted in the sub-band domain, a low IC value may be estimated even when a tonal component is dominant, which may degrade the quality of the output signal. To prevent this problem, the tonality was measured in the downmixed signal and the weighting of the ambience components was adjusted according to the measured tonality index. The performance of the proposed method was evaluated by simulations. Furthermore, a subjective test was performed, and the results confirmed that the proposed method offers improved quality.

Improved Synthesis Method of Negative Inter-channel Correlation Parameter Based on Anti-phase Primary Component (반위상 주요성분에 기반을 둔 개선된 음수 채널간 상관도 파라미터 합성 기법)

  • Hyun, Dong-Il;Lee, Seok-Pil;Park, Young-Cheol;Youn, Dae-Hee
    • The Journal of the Acoustical Society of Korea / v.31 no.6 / pp.410-418 / 2012
  • Parametric stereo (PS) and MPEG Surround (MPS) are major spatial audio coding (SAC) tools. In this paper, the problem of inter-channel correlation (ICC) synthesis in conventional SAC is analyzed. Conventional methods assume that the ambient components mixed into the two output channels are anti-phased, while the primary components are assumed to be in phase. This assumption can cause excessive ambient mixing for a negative-valued ICC. As a remedy, we propose a new ICC synthesis method based on the assumption that the primary components are anti-phased with each other for a negative ICC. The proposed method is also applied to the approximation which works in practice. The performance of the proposed method was evaluated by computer simulations, and subjective listening tests verified that the proposed method is effective for both headphone and loudspeaker playback.

3D Building Reconstruction Using Building Model and Segment Measure Function (건물모델 및 선소측정함수를 이용한 건물의 3차원 복원)

  • Ye, Chul-Soo;Lee, Kwae-Hi
    • Journal of the Institute of Electronics Engineers of Korea SP / v.37 no.4 / pp.46-55 / 2000
  • This paper presents an algorithm for 3D building reconstruction from a pair of stereo aerial images using a 3D building model and the linear segments of buildings. Direct extraction of linear segments from the original building images using a parametric building model is attempted instead of employing conventional procedures such as edge detection, linear approximation, and line linking. A segment measure function is applied simultaneously to each extracted line segment in order to improve the accuracy of building detection compared to individual linear segment detection. The algorithm has been applied to pairs of stereo aerial images, and the results showed accurate detection and reconstruction of buildings.