• Title/Summary/Keyword: Stereo Coding

Search Result 50, Processing Time 0.028 seconds

Study on novel hierarchical parametric stereo coding method for Multichannel audio signal (멀티채널 오디오 신호의 계층적 코딩이 가능한 파라메트릭 스테레오 코딩 방법에 대한 연구)

  • Moon, Han-Gil
    • Proceedings of the IEEK Conference
    • /
    • 2008.06a
    • /
    • pp.875-876
    • /
    • 2008
  • Parametric stereo coding is a technique to efficiently code a stereo audio signal as a monaural signal plus small amount of parametric overhead to describe the stereo image. The stereo properties are analyzed, encoded, and reinstated in a decoder according to spatial psycho-acoustical principles. However, coding of multichannel audio signal using parametric stereo still requires considerable bit-rate. In this paper, enhanced parametric stereo coding for multichannel audio signal is proposed.

  • PDF

Search of Optimal Contexts for Context-adaptive Coding of Stereo Parameters in Parametric Stereo of Enhanced aacPlus (Enhanced aacPlus의 Parametric Stereo에서 스테레오 파라미터의 컨텍스트 적응 코딩을 위한 최적 컨텍스트 탐색)

  • Pang, Hee-Suk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.31 no.7
    • /
    • pp.435-440
    • /
    • 2012
  • We propose optimal contexts for context-adaptive coding of stereo parameters in parametric stereo (PS) of enhanced aacPlus. For the quantized indexes of stereo parameters, 8 context candidates were proposed based on the index values and their combinations adjacent to a source index in the time-stereo band domain, where the time-stereo band region was further divided into 4 regions based on refresh/non-refresh frames and stereo bands. The optimal contexts for each region were proposed by experiments, which are expected to be used for context-adaptive coding of PS for improved performance.

Pilot-Based Coding Scheme for Parametric Stereo in Enhanced aacPlus

  • Pang, Hee-Suk
    • ETRI Journal
    • /
    • v.31 no.5
    • /
    • pp.613-615
    • /
    • 2009
  • We propose a pilot-based coding (PBC) scheme for lossless bit rate reduction of parametric stereo (PS) in enhanced aacPlus. It uses PBC in addition to the existing frequency and time differential coding to encode and decode PS parameter indexes. We also design optimal Huffman codebooks (HCBs) for PBC in the proposed scheme. Experiments show that the proposed scheme is superior to the original coding scheme, where both the new coding structure and the optimal HCBs contribute to the bit rate reduction.

Zerotree Entropy Based Coding of Stereo Video Sequences

  • Thanapirom, S.;Fernando, W.A.C.;Edirisinghe, E.A.
    • Proceedings of the IEEK Conference
    • /
    • 2002.07b
    • /
    • pp.908-911
    • /
    • 2002
  • Over the past 30 years, many efficient 2D video coding techniques have been presented and developed from many research centers for commercialization. However, direct application of these monocular compression schemes is not optimal for stereo video coding. In this paper, we present a new technique for coding stereo video sequences based on Discrete Wavelet Transform (DWT). The proposed technique exploits Zerotree Entropy Coding (ZTE) that makes use of the wavelet block concept to achieve low bit rate stereo video coding. The one of two image streams, called main stream, is independently coded by modified MPEG-4 encoder and the other stream, called auxiliary stream, is coded by predicting from its corresponding image, its previous image or its follow image.

  • PDF

Heterogeneous Resolution Stereo Video Coding System (이종 해상도 스테레오 비디오 코딩 시스템)

  • Park, Sea-Nae;Sim, Dong-Gyu
    • Journal of Broadcast Engineering
    • /
    • v.13 no.1
    • /
    • pp.162-173
    • /
    • 2008
  • In this paper, we propose an effective stereo-view video coding method that considers stereo-view and displayer characteristics. Current many stereo video displayers are designed for not only stereo display but also conventional single view display. In these systems, the resolution of two input videos for a stereo mode is half of that of single view for compatibility with conventional single view video services. In this raper, we propose a stereo video codec to deal with both single view and stereo view services by encoding whole left image and down-sampled right image. However, direct disparity estimation is not possible between two views because the resolution of a left image is different from that of the corresponding right image. So, we propose a disparity estimation method to make use of full information of the left reference image without down-sampling. In experimental result, we achieved $0.5{\sim}0.8\;dB$ coding gain, compared with several conventional algorithms.

Multiresolution Wavelet-Based Disparity Estimation for Stereo Image Compression

  • Tengcharoen, Chompoonuch;Varakulsiripunth, Ruttikorn
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2004.08a
    • /
    • pp.1098-1101
    • /
    • 2004
  • The ordinary stereo image of an object consists of data of left and right views. Therefore, the left and right image pairs have to be transmitted simultaneously in order to display 3-dimentional video at the remote site. However, due to the twice data in comparing with a monoscopic image of the same object, it needs to be compressed for fast transmission and resource saving. Hence, it needs an effective coding algorithm for compressing stereo image. It was found previously that compressing left and right frames independently will achieve the compression ratio lower than compressing by utilizing the spatial redundancy between both frames. Therefore, in this paper, we study the stereo image compression technique based on the multiresolution wavelet transform using varied disparity-block size for estimation and compensation. The size of disparity-block in the stereo pair subbands are scaling on a coarse-to-fine wavelet coefficients strategy. Finally, the reference left image and residual right image after disparity estimation and compensation are coded by using SPIHT coding. The considered method demonstrates good performance in both PSNR measures and visual quality for stereo image.

  • PDF

MPEG-2 AAC Encoder Implementation Using a floating-Point DSP (부동 소수점 DSP를 이용한 MPEG-2 AAC 부호차기 구현)

  • Kim Seung-Woo
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.7
    • /
    • pp.882-888
    • /
    • 2005
  • MPEG-2 Advanced Audio Coding (AAC) has already been standardized as a sophisticated next generation technology AAC provides an audio signal that has CD quality at 96-128kbps/stereo. This paper describes a high-quality and efficient software implementation of an MPEG-2 AAC LC Profile encoder. Common scalefactor and noisless coding are accelerated by $45\%$ and $27\%$, respectively, through the use of TMS320C30 instructions. The implemented encoder uses 7.5kWords of program memory, 18kWords of data ROM and 92kBytes of data RAM, respectively. The results of subjective Qualify test showed that the sound quality achieved at 96kbps/stereo was equivalent to that of MP3 at 128kbps/stereo.

  • PDF

A Balancing Method to improve efficiency of Stereo Coding (스테레오 코딩의 효율화를 위한 밸런싱 방법)

  • Kim, Jong-Su;Choi, Jong-Ho;Lee, Kang-Ho;Kim, Tae-Yong;Choi, Jong-Soo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.12 no.4
    • /
    • pp.87-94
    • /
    • 2007
  • Imbalances in focus, luminance and color between stereo Pairs could cause disparity vector estimation error and increment of transmission data. If the distribution of errors in residual image is large, it may influence to lowering of compression performance. Therefore, in this paper, we propose an efficient balancing method between stereo pairs to reduce the effect. For this, we registrated stereo images using a FFT based method to consider the pixels in the occluded region, we eliminated the pixels of blocks which has large error of disparity vector estimation in balancing function estimation. The balancing function has estimated using histogram specification, local information of target image and residual image between stereo images. Experiments show that the proposed method is effective in error distribution, PSNR and disparity vector estimation. We expect that our method can be improving compression efficiency in stereo coding system.

  • PDF

Audio Object Coding Standard Technology - MPEG SAOC (오디오 객체 부호화 표준 - MPEG SAOC)

  • Jung, Yang-Won;Oh, Hyen-O
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.7
    • /
    • pp.630-639
    • /
    • 2009
  • This paper introduces MPEG SAOC (Spatial Audio Object Coding) that has been standardized in MPEG audio subgroup. MPEG SAOC is a trendy parametric coding technology conceptually similar to PS (Parametric Stereo) and the MPEG Surround. SAOC especially parameterizes and codes the spatial information for the object signals comprising a downmixed audio scene and thus lets users render one's preferred scene in an interactive manner.

Intensity Compensation for Efficient Stereo Image Compression (효율적인 스테레오 영상 압축을 위한 밝기차 보상)

  • Jeon Youngtak;Jeon Byeungwoo
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.42 no.2 s.302
    • /
    • pp.101-112
    • /
    • 2005
  • As we perceive the world as 3-dimensional through our two eyes, we can extract 3-dimensional information from stereo images obtained from two or more cameras. Since stereo images have a large amount of data, with recent advances in digital video coding technology, efficient compression algorithms have been developed for stereo images. In order to compress stereo images and to obtain 3-D information such as depth, we find disparity vectors by using disparity estimation algorithm generally utilizing pixel differences between stereo pairs. However, it is not unusual to have stereo images having different intensity values for several reasons, such as incorrect control of the iris of each camera, disagreement of the foci of two cameras, orientation, position, and different characteristics of CCD (charge-coupled device) cameras, and so on. The intensity differences of stereo pairs often cause undesirable problems such as incorrect disparity vectors and consequent low coding efficiency. By compensating intensity differences between left and right images, we can obtain higher coding efficiency and hopefully reduce the perceptual burden of brain to combine different information incoming from two eyes. We propose several methods of intensity compensation such as local intensity compensation, global intensity compensation, and hierarchical intensity compensation as very simple and efficient preprocessing tool. Experimental results show that the proposed algerian provides significant improvement in coding efficiency.