• Title/Summary/Keyword: spatial audio coding

An efficient method of spatial cues and compensation method of spectrums on multichannel spatial audio coding (멀티채널 Spatial Audio Coding에서의 효율적인 Spatial Cues 사용과 그에 따른 Spectrum 보상방법)

  • Lee, Byong-Hwa;Beack, Seung-Kwon;Seo, Jeong-Gil;Han, Min-Soo
    • MALSORI / no.53 / pp.157-169 / 2005
  • This paper proposes an efficient method of representing spatial cues in multichannel spatial audio coding. The recently introduced Binaural Cue Coding (BCC) method represents multichannel audio signals by means of the Inter Channel Level Difference (ICLD) or the Source Index (SI). We express the ICLD and SI information more efficiently based on the Inter Channel Correlation (ICC). We adopt different spatial cues according to the ICC and propose a method for compensating the empty spectra created by using SI. We performed a MOS test and measured the spectral distortion. The results show that the proposed method can reduce the bitrate of the side information without large degradation of the audio quality.

Audio Object Coding Standard Technology - MPEG SAOC (오디오 객체 부호화 표준 - MPEG SAOC)

  • Jung, Yang-Won;Oh, Hyen-O
    • The Journal of the Acoustical Society of Korea / v.28 no.7 / pp.630-639 / 2009
  • This paper introduces MPEG SAOC (Spatial Audio Object Coding), which has been standardized by the MPEG audio subgroup. MPEG SAOC is a recent parametric coding technology conceptually similar to PS (Parametric Stereo) and MPEG Surround. SAOC parameterizes and codes the spatial information of the object signals that make up a downmixed audio scene, and thus lets users render their preferred scene interactively.

Object Audio Coding Standard SAOC Technology and Application (객체 오디오 부호화 표준 SAOC 기술 및 응용)

  • Oh, Hyen-O;Jung, Yang-Won
    • Journal of the Institute of Electronics Engineers of Korea SP / v.47 no.5 / pp.45-55 / 2010
  • Object-based audio coding technology has attracted interest because it is expected to apply to a wide range of areas. Recently, ISO/IEC MPEG standardized a parametric object audio coding method, SAOC (Spatial Audio Object Coding). This paper introduces parametric object audio coding techniques with a special focus on MPEG SAOC and describes several issues, and their solutions, that should be considered for its successful application.

Evaluation of Spatial Audio Coding Tools for Multichannel Audio (Spatial Audio Coding 기술의 멀티채널 부호화 성능 비교)

  • Jang Inseon;Seo Jeongil;Mun Hangil;Kang Kyeongok
    • Proceedings of the Acoustical Society of Korea Conference / autumn / pp.153-156 / 2004
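  • Spatial Audio Coding (SAC) is a technology proposed for transmitting multichannel/multi-object audio signals at low bandwidth. This paper describes the Multi-Stimulus test with Hidden Reference and Anchor (MUSHRA) procedure that MPEG adopted as the evaluation method for SAC technologies. It also reports listening tests on the SAC technologies proposed by four organizations at the 69th MPEG meeting and analyzes the results.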

Angle-Based Virtual Source Location Representation for Spatial Audio Coding

  • Beack, Seung-Kwon;Seo, Jeong-Il;Moon, Han-Gil;Kang, Kyeong-Ok;Hahn, Min-Soo
    • ETRI Journal / v.28 no.2 / pp.219-222 / 2006
  • Virtual source location information (VSLI) has been newly utilized as a spatial cue for compact representation of multichannel audio. This information is represented as the azimuth of the virtual source vector. The superiority of VSLI is confirmed by comparison of the spectral distances, average bit rates, and subjective assessment with a conventional cue.

Channel Expansion Technology in MPEG Audio (MPEG 오디오의 채널 확장 기술)

  • Pang, Hee-Suk
    • Journal of Broadcast Engineering / v.16 no.5 / pp.714-721 / 2011
  • MPEG audio uses the masking effect, high-frequency component synthesis based on spectral band replication, and channel expansion based on parametric stereo for efficient compression of audio signals. In this paper, we present an overview of the state-of-the-art channel expansion technology in MPEG audio. We also present technical overviews and examples of application to broadcasting services for HE-AAC v2, MPEG Surround, spatial audio object coding (SAOC), and unified speech and audio coding (USAC), which are MPEG audio codecs based on channel expansion technology.

MPEG Surround for Multi-Channel Audio Coding-Part 1: Basic Structure (다채널 오디오 코딩을 위한 MPEG Surround-1부: 기본 구조)

  • Pang, Hee-Suk
    • The Journal of the Acoustical Society of Korea / v.28 no.7 / pp.599-609 / 2009
  • An overview of the recently finalized multi-channel audio coding standard MPEG Surround is provided. In its encoding process, this standard downmixes multi-channel signals to mono or stereo signals and, simultaneously, extracts spatial parameters. In its decoding process, it reconstructs the multi-channel signals from the downmix signals and the spatial parameters. Since the downmix signals are coded in a conventional audio coding format such as AAC or MP3 and the spatial parameters require only a small amount of information, MPEG Surround delivers high-quality multi-channel audio at low bit rates. It is also backward-compatible with conventional audio coding techniques, because the downmix signals can be played on portable audio devices that ignore the spatial parameter information. Part 1 presents an overview of the basic structure of MPEG Surround, and Part 2 describes various modes and tools, including the binaural mode, which supports virtual 5.1-channel playback via headphones or earphones. Listening test results from various companies and organizations are also presented.
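
The downmix-plus-parameters idea in the abstract above can be illustrated with a toy sketch. This is not MPEG Surround itself: it assumes a single stereo frame, a plain average as the mono downmix, and one broadband channel level difference as the only spatial parameter. A real codec works per time/frequency band and also transmits correlation cues. The function names `encode` and `decode` are hypothetical.

```python
import math

def encode(left, right):
    """Downmix a stereo frame to mono and extract one level-difference parameter (dB)."""
    mono = [(l + r) / 2.0 for l, r in zip(left, right)]
    e_left = sum(l * l for l in left)
    e_right = sum(r * r for r in right)
    eps = 1e-12  # keeps the ratio finite when a channel is silent
    cld_db = 10.0 * math.log10((e_left + eps) / (e_right + eps))
    return mono, cld_db

def decode(mono, cld_db):
    """Rebuild two channels from the mono downmix using only the level difference."""
    ratio = 10.0 ** (cld_db / 20.0)       # amplitude ratio left/right
    g_left = 2.0 * ratio / (1.0 + ratio)  # gains chosen so (L + R) / 2 equals the downmix
    g_right = 2.0 / (1.0 + ratio)
    return [g_left * m for m in mono], [g_right * m for m in mono]

# Toy frame: the right channel is the left channel at half amplitude.
left = [0.8, -0.4, 0.6, -0.2]
right = [0.4, -0.2, 0.3, -0.1]
mono, cld_db = encode(left, right)
left_hat, right_hat = decode(mono, cld_db)
```

Because the two toy channels are scaled copies of each other, the level difference alone is enough to recover them; channels that differ in more than level would be reconstructed only approximately, which is why practical schemes add further cues.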

Multi-channel Audio Service in a Terrestrial-DMB System Using VSLI-Based Spatial Audio Coding

  • Seo, Jeong-Il;Moon, Han-Gil;Beack, Seung-Kwon;Kang, Kyeong-Ok;Hong, Jae-Keun
    • ETRI Journal / v.27 no.5 / pp.635-638 / 2005
  • Spatial audio coding (SAC) is a highly compact representation of encoded multi-channel audio material. This paper suggests a multi-channel audio service in the terrestrial digital multimedia broadcasting (T-DMB) system using a novel SAC tool based on virtual source location information (VSLI). Intensive experiments are presented to evaluate the validity of the proposed VSLI-based SAC tool, and prototype systems are presented to demonstrate the reliability of the proposed multi-channel T-DMB system in real applications.

The Development of audio codec using binaural cue coding technologies (Binaural Cue Coding 기술을 이용한 오디오 코덱 구현)

  • Seo Jeongil;Kang Kyeongok;Lee Byonghwa;Hahn Minsoo
    • Proceedings of the Acoustical Society of Korea Conference / spring / pp.137-140 / 2004
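  • Spatial Audio Coding, newly proposed for transmitting multichannel, multi-object audio signals at low bandwidth, is a parametric compression method that downmixes the multichannel audio signal and represents the remaining channels compactly as parameters describing positional information in the acoustic space. In this paper, we implement a stereo audio codec using BCC, one of the Spatial Audio Coding technologies, and confirm through subjective listening tests that it achieves a high compression ratio while performing similarly to AAC.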

An Efficient Time-Frequency Representation for Parametric-Based Audio Object Coding

  • Beack, Seung-Kwon;Lee, Tae-Jin;Kim, Min-Je;Kang, Kyeong-Ok
    • ETRI Journal / v.33 no.6 / pp.945-948 / 2011
  • Object-based audio coding can provide new music applications with interactivity. To efficiently compress many target audio objects, a subband-based parametric coding scheme has been adopted for MPEG spatial audio object coding. In this letter, the time-frequency (T/F) subband analysis structure is investigated. A reconfigured T/F structure is also proposed to improve the generation of sound scenes such as 'karaoke' and 'solo' play in interactive music scenarios. The experimental results confirm that the proposed scheme remarkably improves the SNR and sound quality.