• Title/Summary/Keyword: Audio coding

Search Result 214, Processing Time 0.021 seconds

An Efficient Time-Frequency Representation for Parametric-Based Audio Object Coding

  • Beack, Seung-Kwon;Lee, Tae-Jin;Kim, Min-Je;Kang, Kyeong-Ok
    • ETRI Journal
    • /
    • v.33 no.6
    • /
    • pp.945-948
    • /
    • 2011
  • Object-based audio coding can provide new music applications with interactivity. To efficiently compress a lot of target audio objects, a subband-based parametric coding scheme has been adopted for MPEG spatial audio object coding. In this letter, the time-frequency (T/F) subband analysis structure is investigated. A reconfigured T/F structure is also proposed to enhance the generating performance of sound scenes such as 'karaoke' and 'solo' play in interactive music scenarios. From the experimental results, it was confirmed that the proposed scheme remarkably improves the SNR and sound quality.

An Integer DCT Algorithm for Lossless Audio Coding (무손실 음향부호화를 위한 정수 DCT실현기법)

  • Shin, Jae-Ho;Park, Se-Hyoung
    • Journal of The Institute of Information and Telecommunication Facilities Engineering
    • /
    • v.5 no.1
    • /
    • pp.1-11
    • /
    • 2006
  • Lifting scheme based integer transforms provides very useful properties on the multimedia coding. An integer transform outputs the integer form when the input has integer value. This doesn't produce quantization errors on coding, so integer transforms are adequate to lossless coding, In this paper, we present an integer DCT algorithm which is able to transform audio signal with longer length. Also the proposed method can be easily implemented recursively even though input is long time. We present the method to overcome the poor approximation which is produced by recursive lifting step. And we have applied the proposed integer DCT to lossless audio coding.

  • PDF

Evaluation of Spatial Audio Coding Tools for Multichannel Audio (Spatial Audio Coding 기술의 멀티채널 부호화 성능 비교)

  • Jang Inseon;Seo Jeongil;Mun Hangil;Kang Kyeongok
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.153-156
    • /
    • 2004
  • Spatial Audio Coding (SAC)은 낮은 대역폭에서 다채널/다객체 오디오 신호를 전송하기 위해 제안된 기술이다. 본 논문에서는 MPEG 에서 SAC 기술의 평가 방법으로 채택된 Multi-Stimulus test with Hidden Reference and Anchor (MUSHRA) 실험 절차에 대해서 설명한다. 또한 제 69 차 MPEG 회의에서 제안된 4 개 기관의 SAC 기술에 대한 청취실험을 수행하고 그 결과를 분석한다.

  • PDF

A Development on the Optimization Algorithm for MDCT/IMDCT of MPEG-2 AAC (MPEG-2 AAC의 MDCT/IMDCT를 위한 최적 알고리즘 개발)

  • 김병규;이강현
    • Proceedings of the IEEK Conference
    • /
    • 1999.06a
    • /
    • pp.538-541
    • /
    • 1999
  • MPEG-2 AAC(Advanced Audio Coding) is the most advanced coding scheme available for high quality audio coding. This MPEG-2 AAC audio Standard allows for ITU-R ‘indistinguishable’ quality according to at data rates of 320 kb/s for five full-bandwidth channel audio signals. The compression ratio is around a factor of 1.4 better compared to MPEG Layer 3, you get the same quality at 70% of the bitrate. This paper suggest optimization method for MDCT/IMDCT (Modified Discrete Cosine Transform/Inverse Modified Discrete Cosine Transform) in Encoder and Decoder for AAC.

  • PDF

Angle-Based Virtual Source Location Representation for Spatial Audio Coding

  • Beack, Seung-Kwon;Seo, Jeong-Il;Moon, Han-Gil;Kang, Kyeong-Ok;Hahn, Min-Soo
    • ETRI Journal
    • /
    • v.28 no.2
    • /
    • pp.219-222
    • /
    • 2006
  • Virtual source location information (VSLI) has been newly utilized as a spatial cue for compact representation of multichannel audio. This information is represented as the azimuth of the virtual source vector. The superiority of VSLI is confirmed by comparison of the spectral distances, average bit rates, and subjective assessment with a conventional cue.

  • PDF

The Development of audio codec using binaural cue coding technologies (Binaural Cue Coding 기술을 이용한 오디오 코덱 구현)

  • Seo Jeongil;Kang Kyeongok;Lee Byonghwa;Hahn Minsoo
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.137-140
    • /
    • 2004
  • 낮은 대역폭에서 다채널 다객체 오디오 신호를 전송하기위해 새롭게 제안된 Spatial Audio Coding 기술은 멀티채널 오디오 신호를 다운믹싱하고 나머지 채널은 음향공간상의 위치정보를 나타내는 파라미터들로 압축하여 표현하는 파라메트릭 압축 방식이다. 본 논문에서는 Spatial Audio Coding 기술중의 하나인 BCC 기술을 이용하여 스테레오 오디오 코덱을 구현하고, 주관듣기평가 실험을 통하여 AAC와 비슷한 성능을 나타내면서도 높은 압축율을 얻을 수 있음을 확인하였다.

  • PDF

Multi-channel Audio Service in a Terrestrial-DMB System Using VSLI-Based Spatial Audio Coding

  • Seo, Jeong-Il;Moon, Han-Gil;Beack, Seung-Kwon;Kang, Kyeong-Ok;Hong, Jae-Keun
    • ETRI Journal
    • /
    • v.27 no.5
    • /
    • pp.635-638
    • /
    • 2005
  • Spatial audio coding (SAC) is an extremely high compact representation of encoded multi-channel audio material. This paper suggests a multi-channel audio service in the terrestrial digital multimedia broadcasting (T-DMB) system using a novel SAC tool, which is called a virtual source location information (VSLI)-based SAC tool. Intensive experiments are presented to evaluate the validity of the proposed VSLI-based SAC tool, and prototypical systems are also presented to demonstrate the reliability of the proposed multi-channel T-DMB system in real applications.

  • PDF

Design and Development of T-DMB Multichannel Audio Service System Based on Spatial Audio Coding

  • Lee, Yong-Ju;Seo, Jeong-Il;Beack, Seung-Kwon;Jang, Dae-Young;Kang, Kyeong-Ok;Kim, Jin-Woong;Hong, Jin-Woo
    • ETRI Journal
    • /
    • v.31 no.4
    • /
    • pp.365-375
    • /
    • 2009
  • In this paper, a terrestrial digital multimedia broadcasting (T-DMB) multichannel audio broadcasting system based on spatial audio coding is presented. The proposed system provides realistic multichannel audio service via T-DMB with a small increase of data rate as well as backward compatibility with the conventional stereo-based T-DMB player. To reduce the data rate for additional multichannel audio signals, we compress the multichannel audio signals using the sound source location cue coding algorithm, which is an efficient parametric multichannel audio compression technique. For compatibility, we use the dependent property of an elementary stream descriptor, and this property should be ignored in a conventional T-DMB player. To verify the feasibility of the proposed system, we implement the T-DMB multichannel audio encoder and a prototype player. We perform a compatibility test using the T-DMB multichannel audio encoder and conventional T-DMB players. The test demonstrates that the proposed system is compatible with a conventional T-DMB player and that it can provide a promisingly rich audio service.

MPEG-4 오디오 기술 동향

  • 한민수;강경옥;변경진
    • Broadcasting and Media Magazine
    • /
    • v.4 no.1
    • /
    • pp.62-79
    • /
    • 1999
  • In this survey paper the emerging MPEG-4 audio technology is discribed In the previous MPEG-1 and the MPEG-4 audio words, only the natural audio and the speech coding techniques were the standadization objects But in the MPEG-4 audio standadization, not only the natural audio and the speech coding, but also the structured audio and the synthetic speech techniques are inclued, The purpose of this expansion can be summarized as the preparation for the versatile high-quality multimedia services supposed emerge in the 21st century.

  • PDF

MPEG Surround for Multi-Channel Audio Coding-Part 1: Basic Structure (다채널 오디오 코딩을 위한 MPEG Surround-1부: 기본 구조)

  • Pang, Hee-Suk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.7
    • /
    • pp.599-609
    • /
    • 2009
  • An overview of the recently finalized multi-channel audio coding standard MPEG Surround is provided. This audio coding standard downmixes multi-channel signals to mono or stereo signals and, simultaneously, extracts spatial parameters for its encoding process. In its decoding process, it reconstructs multi-channel signals based on the downmix signals and spatial parameters. Since the downmix signals are coded in conventional audio coding format such as AAC and MP3 and the spatial parameters require a small amount of information MPEG Surround guarantees high sound quality multi-channel audio at low bit rates. Besides, it is backward-compatible to conventional audio coding techniques because the downmix signals can be played on portable audio devices ignoring the spatial parameter information. In this paper, Part 1 presents an overview of the basic structure of MPEG Surround and Part 2 describes various modes and tools including the binaural mode which supports the virtual 5.1-channel playback via headphones or earphones. The listening test results by various companies and organizations are also presented.