Browse > Article
http://dx.doi.org/10.4218/etrij.11.0211.0007

An Efficient Time-Frequency Representation for Parametric-Based Audio Object Coding  

Beack, Seung-Kwon (Broadcasting & Telecommunications Convergence Research Laboratory, ETRI)
Lee, Tae-Jin (Broadcasting & Telecommunications Convergence Research Laboratory, ETRI)
Kim, Min-Je (Broadcasting & Telecommunications Convergence Research Laboratory, ETRI)
Kang, Kyeong-Ok (Broadcasting & Telecommunications Convergence Research Laboratory, ETRI)
Publication Information
ETRI Journal / v.33, no.6, 2011 , pp. 945-948 More about this Journal
Abstract
Object-based audio coding can provide new music applications with interactivity. To efficiently compress a lot of target audio objects, a subband-based parametric coding scheme has been adopted for MPEG spatial audio object coding. In this letter, the time-frequency (T/F) subband analysis structure is investigated. A reconfigured T/F structure is also proposed to enhance the generating performance of sound scenes such as 'karaoke' and 'solo' play in interactive music scenarios. From the experimental results, it was confirmed that the proposed scheme remarkably improves the SNR and sound quality.
Keywords
SAOC; parametric audio coding; spatial cue;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
Times Cited By Web Of Science : 0  (Related Records In Web of Science)
Times Cited By SCOPUS : 0
연도 인용수 순위
1 ISO/IEC 23003-2:2010, "Part 2: Spatial Audio Object Coding," International Standard, Oct. 2010.
2 T. Lee et al., "A Personalized Preset-based Audio System for Interactive Service," AES Convention, Oct. 2006.
3 ISO/IEC 14496-3:2001, "Parametric Coding for High Quality Audio," Dec. 2003.
4 ISO/IEC 23003-1:2007, "Part 1: MPEG Surround," International Standard, Jan. 2007.
5 C. Faller and R. Baumgarte, "Binaural Cue Coding-Part II: Schemes and Application," IEEE Trans. Speech Audio Proc., vol. 11, no. 6, Nov. 2003.
6 S. Beack et al., "Angle-Based Virtual Source Location Representation for Spatial Audio Coding," ETRI J., vol. 28, no. 2, Apr. 2006, pp. 219-222.   DOI
7 3GPP TS 26.290, Extended Adaptive Multi-Rate-Wideband Codec (AMR-WB+): Transcoding Functions.
8 ITU-R Recommendation, Method for the Subjective Assessment of Intermediate Sound Quality (MUSHRA), ITU, BS. 1543-1, Geneva, 2001.