Quality Improvement of Low-Bitrate HE-AAC Encoder  

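Kim, Jeong-Geun (Digital Signal Processing Lab, School of Electrical and Electronic Engineering, Yonsei University)
Lee, Jae-Seong (Digital Signal Processing Lab, School of Electrical and Electronic Engineering, Yonsei University)
Lee, Tae-Jin (Broadcasting Media Research Group, Electronics and Telecommunications Research Institute)
Kang, Kyeong-Ok (Broadcasting Media Research Group, Electronics and Telecommunications Research Institute)
Park, Young-Cheol (Division of Computer and Telecommunications Engineering, Yonsei University)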
Abstract
In this paper, we propose new techniques that improve the quality of the AAC and SBR encoders that make up a low-bitrate HE-AAC encoder. To reduce the pre-echo artifacts that often occur in transient blocks in AAC, we propose an extended Temporal Noise Shaping (sTNS) scheme in which the frequency range is selectively extended down to the low-frequency region. In addition, for the high-frequency region coded by the SBR encoder, tones are identified through sinusoidal modeling and their frequencies are adjusted within the QMF band to reduce the noise floor caused by aliasing. Spectrograms of the decoded signals were compared and listening tests were conducted to evaluate the proposed algorithm. The results confirmed the effectiveness of the proposed algorithm.
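As a rough illustration of the TNS idea mentioned in the abstract, the sketch below runs open-loop linear prediction across spectral coefficients over a chosen bin range and reports the prediction gain. It is a minimal sketch, not the paper's sTNS: the function names (`tns_analysis`, `choose_tns_start`), the filter order, the bin numbers, and the rule of picking whichever start bin gives the higher prediction gain are all assumptions made here for illustration. The test spectrum is a cosine across frequency, which is roughly what a single time-domain impulse looks like after a transform.

```python
import math

def autocorr(x, max_lag):
    """Biased autocorrelation r[0..max_lag] of the sequence x."""
    n = len(x)
    return [sum(x[i] * x[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]

def levinson(r, order):
    """Levinson-Durbin recursion. Returns (a, err) with a[0] == 1."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for m in range(1, order + 1):
        if err <= 0.0:
            break
        acc = r[m] + sum(a[k] * r[m - k] for k in range(1, m))
        k_m = -acc / err                      # reflection coefficient
        prev = a[:]
        for k in range(1, m):
            a[k] = prev[k] + k_m * prev[m - k]
        a[m] = k_m
        err *= (1.0 - k_m * k_m)
    return a, err

def tns_analysis(spec, start_bin, stop_bin, order=4):
    """Filter spec[start_bin:stop_bin] with a prediction-error filter
    running across frequency. Returns (filtered spectrum, prediction gain)."""
    band = spec[start_bin:stop_bin]
    r = autocorr(band, order)
    if r[0] == 0.0:                           # empty or silent band
        return list(spec), 1.0
    a, err = levinson(r, order)
    out = list(spec)
    for n in range(len(band)):
        out[start_bin + n] = sum(a[k] * band[n - k]
                                 for k in range(min(order, n) + 1))
    gain = r[0] / err if err > 0.0 else float("inf")
    return out, gain

def choose_tns_start(spec, default_start, extended_start, stop_bin, order=4):
    """Hypothetical selection rule (an assumption, not the paper's criterion):
    extend the range downward only if that raises the prediction gain."""
    _, g_default = tns_analysis(spec, default_start, stop_bin, order)
    _, g_ext = tns_analysis(spec, extended_start, stop_bin, order)
    return extended_start if g_ext > g_default else default_start

# Toy spectrum: a cosine across frequency, standing in for a transient block.
spec = [math.cos(0.3 * k) for k in range(256)]
res, gain = tns_analysis(spec, 64, 256)
print("prediction gain over bins 64..255:", gain)
print("chosen start bin:", choose_tns_start(spec, 64, 8, 256))
```

A high prediction gain across frequency indicates a temporally compact (transient) signal, which is the situation in which shaping the quantization noise in time helps against pre-echo.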
Keywords
HE-AAC; TNS; Sinusoidal model; AAC; SBR; Pre-echo; Musical noise