Browse > Article
http://dx.doi.org/10.7776/ASK.2010.29.1.062

Enhanced Adjustment Strategy of Masking Threshold for Speech Signals in Low Bit-Rate Audio Coding  

Lee, Chang-Heon (연세대학교 전기전자공학과)
Kang, Hong-Goo (연세대학교 전기전자공학과)
Abstract
This paper proposes a new masking threshold adjustment strategy to improve the performance for speech signals in low bit-rate audio coding. After determining formant regions, the masking threshold is adjusted by using the energy ratio of each sub-band to the average energy of each formant. More quantization noises are added to the bands that have relatively large energy, but less distortion is allowed in spectral valley regions by allocating more bits, which reflects the concept of perceptual weighting widely used in speech coding. From the results of objective speech quality measure, we verified that the proposed method improves quality for the speech input signals compared to the conventional one.
Keywords
Enhanced AacPlus; HE-AAC; Psychoacoustic Model; Masking Threshold; Perceptual Weighting; Audio Coding; Speech Coding;
Citations & Related Records
연도 인용수 순위
  • Reference
1 E. Zwicker and H. FastI, Psychoacoustics, Facts and Models, 2nd Updated ed. New York: Springer, 1999.
2 J. D. Johnston, "Transform coding of audio signals using perceptual noise criteria," IEEE J. Select. Areas Commun., vol. 6, pp, 314-323, 1988.   DOI   ScienceOn
3 3GPP TS 26,401 v6.2.0, Enhanced aacPlus general audio codec; General description, Mar., 2005.
4 3GPP TS 26,403 v7.0.0, Enhanced aacPlus general audio codec; Encoder specification; Advanced audio coding (AAC) part, June, 2006.
5 M,Wolters, K. Kjorling, D. Homm, and H. Purnhagen, "A closer look into MPEG-4 High Efficiency MC," 115th AES Convention, New York, USA, Oct. 2003, preprint 5871.
6 M. R. Schroeder, B. S. Atal, and J. L. Hall, "Optimizing digital speech coders by exploiting masking properties of the human ear," J. Acoust. Soc. Amer., vol. 66, pp. 1647 - 1652, 1979.   DOI   ScienceOn
7 E. K. P. Chong and S. H. Zak, An Introduction to Optimization, Second ed. New York: Wiley, 2001.
8 C. H. Lee, H. O. Oh and H. G. Kang, "On the study of noise allocation for speech signal in low bit-rate audio coding," IEEE Signal Processing Letters, vol. 16, no. 10, pp. 849-852, 2009.   DOI