Browse > Article
http://dx.doi.org/10.7776/ASK.2013.32.2.157

Non-Intrusive Speech Quality Estimation of G.729 Codec using a Packet Loss Effect Model  

Lee, Min-Ki (연세대학교 전기전자공학과 디지털 신호처리 연구실)
Kang, Hong-Goo (연세대학교 전기전자공학과 디지털 신호처리 연구실)
Abstract
This paper proposes a non-intrusive speech quality estimation method considering the effects of packet loss to perceptual quality. Packet loss is a major reason of quality degradation in a packet based speech communications network, whose effects are different according to the input speech characteristics or the performance of the embedded packet loss concealment (PLC) algorithm. For the quality estimation system that involves packet loss effects, we first observe the packet loss of G.729 codec which is one of narrowband codec in VoIP system. In order to quantify the lost packet affects, we design a classification algorithm only using speech parameters of G.729 decoder. Then, the degradation values of each class are iteratively selected that maximizes the correlation with the degradation PESQ-LQ scores, and total quality degradation is modeled by the weighted sum. From analyzing the correlation measures, we obtained correlation values of 0.8950 for the intrusive model and 0.8911 for the non-intrusive method.
Keywords
Speech quality estimation; Packet loss concealment; G.729; PESQ; VoIP;
Citations & Related Records
연도 인용수 순위
  • Reference
1 A.W. Rix, J. G. Beerends, D. S. Kim, P. Kroon, and O. Ghitza, "Objective Assessment of Speech and Audio Quality Technology and Applications," IEEE Trans. Audio Speech Lang. Process. 14, 1890-1901 (2006).   DOI   ScienceOn
2 ITU-T Recommendation P.862, Perceptual Evaluation of Speech Quality (PESQ) : An Objective Method for End-To- End Speech Quality Assessment of Narrow-Band Telephone Networks and Speech Codecs, 2001.
3 ITU-T Recommendation P.863, Perceptual Objective Listening Quality Assessment, 2011.
4 ITU-T Recommendation P. 563, Single-Ended Method for Objective Speech Quality in Narrowband Telephony Applications, 2004.
5 ITU-T Recommendation G.107, The E-model : A Computational Model for Use in Transmission Planning, 2009.
6 S. R. Broom, "VoIP quality assessment : taking account of the edge-device," IEEE Trans. Audio Speech Lang. Process. 14, 1977-1983 (2006).   DOI   ScienceOn
7 L. Sun, and E. C. Ifeachor, "Perceived speech quality prediction for voice over IP-based networks," IEEE ICC. 2573-2577 (2002).
8 L. Ding, Z. Lin, A. Radwan, M. S. El-hennawey, and R. A. Goubran, "Non-intrusive single-ended speech quality assessment in VoIP," Speech Commun., 49, 477-489 (2007).   DOI   ScienceOn
9 M. K. Lee, K. T. Kim, H. G. Kang, and D. H. Youn, "Speech quality estimation using packet loss effects in CELP-type speech coders," Proceedings Interspeech 2007, 1697-1700 (2007).
10 3GPP2 C.S0030-0 Ver 3.0, Selectable Mode Vocoder (SMV) Service Option for Wideband Spread Spectrum Communication Systems, 2004.
11 ITU-T Recommendation G.729, Coding of Speech at 8kbit/s using Conjugate-Structure Algebraic Code-Excited Linear Prediction (CS-ACELP), 2007.
12 ITU-T Recommendation P. 862.1, Mapping Function for Transforming P. 862 Raw Result Score to MOS-LQO, 2003.
13 A. M. Kondoz, Digital Speech; Coding for Low Bit Rate Communication Systems (John Wiley & Sons, 1994).
14 ITU-T Recommendation P.Sup23, ITU-T Coded Speech Database, 1998.
15 "Multi-Lingual Speech Database for Telephonometry," NTT Advanced Technology Corporation (NTT-AT), 1994.