Packet Loss Concealment Algorithm Based on Robust Voice Classification in Noise Environment

  • 김형국 (Department of Radio Engineering, Kwangwoon University)
  • 류상현 (Department of Radio Engineering, Kwangwoon University)
  • Received : 2013.07.25
  • Accepted : 2013.09.13
  • Published : 2014.01.31

Abstract

The quality of real-time Voice over Internet Protocol (VoIP) services is degraded by network impairments such as delay, jitter, and packet loss. This paper proposes a packet loss concealment algorithm, based on voice classification that is robust to noisy environments, for enhancing VoIP speech quality. In the proposed method, packets arriving at the receiver are classified using an adaptive threshold derived from the analysis of multiple features extracted from short signal segments. The resulting accurate classification is then used in the packet loss concealment. In addition, the linear prediction-based concealment delivers high voice quality by removing the metallic artifacts that arise when consecutive lost packets are concealed or a lost packet is recovered.
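
The two stages named in the abstract, adaptive-threshold classification of received frames and linear prediction-based concealment of lost ones, can be illustrated with a minimal Python sketch. The abstract does not state which features, thresholds, or prediction order the authors use, so everything here is an assumption made for illustration: the features (log energy and zero-crossing rate), the noise-floor tracking rule, the function names `classify_frames` and `conceal`, and all parameter values (`margin_db`, `alpha`, `zcr_limit`, `order`, `decay`). It is a generic textbook-style example, not the paper's algorithm.

```python
import math

def frame_features(frame):
    """Return (log energy in dB, zero-crossing rate) for one frame of samples."""
    energy = sum(x * x for x in frame) / len(frame)
    log_energy = 10.0 * math.log10(energy + 1e-12)
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return log_energy, crossings / (len(frame) - 1)

def classify_frames(frames, margin_db=6.0, alpha=0.9, zcr_limit=0.25):
    """Label each frame 'silence', 'unvoiced' or 'voiced'.

    Assumed rule: the energy threshold sits margin_db above a running
    noise-floor estimate that is updated only on frames judged silent,
    so the threshold adapts to the background noise level.
    """
    labels, noise_floor = [], None
    for frame in frames:
        log_energy, zcr = frame_features(frame)
        if noise_floor is None:
            noise_floor = log_energy  # assumption: the first frame is background noise
        if log_energy < noise_floor + margin_db:
            labels.append("silence")
            noise_floor = alpha * noise_floor + (1.0 - alpha) * log_energy
        elif zcr > zcr_limit:
            labels.append("unvoiced")
        else:
            labels.append("voiced")
    return labels

def lpc(signal, order):
    """LP coefficients a[1..order] with x[n] ~ sum_k a[k] * x[n-k].

    Autocorrelation method solved with the Levinson-Durbin recursion.
    """
    r = [sum(signal[n] * signal[n - k] for n in range(k, len(signal)))
         for k in range(order + 1)]
    a = [0.0] * (order + 1)
    err = r[0] + 1e-9
    for i in range(1, order + 1):
        if err <= 1e-12:  # prediction error exhausted; keep higher orders at zero
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:]

def conceal(history, frame_len, n_lost=1, order=8, decay=0.7):
    """Extrapolate n_lost frames from past samples with an LP predictor.

    Each further consecutive lost frame is scaled by decay, an assumed
    fade-out that keeps long extrapolations from sounding sustained.
    """
    a = lpc(history, order)
    buf = list(history[-order:])
    out, gain = [], 1.0
    for _ in range(n_lost):
        frame = []
        for _ in range(frame_len):
            pred = sum(a[k] * buf[-1 - k] for k in range(order))
            buf.append(pred)
            frame.append(gain * pred)
        out.append(frame)
        gain *= decay
    return out
```

In a receiver, the labels from `classify_frames` could select how a gap is filled, for example calling `conceal` only after voiced frames and substituting attenuated noise or silence otherwise. That coupling is also an assumption rather than something the abstract specifies.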
