An efficient transcoding algorithm for AMR and G.723.1 speech coders and performance evaluation

AMR과 G.723.1 음성부호화기를 위한 효율적인 상호부호화 알고리듬 및 성능평가

  • Published : 2004.07.01

Abstract

In the application requiring the interoperability of different networks such as VoIP and wireless communication system, two speech codecs must work together with the structure of cascaded connection, tandem. Tandem has several problems such as long delay, high complexity and quality degradation due to twice complete encoding/decoding process. Transcoding is one of the best solutions to solve these problems. Transcoding algorithm is varied with the structure of source and target coder. In this paper, transcoding algorithm including the LSP conversion, the pitch estimation and new perceptual weighting filter for reducing complexity and improving qualify is proposed. These algorithms are applied to the pair of AMR md G.723.1. By employing the proposed algorithms in the transcoder, the complexity is reduced by about 20%-58% and quality is improved compared to tandem.

무선망과 VoIP 같은 서로 다른 음성 통신 네트워크간의 통신을 할 경우, 서로 다른 구조를 갖는 두 음성부호화기간의 효율적인 연동이 필요하다. 이런 경우, 가장 간단한 방법으로 두 음성부호화기의 복호화기와 부호화기를 직렬로 연결시키는 tandem방식을 사용할 수 있다. 하지만, tandem방식은 긴 지연시간과 많은 연산량, 그리고 음질저하의 문제점들을 갖는데, 이는 상호부호화 방법을 통해서 해결할 수 있다. 상호부호화 알고리듬은 송신단과 수신단의 음성 부호화기의 구조에 의해 결정되고, 본 논문에서는 연산량은 감소시키고, 음질은 향상시킬 수 있는 LSP 변환, 개선된 고속 피치 검색, 상호부호화기를 위한 새로운 지각가중 필터 알고리듬을 제안한다. 제안된 알고리듬은 AMR과 G.723.1간의 상호부호화기에 적용하였다. 제안된 상호부호화 알고리듬을 사용함으로써 tandem 방식에 비하여 연산량은 약 20%-58% 감소되는 반면, 음질은 향상된다.

Keywords

References

  1. ITU-T Rec. G.723.1 'Dual-rate speech coder for multi-media communications transmitting at 5.3 and 6.3 kbit/s,' 1996
  2. 3GPP TS 26.090 V5.0.0, AMR speech codec; Transcoding functions, Jun 2002
  3. S.W.Yoon, S.K. Jung, Y.C. Park, and D.H Youn, 'An efficient transcoding algorithm for G.723.1 and G.729A speech coders,' in Proc. Eurospeech 2001, pp. 2499-2502, Sep 2001
  4. K. T. Kim, S.K. Jung, yc. Park, YS. Choi, D.H Youn 'An efficient transcoding algorithm for G.723.1 and EVRC speech coders,' in Proc. IEEE VTS 54th Vehicular Technology Conference (VTC 2001), vol.3, pp.1561-1564, Oct.7-1O, 2001 https://doi.org/10.1109/VTC.2001.956460
  5. J.K. Choi, C.H. Lee, H.G. kang, Y.C. Park and D.H. Youn, 'Improvement issues on transcoding algorithm:for the flexible usage to the various pairs of speech codec,' in ICASSP 2004, to be published, 2004 https://doi.org/10.1109/ICASSP.2004.1325974
  6. H.G. Kang, H.K. Kim and Richard V. Cox, 'Improving the Transcoding Capability of Speech codecs,' in IEEE Transaction on Multimedia, VOL. 5, NO.1, Mar., 2003 https://doi.org/10.1109/TMM.2003.808823
  7. A.M.Kondoz, Digital speech coding for low rate communication system, John Wiley & Sons, 1994
  8. W.B. Kelijn, Speech coding and synthesis, Elsevier Science B.V., 1995
  9. Simon Haykin, ADAPTIVE FILTER THEORY 4th Edition, Pentice-Hall, Inc., 2002
  10. ITU-T Draft Rec P.191 'Software tools for speech and audio coding standardization,' Nov 2002
  11. ITU-T Draft Rec P.862 'Perceptual evaluation of speech quality (PESQ), an objective method of end-to-end speech quality assessment of narrowband telephone networks and speech codecs,' May 2000
  12. http://www.ntt-at.com