DOI QR코드

DOI QR Code

A Study on APC-MPC in 8kbps of Convergence System

융복합 시스템의 8kbps에 있어서 APC-MPC에 관한 연구

  • Lee, See-Woo (Dept. of Information and Telecommunication Engineering)
  • 이시우 (상명대학교 정보통신공학과)
  • Received : 2015.04.21
  • Accepted : 2015.07.20
  • Published : 2015.07.28

Abstract

In a MPC(Multi-Pulse Coding) using excitation source of voiced and unvoiced, it would be a distortion of voice waveform. This is caused by normalization of synthesis speech waveform of voiced in the process of restoration. To solve this problem, this paper present APC-MPC of amplitude-position compensation in a multi-pulses each pitch interval in order to reduce distortion of synthesis waveform. Also, I was implemented that the APC-MPC in coding system. And I evaluate the SNRseg of APC-MPC in 8kbps coding condition of convergence system. As a result, SNRseg of APC-MPC was 13.9dB for female voice and 14.3dB for male voice respectively. And so, I expect to be able to this method for cellular phone and smart phone using excitation source of low bit rate.

유성음원과 무성음원을 사용하는 멀티펄스 음성부호화 방식(MPC)에 있어서, 유성음의 파형에서 일그러짐이 발생한다. 이러한 문제를 해결하기 위해, 재생파형의 일그러짐이 감소하도록 피치구간 마다 멀티펄스의 진폭과 위치를 보정하는 APC-MPC를 제안하였다. 또한 융복합 시스템의 8kbps 부호화 조건에서 APC-MPC의 SNRseg를 검토하고 부호화 시스템으로 구현하였다. APC-MPC의 SNRseg를 평가한 결과, APC-MPC의 남자음성에서 14.3dB, 여자음성에서 13.9dB 임을 확인할 수 있었다. 본 방법은 셀룰러폰이나 스마트폰과 같이 Low Bit Rate의 음원을 사용하여 음성신호를 부호화하는 방식에 활용할 수 있을 것으로 기대된다.

Keywords

References

  1. Selma Ozaydm, Buyurman Baykal:"Matrix quantization and mixed excitation based linear predictive speech coding at very low bit rates",Speech Communication 41,p381-392, 2003 https://doi.org/10.1016/S0167-6393(03)00009-8
  2. Ghaemmaghami, S., Sridharan, S.:"Very low rate speech coding using temporal decomposition".IEE Electron. Lett.35(6), p456-457.1999 https://doi.org/10.1049/el:19990316
  3. McCree, A.V, Barnwell, T.P.,:"A mixed excitation LPC vocoder model for low bit rate speech coding", IEEE Trans. Speech Audio Process, p242-250,1995
  4. Phu Chien Nguyen, Masato Akagi, Binh Phu Nguyen: "Limited error based event localizing temporal decomposition and its application to variable-rate seech coding", Speech Communication 49, p292-304, 2007 https://doi.org/10.1016/j.specom.2007.02.007
  5. LeBlanc, W.P., Bhattacharya,B.,Mahmoud, S.A.: "Efficient search and design procedures for robust multi stage vector quantization of LPC parameters for 4kbps speech coding".IEEE Trans. Speech Audio Process.p373-385.1993
  6. David A. Krubsack and Russell J. Niederjohn:"An Autocorrelation Pitch Detector and Voicing Decision with Confidence Measures Developed for Noise-Corrupted Speech", IEEE, Transactions of Signal Processing, Vol.39, No.2, 1991
  7. Kazunori Ozawa, Shigeru Ono and Takashi Araseki:"A study on pulse search algorithm for multipulse excited speech coder realization", IEEE, Jounal on Selected areas in Communications, Vol. SAC-4, No.1, 1986
  8. B.S.Atal and J.R.Remdo:"A New Medel of LPC Excitation for Producing Natural Sounding Speech at Low Bit Rates", IEEE,ICASSP, p614-617, 1982
  9. Z.A.Putnins, G.A.Wilson, J.Kumar and R.D. Trupp: "A Multi-Pulse LPC Synthesizer for Telecommunications use",IEEE,ICASSP,Mar,1985
  10. Kazunori OZAWA, Takashi ARASEKI: "Multi-Pulse Excited Speech Coding Utilizing Pitch Information at Rates Between 9.6 and 4.8 kbit/s", IEICE, Vol.J72-D-2, No.8, 1989
  11. K.Krishna, V.L.N.Murty,.R.Ramakrishnan:"Vector quantization of excitation gains in speech coding", Signal Processing 81,p203-209, 2001 https://doi.org/10.1016/S0165-1684(00)00200-0
  12. Widrow B. and Hoff M. E.:"Adaptive Switching Circuit", IRE WESCON Conv. Rec, June 2000
  13. Campbell,J.P.,Tremain,T.E.:"Voiced/unvoiced classification of speech with applications to the U.S. Government LPC-10e algorithm", Proc.IEEE Int.Conf. on Acoustics, Speech, Sinal Processing, p473-476.1986
  14. LEAH.J.SIEGE and ALANC. BESSEY: "Voiced/Unvoiced/Mixed Excitation Classification of Speech", IEEE, Vol. ASSP-30, No.3, 1982
  15. HIDEFUMI KOBATAKE:"Optimization of Voiced/Unvoiced Decisions in Nonstationary Noise Environments", IEEE, Vol. ASSP-35, No.1, 1987
  16. Nobuhiko KITAWAKI, Fumitada ITAKURA and Shuzo SAITO: "Optimum Coding of Transmission Parameters in PARCOR Speech Analysis Synthesis System", IEICE, Vol. J61-A No.2, 1978