DOI QR코드

DOI QR Code

A Study on 8kbps FBD-MPC Method Considering Low Bit Rate

Low Bit Rate을 고려한 8kbps FBD-MPC 방식에 관한 연구

  • Lee, See-Woo (Dept. of Information and Telecommunication Engineering, Sangmyung University)
  • 이시우 (상명대학교 정보통신공학과)
  • Received : 2014.04.02
  • Accepted : 2014.06.20
  • Published : 2014.06.28

Abstract

In a speech coding system using excitation source of voiced and unvoiced, it would be involved a distortion of speech quality in case coexist with a voiced and unvoiced consonants in a frame. In this paper, I propose a method of 8kbps Multi-Pulse Speech Coding(FBD-MPC: Frequency Band Division MPC) by using TSIUVC(Transition Segment Including Unvoiced Consonant) searching, extraction and approximation-synthesis method in a frequency domain. I evaluate the 8kbps MPC and FBD-MPC. As a result, SNRseg of FBD-MPC was improved 0.5dB for female voice and 0.2dB for male voice respectively. Compared to the MPC, SNRseg of FBD-MPC has been improved that I was able to control the distortion of the speech waveform finally. And so, I expect to be able to this method for cellular phone and smart phone using excitation source of low bit rate.

유성음원과 무성음원을 사용하는 음성부호화 방식에 있어서, 같은 프레임 안에 모음과 무성자음이 있는 경우에 음질저하현상이 나타난다. 본 연구에서는 연속음성에서 무성자음을 포함한 천이구간을 탐색, 추출하고 주파수대역에서 근사합성하는 8kbps의 멀티펄스 음성부호화 방식(FBD-MPC)를 제안하였다. 기존의 8kbps MPC와 FBD-MPC의 SNRseg를 평가한 결과, FBD-MPC의 남자음성에서 0.5dB, 여자음성에서 0.2dB 개선된 것을 확인할 수 있었다. 결국, MPC에 비해 FBD-MPC의 SNRseg가 개선되어 음성파형의 일그러짐을 제어할 수 있었으며, 본 방법은 셀룰러폰이나 스마트폰과 같이 Low Bit Rate의 음원을 사용하여 음성신호를 부호화하는 방식에 활용할 수 있을 것으로 기대된다.

Keywords

References

  1. B.S.Atal and J.R.Remdo: "A New Medel of LPC Excitation for Producing Natural Sounding Speech at Low Bit Rates", IEEE,ICASSP, p614-617, 1982
  2. Z.A.Putnins, G.A.Wilson, J.Kumarand R.D.Trupp: "A Multi-Pulse LPC Synthesizer for Telecommunications use",IEEE,ICASSP,Mar,1985
  3. Kazunori OZAWA, Takashi ARASEKI: "Multi-Pulse Excited Speech Coding Utilizing Pitch Information at Rates Between 9.6 and 4.8 kbit/s", IEICE, Vol.J72-D-2, No.8, 1989
  4. Campbell,J.P.,Tremain,T.E.:"Voiced/unvoiced classification of speech with applications to the U.S.Government LPC-10e algorithm",Proc.IEEE Int.Conf. on Acoustics, Speech, Sinal Processing, p473-476. 1986
  5. Nobuhiko KITAWAKI, FumitadaI TAKURA and Shuzo SAITO: "Optimum Coding of Transmission Parameters in PARCOR Speech Analysis Synthesis System", IEICE, Vol. J61-A No.2, 1978
  6. K.Krishna, V.L.N.Murty, K.R.Ramakrishnan:"Vector quantization of excitation gains in speech coding", Signal Processing 81,p203-209, 2001 https://doi.org/10.1016/S0165-1684(00)00200-0
  7. Selma Ozaydm, Buyurman Baykal:"Matrix quantization and mixed excitation based linear predictives peech coding at very low bit rates", Speech Communication 41, p381-392, 2003 https://doi.org/10.1016/S0167-6393(03)00009-8