DOI QR코드

DOI QR Code

Low Bit Rate을 고려한 8kbps FBD-MPC 방식에 관한 연구

A Study on 8kbps FBD-MPC Method Considering Low Bit Rate

  • 이시우 (상명대학교 정보통신공학과)
  • Lee, See-Woo (Dept. of Information and Telecommunication Engineering, Sangmyung University)
  • 투고 : 2014.04.02
  • 심사 : 2014.06.20
  • 발행 : 2014.06.28

초록

유성음원과 무성음원을 사용하는 음성부호화 방식에 있어서, 같은 프레임 안에 모음과 무성자음이 있는 경우에 음질저하현상이 나타난다. 본 연구에서는 연속음성에서 무성자음을 포함한 천이구간을 탐색, 추출하고 주파수대역에서 근사합성하는 8kbps의 멀티펄스 음성부호화 방식(FBD-MPC)를 제안하였다. 기존의 8kbps MPC와 FBD-MPC의 SNRseg를 평가한 결과, FBD-MPC의 남자음성에서 0.5dB, 여자음성에서 0.2dB 개선된 것을 확인할 수 있었다. 결국, MPC에 비해 FBD-MPC의 SNRseg가 개선되어 음성파형의 일그러짐을 제어할 수 있었으며, 본 방법은 셀룰러폰이나 스마트폰과 같이 Low Bit Rate의 음원을 사용하여 음성신호를 부호화하는 방식에 활용할 수 있을 것으로 기대된다.

In a speech coding system using excitation source of voiced and unvoiced, it would be involved a distortion of speech quality in case coexist with a voiced and unvoiced consonants in a frame. In this paper, I propose a method of 8kbps Multi-Pulse Speech Coding(FBD-MPC: Frequency Band Division MPC) by using TSIUVC(Transition Segment Including Unvoiced Consonant) searching, extraction and approximation-synthesis method in a frequency domain. I evaluate the 8kbps MPC and FBD-MPC. As a result, SNRseg of FBD-MPC was improved 0.5dB for female voice and 0.2dB for male voice respectively. Compared to the MPC, SNRseg of FBD-MPC has been improved that I was able to control the distortion of the speech waveform finally. And so, I expect to be able to this method for cellular phone and smart phone using excitation source of low bit rate.

키워드

참고문헌

  1. B.S.Atal and J.R.Remdo: "A New Medel of LPC Excitation for Producing Natural Sounding Speech at Low Bit Rates", IEEE,ICASSP, p614-617, 1982
  2. Z.A.Putnins, G.A.Wilson, J.Kumarand R.D.Trupp: "A Multi-Pulse LPC Synthesizer for Telecommunications use",IEEE,ICASSP,Mar,1985
  3. Kazunori OZAWA, Takashi ARASEKI: "Multi-Pulse Excited Speech Coding Utilizing Pitch Information at Rates Between 9.6 and 4.8 kbit/s", IEICE, Vol.J72-D-2, No.8, 1989
  4. Campbell,J.P.,Tremain,T.E.:"Voiced/unvoiced classification of speech with applications to the U.S.Government LPC-10e algorithm",Proc.IEEE Int.Conf. on Acoustics, Speech, Sinal Processing, p473-476. 1986
  5. Nobuhiko KITAWAKI, FumitadaI TAKURA and Shuzo SAITO: "Optimum Coding of Transmission Parameters in PARCOR Speech Analysis Synthesis System", IEICE, Vol. J61-A No.2, 1978
  6. K.Krishna, V.L.N.Murty, K.R.Ramakrishnan:"Vector quantization of excitation gains in speech coding", Signal Processing 81,p203-209, 2001 https://doi.org/10.1016/S0165-1684(00)00200-0
  7. Selma Ozaydm, Buyurman Baykal:"Matrix quantization and mixed excitation based linear predictives peech coding at very low bit rates", Speech Communication 41, p381-392, 2003 https://doi.org/10.1016/S0167-6393(03)00009-8