Overlap and Add Sinusoidal Synthesis Method of Speech Signal Lising the Damping Harmonic Magnitude Parameter

Park, Jong-Bae;Kim, Young-Joon;Lee, In-Sung;

The Journal of Korean Institute of Communications and Information Sciences (한국통신학회논문지)

Volume 34 Issue 3C
/
Pages.251-256
/
2009
/
1226-4717(pISSN)
/
2287-3880(eISSN)

The Korean Institute of Commucations and Information Sciences (한국통신학회)

Overlap and Add Sinusoidal Synthesis Method of Speech Signal Lising the Damping Harmonic Magnitude Parameter

감쇄(damping) 하모닉 크기 파라미터를 이용한 음성의 중첩합산 정현파 합성 방법

Park, Jong-Bae (Department of Radio Engineering, Chungbuk National University) ;
Kim, Young-Joon (Department of Radio Engineering, Chungbuk National University) ;
Lee, In-Sung (Department of Radio Engineering, Chungbuk National University)

박종배 (충북대학교 전파공학과) ;
김영준 (충북대학교 전파공학과) ;
이인성 (충북대학교 전파공학과)

Published : 2009.03.31

PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

In this paper, we propose a new method with the improved continuity performance of overlap and add speech signal synthesis method using damping harmonic amplitude parameter. The existing method uses the average value of past and current parameters for the sinusoidal amplitude used as the weight of phase error function. But, the proposed method extracts the more accurate sinusoidal amplitude by using a correlation between the original signals and the synthesized signals for the sinusodal amplitude used as the weights. To verify the performance of the proposed method, we observed the average differential error value between the synthesized signals.

본 논문에서는 음성신호의 정현파 합성방법 중 하나인 선형위상을 사용한 중첩합산방법에 대하여 감쇄(Damping) 하모닉 크기 파라미터를 사용하여 합성음성의 연속성을 개선시킨 새로운 방법을 제안한다. 기존의 중첩합산 정현파 합성방법은 프레임의 중간 지점에 대한 정현파 파라미터를 얻기 위해서 가중치로 사용된 정현파 크기값을 과거 프레임과 현재 프레임의 평균값을 사용하였으나 제안하는 방법은 정현파 크기값을 단순히 과거와 현재 프레임에서 평균값이 아닌 원 신호와 합성신호 사이의 상관성을 이용하여 감쇄(Damping)요소를 정의하고 보다 정확한 정현파 크기의 파라미터 값을 추출한 후 합성한다. 이렇게 제안한 합성 방법의 성능을 관찰하기 위해 합성방법의 연속성 평가를 통해 기존의 방법과 비교 평가한다. 제안한 방법의 평균 MSE값이 N/2 중첩길이에서 0.251dB, N/4 중첩길이에서 0.298dB 낮아짐을 볼 수 있다.

Keywords

References

R. J. McAulay and T. F. Quatieri, 'Speech analysis/synthesis based on a sinusoidal representation,' IEEE Trans. on ASSP, vol. 34,no. 4, pp. 744–754, Aug. 1986 https://doi.org/10.1109/TASSP.1986.1164910
W. B. Kleijin and K. K. Paliwal, Speech coding and synthesis, Elevier Science Publishers, Amsterdam, 1995
T. F. Quatieri and R. J. McAulay, 'Speech transformations based on a sinusoidal representation,' IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 1449-1464, 1986
E. B. George and M. J. T. Smith, 'Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model,' IEEE Trans. Speech Audio Processing, vol. 5, no. 5, pp. 389-406, 1997 https://doi.org/10.1109/89.622558
Y. Stylianou, 'Applying the harmonic plus noise model in concatenative speech synthesis,' IEEE Trans. Speech Audio Processing, vol. 9, pp. 232-239, Mar. 2001 https://doi.org/10.1109/89.890068
J. Jensen and J. H. L. Hansen, 'Speech enhancement using a constrained iterative sinusoidal model,' IEEE Trans. Speech Audio Processing, vol. 9, pp. 731-740, Oct. 2001 https://doi.org/10.1109/89.952491
J. Nieuwenhuijse, R. Heusdens, and E.F. Deprettere, 'Robust exponential modeling of audio signals,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '98, Seattle, Washington, USA, vol.6, pp. 3581–3584, May 1998 https://doi.org/10.1109/ICASSP.1998.679650
T. S. Verma and T. H. Y. Meng, 'Sinusoidal modeling using frame-based perceptually weighted matching pursuits,' IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '99, Phoenix, Arizona, USA, vol. 2, pp. 981–984, May 1999 https://doi.org/10.1109/ICASSP.1999.759861
M. Goodwin,'Matching Pursuit with Damped Sinusoids', Proc. IEEE ICASSP 1997, vol.3, pp.2037-2040 https://doi.org/10.1109/ICASSP.1997.599345
R. J. McAulay and T. F. Quatieri, "Computationally efficient sine-wave synthesis and its application to sinusoidal Transform coding" Proc. IEEE ICASSP pp.370~373, 1998 https://doi.org/10.1109/ICASSP.1988.196594
박종배, 김규진, 정규혁, 김종학, 이인성 "정현파 크기로 가중치 된 위상 오류 함수를 사용한 음성의 중첩합산 정현파 합성 방법" 한국 통신 학회 논문지 제 32권 제 12호, pp. 1149~1155, 2007

The Journal of Korean Institute of Communications and Information Sciences (한국통신학회논문지)

Overlap and Add Sinusoidal Synthesis Method of Speech Signal Lising the Damping Harmonic Magnitude Parameter

감쇄(damping) 하모닉 크기 파라미터를 이용한 음성의 중첩합산 정현파 합성 방법

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)