Vocal Tract Modeling with Unfixed Sectionlength Acoustic Tubes (USLAT)

비고정 구간 길이 음향 튜브를 이용한 성도 모델링

  • 김동준 (Kim Dong-Jun; School of Electronics and Information Engineering, College of Engineering, Cheongju University)
  • Received : 2010.04.22
  • Accepted : 2010.05.18
  • Published : 2010.06.01

Abstract

Speech production can be viewed as a filtering operation in which a sound source excites a vocal tract filter. In linear prediction acoustic tube modeling, the vocal tract is represented as a chain of cylinders of varying cross-sectional area. The most common implementation assumes that all tube sections have equal length, so modeling a complex vocal tract shape requires a large number of sections. This paper proposes a new vocal tract model with unfixed section lengths, which uses a reduced lattice filter to model the vocal tract. To this end, the lattice filter is transformed into a reduced structure and the Burg algorithm into a correspondingly modified version. When the conventional and proposed models are implemented with the same order of linear prediction analysis, the proposed model produces more accurate results than the conventional one. For a system with a similar level of accuracy, it may therefore be possible to reduce the number of stages in the lattice filter structure. The proposed model also yields a vocal tract shape that is more similar to the actual shape than the one obtained with the conventional model.
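
For orientation, the conventional equal-length baseline mentioned in the abstract can be sketched as follows. This is a minimal illustration and not the proposed USLAT method: the reduced lattice structure and the modified Burg algorithm are not specified in the abstract and are not reproduced here. The sketch uses the standard Burg recursion to estimate reflection coefficients and then converts them to relative tube areas. The function names, the `lip_area` parameter, and the sign convention of the area ratio are assumptions made for this example; the convention differs between textbooks.

```python
def burg_reflection_coeffs(x, order):
    """Estimate reflection coefficients with the standard Burg recursion.

    Each lattice stage corresponds to one tube section of equal length
    (the conventional model, not the unfixed-length variant).
    """
    f = list(x)  # forward prediction error
    b = list(x)  # backward prediction error
    ks = []
    for _ in range(order):
        ff = f[1:]   # f_{m-1}[n]
        bb = b[:-1]  # b_{m-1}[n-1]
        num = -2.0 * sum(fi * bi for fi, bi in zip(ff, bb))
        den = sum(fi * fi for fi in ff) + sum(bi * bi for bi in bb)
        k = num / den if den > 0.0 else 0.0
        ks.append(k)
        # Lattice update: both error sequences shrink by one sample per stage.
        f = [fi + k * bi for fi, bi in zip(ff, bb)]
        b = [bi + k * fi for fi, bi in zip(ff, bb)]
    return ks


def areas_from_reflection(ks, lip_area=1.0):
    """Convert reflection coefficients to relative cross-sectional areas.

    Assumed convention: A_next / A_prev = (1 + k) / (1 - k), starting from
    a hypothetical reference area at one end of the tract.
    """
    areas = [lip_area]
    for k in ks:
        areas.append(areas[-1] * (1.0 + k) / (1.0 - k))
    return areas


if __name__ == "__main__":
    # Synthetic decaying signal used only to exercise the functions.
    signal = [0.5 ** n for n in range(32)]
    ks = burg_reflection_coeffs(signal, order=4)
    print(ks)
    print(areas_from_reflection(ks))
```

Because every stage adds one section of fixed length, a finer description of the tract shape in this baseline requires a higher analysis order, which is the limitation the unfixed-section-length model is meant to address.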
