Browse > Article
http://dx.doi.org/10.5370/KIEE.2010.59.6.1126

Vocal Tract Modeling with Unfixed Sectionlength Acoustic Tubes(USLAT)  

Kim, Dong-Jun (청주대 공대 전자정보공학부)
Publication Information
The Transactions of The Korean Institute of Electrical Engineers / v.59, no.6, 2010 , pp. 1126-1130 More about this Journal
Abstract
Speech production can be viewed as a filtering operation in which a sound source excites a vocal tract filter. The vocal tract is modeled as a chain of cylinders of varying cross-sectional area in linear prediction acoustic tube modeling. In this modeling the most common implementation assumes equal length of tube sections. Therefore, to model complex vocal tract shapes, a large number of tube sections are needed. This paper proposes a new vocal tract model with unfixed sectionlengths, which uses the reduced lattice filter for modeling the vocal tract. This model transforms the lattice filter to reduced structure and the Burg algorithm to modified version. When the conventional and the proposed models are implemented with the same order of linear prediction analysis, the proposed model can produce more accurate results than the conventional one. To implement a system within similar accuracy level, it may be possible to reduce the stages of the lattice filter structure. The proposed model produces the more similar vocal tract shape than the conventional one.
Keywords
Vocal Tract Modeling; Unfixed Sectionlength Acoustic Tubes(USLAT); Lattice Filter; Area Function;
Citations & Related Records

Times Cited By SCOPUS : 0
연도 인용수 순위
  • Reference
1 E.P. Neuburg, W.R. Bauer, "On the Source-Filter Model of the Vocal Tract," IEEE Int. Conf. on Acoustics. Speech. and Signal Processing, pp. 1609-1612, 1986.
2 H. Fusisaki, M. Ljungqvist, "Estimation of Voice Source and Vocal Tract Parameters Based on ARMA Analysis and a Model for the Glottal Source Waveform," IEEE Int. Conf. on Acoustics. Speech. and Signal Processing, pp. 637-640, 1987.
3 B. S. Atal, "Speech analysis and synthesis by linear prediction of speech wave," J. Acoust. Soc. Am, Vol. 41, pp. 65(A), 1970.
4 T. F. Quatieri : Discrete-Time Speech Signal Processing, Principles and Practice, Prentice Hall, 2002.
5 G. Fant : Acoustic Theory of Speech Production, Mouton, 1970.
6 J. Schroter, J. N. Larar, and M. M. Sondhi, "Speech Parameter Estimation using a Vocal Tract/Cord Model," IEEE Int. Conf. on Acoustics. Speech. and Signal Processing, pp. 308-311, 1987.
7 H. Wakita, "Direct Estimation of the Vocal Tract Shape by Inverse Filtering of Acoustic Speech Waveforms," IEEE Trans. Acoust., Speech, Signal Processing, Vol. AU-21, No. 5, Oct. 1973.
8 A. M. de L. Araújo, F. Violaro, "Formant Frequency Estimation Using a MEL Scale LPC Algorithm," IEEE Int. Conf. on Acoustics. Speech. and Signal Processing, pp. 207-212, 1998.
9 J. D. Markel, A. H. Gray : Linear Prediction of Speech, Springer-Verlag. Berlin. Heidelberg. New York, 1976.