On Formant Extraction Based on Transfer Function

  • Jiang, Gang-Yi (Division of Electronics Engineering, Ajou University) ;
  • Park, Tae-Young (Division of Electronics Engineering, Ajou University) ;
  • Mei Yu (Division of Electronics Engineering, Ajou University)
  • Published : 1999.06.01

Abstract

This paper focuses on extracting formants from transfer function, derived from linear prediction analysis of speech signal. The second derivative of the log magnitude spectrum of the transfer function, the first and third derivatives of the phase spectrum of the transfer function in the z-plane are discussed. Their resolutions of detecting formants are analyzed and some comparisons are given. Theoretical analyses and experimental results show that the third derivative of the phase spectrum decays more rapidly around the formant locations than the first derivative of the phase spectrum and the second derivative of the log magnitude spectrum. Compared with the second derivative of the log spectrum and the first derivative of the phase spectrum, the third derivative of the phase spectrum has higher resolution in frequency domain and provides more accurate formant extraction.

Keywords

References

  1. Fundamentals of Speech Recognition L.R.Rabiner;B.H.Juang
  2. Digital Processing of Speech Signals L.R.Rabiner;R.W.Schafer
  3. Linear prediction of speech J.D.Markel;A.H.Gray,Jr.
  4. J. Am. Acoust. Soc. v.50 Speech analysis and synthesis by linear prediction of the speech ware B.S.Atal;S.L.Hananer
  5. IEEE Trans. on Audio and Electroacoustics v.20 Digital Inverse Filtering, A New Tool for Formant Trajectory Estimation J.D.Markel
  6. IEEE Trans. on Audio and Electroacoustics v.21 Application of a Digital Inverse Filter for Automatic Formant and F0 Analysis J.D.Markel
  7. IEEE Trans. on Audio and Electroacoustics v.21 Spectral analysis of speech by linear prediction J.Markel
  8. IEEE. Trans. on ASSP v.22 An Algorithm for Automatic Formant Extraction Using Linear Prediction Spectra S.S.McCandless
  9. IEEE Trans. on ASSP v.24 A comparison of three methods of extracting resonance information from predictor-coefficient coded speech R.L.Christensen;W.J.Sreong;E.P.Palmer