Speech Rate Variation in Synchronous Speech

An Analysis of Speech Rate Variation in Synchronous Speech

  • Received : 2012.11.02
  • Accepted : 2012.12.10
  • Published : 2012.12.31


When two speakers read a text together, the resulting speech has been shown to exhibit considerably reduced variability (e.g., in pause duration and placement, and in speech rate). This paper provides a quantitative analysis of speech rate variation in synchronous speech by examining global and local patterns in two dialects of Mandarin Chinese (Taiwan and Shanghai). We analyzed the speech data in terms of mean speech rate, using the just noticeable difference (JND) as a reference, both within and across subjects. Our findings show that speakers produce slower and less variable speech rates when they read a text synchronously than when they read alone. This global pattern is observed consistently across speakers and dialects, while each dialect maintains its own local pattern of speech rate variation. We conclude that paired speakers lower their speech rates and decrease variability in order to keep their speech synchronized.
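The two measures named in the abstract, mean speech rate and a JND-based comparison, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the function names, the syllables-per-second unit, the 5% JND threshold, and the per-phrase rates are hypothetical and are not taken from the paper's data or procedure.

```python
from statistics import mean, pstdev


def exceeds_jnd(rate_a, rate_b, jnd=0.05):
    """Return True if the relative change from rate_a to rate_b is above the JND.

    The 5% default is a placeholder threshold, not a value from the paper.
    """
    return abs(rate_a - rate_b) / rate_a > jnd


def summarize(rates, jnd=0.05):
    """Summarize per-phrase speech rates (assumed unit: syllables per second).

    Reports the mean rate, its standard deviation, and the share of
    adjacent-phrase changes that exceed the JND threshold.
    """
    changes = [exceeds_jnd(a, b, jnd) for a, b in zip(rates, rates[1:])]
    return {
        "mean": mean(rates),
        "sd": pstdev(rates),
        "noticeable_share": sum(changes) / len(changes),
    }


# Hypothetical per-phrase rates for one speaker in each condition.
solo = [5.0, 6.0, 4.5, 5.5]
sync = [4.0, 4.1, 4.0, 4.1]

solo_stats = summarize(solo)
sync_stats = summarize(sync)
print(solo_stats)
print(sync_stats)
```

With these made-up numbers the synchronous condition has both a lower mean and a smaller spread, and none of its phrase-to-phrase changes crosses the threshold. That is the kind of pattern the abstract describes, though the actual analysis may differ in unit, threshold, and segmentation.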



