DOI QR코드

DOI QR Code

Speaker age estimation and acoustic characteristics: According to pitch and speech rate

화자 연령 지각과 음성적 특성: 음높이와 발화 속도를 중심으로

  • Seo, YoonJeong (Department of Korean Language and Literatures, Korea University) ;
  • Shin, Jiyoung (Department of Korean Language and Literatures, Korea University)
  • 서윤정 (고려대학교 국어국문학과) ;
  • 신지영 (고려대학교 국어국문학과)
  • Received : 2019.08.01
  • Accepted : 2019.12.12
  • Published : 2019.12.31

Abstract

This study aimed to investigate the correlation between speaker's chronological age (CA) and perceived age (PA) and to specify the effect of pitch and speech rate as acoustic cue on judging age, using perceptual testing and acoustic analysis. Three tasks were conducted to identify the degree of listener's accuracy about age estimation. Three perception tasks were conducted to measure the accuracy of 80 Korean listeners when presented with different types of speech. In all the tasks, participants listened to speech samples and gave their estimate of the speaker's age in figures. It was found that Korean listeners are able to gauge the age of a speaker fairly precisely. CA and mean PA were positively correlated in all three tasks. It is clear that the amount and type of information included in the voice samples affected the accuracy of a listener's judgement. Moreover, the result revealed that listeners make use of acoustic information such as pitch and speech rate to estimate speaker's age.

본고는 한국인 피험자를 대상으로 지각 실험을 진행하여 화자의 실제 연령(Chronological age)과 지각 연령(Perceived age) 간의 상관관계를 살피고, 한국인 피험자가 얼마나 정확하게 익명의 화자의 연령을 지각할 수 있는지를 밝히고자 한다. 또한, 이러한 연령 지각에 음성적 단서가 되는 음높이와 발화 속도와 지각 연령 간의 영향 관계를 검토하고자 한다. 이를 위해, 성인 80명을 대상으로 3가지 과제로 구성된 지각 실험을 진행하였다. 실험 자극은 표준어 화자 40명에게서 추출되었으며, 자유 발화, 낭독 발화, 모음 연장 발성으로 구성되었다. 각 실험은 10초 내외의 음성을 듣고 연령을 구체적인 숫자로 답하는 방식으로 진행되었다. 분석 결과, 한국인 피험자들은 상당히 높은 판단 정확도를 보였으며, 모음 연장 발성을 들었을 때보다 자유 발화와 낭독 발화를 들었을 때 화자의 연령을 더욱 정확하게 짐작하였다. 이러한 결과는 음성이 포함하고 있는 정보량의 차이에 기인한 것으로 보인다. 또한, 음성 분석을 수행한 결과 피험자들은 화자의 음높이와 발화 속도를 참고하여 화자의 연령을 추정하는 것으로 나타났으며, 음높이보다는 발화 속도가 연령 지각에 더 적극적으로 기여한 것으로 나타났다.

Keywords

References

  1. Baken, R. J. (2005). The aged voice: A new hypothesis. Journal of Voice, 19(3), 317-325. https://doi.org/10.1016/j.jvoice.2004.07.005
  2. Harnsberger, J. D., Shrivastav, R., Brown, W. S., Rothman, H., & Hollien, H. (2008). Speaking rate and fundamental frequency as speech cues to perceived age. Journal of Voice, 22(1), 58-69. https://doi.org/10.1016/j.jvoice.2006.07.004
  3. Horii, Y. & Ryan, W. J. (1981). Fundamental frequency characteristics and perceived age of adult male speakers. Folia Phoniatrica et Logopaedica, 33(4), 227-233. https://doi.org/10.1159/000265597
  4. Huntley, R., Hollien, H., and Shipp, T. (1987). Influences of listener characteristics on perceived age estimations. Journal of Voic,. 1(1), 49-52. https://doi.org/10.1016/S0892-1997(87)80024-3
  5. Jacewicz, E., Fox, R. A., O’Neill, C., & Salmons, J. (2009). Articulation rate across dialect, age, and gender. Language Variation and Change, 21, 233-256. https://doi.org/10.1017/S0954394509990093
  6. Kim, J., & Seong, C. (2014). Listener’s age estimation by prosody manipulation. Phonetics and Speech Sciences, 6(2), 81-88. https://doi.org/10.13064/KSSS.2014.6.2.081
  7. Lee, N., Shin, J., Yoo, D., & Kim, K. W. (2017). Speech rate in Korean across region, gender and generation. Phonetics and Speech Sciences, 9(1), 27-39. https://doi.org/10.13064/KSSS.2017.9.1.027
  8. Linville, S. E. (2001). Vocal aging. San Diego, CA: Singular Thomson Learning.
  9. Neiman, G. S., & Applegate, J. A. (1990). Accuracy of listener judgments of perceived age relative to chronological age in adults. Folia Phoniatrica et Logopaedica, 42(6), 327-330. https://doi.org/10.1159/000266090
  10. Ptacek, P. H., & Sander, E. K. (1966). Age recognition from voice. Journal of Speech and Hearing Research, 9(2), 273-277. https://doi.org/10.1044/jshr.0902.273
  11. Ramig, L. A. (1983). Effects of physiological aging on speaking and reading rates. Journal of Communication Disorders, 16(3), 217-226. https://doi.org/10.1016/0021-9924(83)90035-7
  12. Ryan, W. J., & Burk, K. W. (1974). Perceptual and acoustic correlates of aging in the speech of males. Journal of Communication Disorders, 7(2), 181-192. https://doi.org/10.1016/0021-9924(74)90030-6
  13. Schotz, S. (2007). Acoustic analysis of adult speaker age. In C. Muller (Ed.), Speaker classification I: Fundamentals, features, and methods, (vol. 1, pp. 88-107). Heidelberg, Germany: Springer.
  14. Shin, J., & Kim, K. W. (2017). Developing a Korean standard speech DB (II). Phonetics and Speech Sciences, 9(2), 9-22. https://doi.org/10.13064/KSSS.2017.9.1.009
  15. Shipp, T., & Hollien, H. (1969). Perception of the aging male voice. Journal of Speech and Hearing Research, 12(4), 703-710. https://doi.org/10.1044/jshr.1204.703