Error Correction and Praat Script Tools for the Buckeye Corpus of Conversational Speech

(Korean title: Error Correction of the Buckeye Corpus and Praat Script Tools for Using the Corpus)

  • Received : 2012.02.14
  • Accepted : 2012.03.03
  • Published : 2012.03.31

Abstract

This paper shows how to convert the label files of the Buckeye Corpus of Conversational Speech [1] into Praat format and introduces several Praat scripts that enable linguists to study various aspects of the spoken American English in the corpus. During the conversion, several types of errors were identified and corrected, either manually or automatically with scripts. The Praat script tools can extract large numbers of phonetic measurements from the corpus, such as the voice onset time (VOT) of plosives, the formants of vowels, word frequency information, and speech rates spanning several consecutive words. They can also extract information about the phonetic environment of the target words or allophones.
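The conversion step can be illustrated with a minimal Python sketch. It is not the authors' script: it assumes each label is available as an (end time, label) pair, that the start of a segment is the end of the previous one, and that the target is Praat's long-form TextGrid text format. The function names `end_marks_to_intervals` and `to_textgrid` are hypothetical, and skipping marks that do not advance in time is only one illustrative way of handling a possible labeling error.

```python
def end_marks_to_intervals(marks, start=0.0):
    """Turn (end_time, label) pairs into (xmin, xmax, label) intervals.

    Assumption: each mark stores only the END time of its segment.
    Marks that do not advance in time are skipped (illustrative choice).
    """
    intervals = []
    prev = start
    for end, label in marks:
        if end <= prev:
            continue  # zero-length or out-of-order mark
        intervals.append((prev, end, label))
        prev = end
    return intervals


def to_textgrid(intervals, tier_name="words"):
    """Serialize one interval tier as a long-form Praat TextGrid string."""
    if not intervals:
        raise ValueError("at least one interval is required")
    xmin, xmax = intervals[0][0], intervals[-1][1]
    out = [
        'File type = "ooTextFile"',
        'Object class = "TextGrid"',
        "",
        f"xmin = {xmin}",
        f"xmax = {xmax}",
        "tiers? <exists>",
        "size = 1",
        "item []:",
        "    item [1]:",
        '        class = "IntervalTier"',
        f'        name = "{tier_name}"',
        f"        xmin = {xmin}",
        f"        xmax = {xmax}",
        f"        intervals: size = {len(intervals)}",
    ]
    for i, (lo, hi, label) in enumerate(intervals, start=1):
        text = label.replace('"', '""')  # Praat doubles quotes inside strings
        out += [
            f"        intervals [{i}]:",
            f"            xmin = {lo}",
            f"            xmax = {hi}",
            f'            text = "{text}"',
        ]
    return "\n".join(out) + "\n"


# Made-up marks for illustration only, not data from the corpus.
marks = [(0.5, "SIL"), (0.9, "okay"), (1.4, "so")]
textgrid = to_textgrid(end_marks_to_intervals(marks))
```

Writing `textgrid` to a file with a `.TextGrid` extension should give a tier that Praat can open alongside the matching sound file. A real conversion would also need to parse the corpus label files and handle their header, which this sketch leaves out.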

References

  1. Pitt, M.A., Dilley, L., Johnson, K., Kiesling, S., Raymond, W., Hume, E. and Fosler-Lussier, E. (2007). Buckeye Corpus of Conversational Speech (2nd release) [www.buckeyecorpus.osu.edu]. Columbus, OH: Department of Psychology, Ohio State University (Distributor).
  2. Boersma, Paul & Weenink, David (2012). Praat: doing phonetics by computer [Computer program]. Version 5.3.04, retrieved 12 January 2012 from http://www.praat.org/
  3. Sjölander, Kåre & Beskow, Jonas (2000). WaveSurfer - An Open Source Speech Tool.
  4. The CMU Pronouncing Dictionary, URL: http://www.speech.cs.cmu.edu/cgi-bin/cmudict.
  5. R Development Core Team. (2008). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, URL http://www.R-project.org.

Cited by

  1. An Analysis of the Vowel Formants of the Young Males in the Buckeye Corpus, vol. 4, no. 2, 2012, https://doi.org/10.13064/KSSS.2012.4.2.041
  2. Reduction and Frequency Analyses of Vowels and Consonants in the Buckeye Speech Corpus, vol. 4, no. 3, 2012, https://doi.org/10.13064/KSSS.2012.4.3.075