The Korean Corpus of Spontaneous Speech

Yun, Weonhee;Yoon, Kyuchul;Park, Sunwoo;Lee, Juhee;Cho, Sungmoon;Kang, Ducksoo;Byun, Koonhyuk;Hahn, Hyeseung;Kim, Jungsun;

doi:10.13064/KSSS.2015.7.2.103

말소리와 음성과학 (Phonetics and Speech Sciences)

제7권2호
/
Pages.103-109
/
2015
/
2005-8063(pISSN)
/
2586-5854(eISSN)

한국음성학회 (Korean Society of Speech Sciences)

DOI QR Code

The Korean Corpus of Spontaneous Speech

Yun, Weonhee (Keimyung University) ;
Yoon, Kyuchul (Yeungnam University) ;
Park, Sunwoo (Keimyung University) ;
Lee, Juhee (Kyung Hee University) ;
Cho, Sungmoon (Hanyang University) ;
Kang, Ducksoo (Hankuk University) ;
Byun, Koonhyuk (Hankuk University) ;
Hahn, Hyeseung (Chung-Ang University) ;
Kim, Jungsun (Yeungnam University)

투고 : 2015.04.20
심사 : 2015.06.09
발행 : 2015.06.30

https://doi.org/10.13064/KSSS.2015.7.2.103 인용 PDF KSCI

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

This paper describes the development of the Korean corpus of spontaneous speech, also called the Seoul corpus. The corpus contains the audio recording of the interview-style spontaneous speech from the 40 native speakers of Seoul Korean. The talkers are divided into four age groups; talkers in their teens, twenties, thirties and forties. Each age group has ten talkers, five males and five females. The method used to elicit and record the speech is described. The corpus containing around 220,000 phrasal words was phonemically labeled along with information on the boundaries for Korean phrasal words and utterances, which were additionally romanized. According to the test result of labeling consistency, the inter-labeler agreement on phoneme identification was 98.1% and the mean deviation on boundary placement was 9.04 msec. The corpus will be made available for free to the research community in March, 2015.

키워드

참고문헌

Pitt, M. A., Dilley, L., Johnson, K., Hume, E., Kiesling, S. and W. D. Raymond. (2005). The Buckeye corpus of conversational speech: labeling conventions and a test of transcriber reliability. Speech Communication 45, 89-95. https://doi.org/10.1016/j.specom.2004.09.001
Fosler-Lussier, Eric, Dilley, Laura, Tyson, Na'im & Pitt, Mark (2007). The Buckeye Corpus of Speech: Updates and Enhancements. Proceedings of Interspeech 2007, Antwerp, Belgium.
Boersma, Paul & Weenink, David (2012). Praat: doing phonetics by computer [Computer program]. Version 5.3.04, retrieved 12 January 2012 from http://www.praat.org/
Yun, Weonhee (2003). Multiple acoustic cues for Korean stops and automatic speech recognition. Ph.D thesis. University of Edinburgh.
Cohen, Jacob (1960). A coefficient of agreement for nominal scales. Educational and Psychological Measurement 20 (1), 37-46. https://doi.org/10.1177/001316446002000104

피인용 문헌

Phonological processes of vowels from orthographic to pronounced words in the Buckeye Corpus by sex and age groups* vol.10, pp.2, 2018, https://doi.org/10.13064/KSSS.2018.10.2.025
Phonological processes of vowels from orthographic to pronounced words in the Buckeye Corpus by sex and age groups* vol.10, pp.2, 2018, https://doi.org/10.13064/KSSS.2018.10.2.25
Effects of gender, age, and individual speakers on articulation rate in Seoul Korean spontaneous speech vol.10, pp.4, 2018, https://doi.org/10.13064/KSSS.2018.10.4.019

말소리와 음성과학 (Phonetics and Speech Sciences)

The Korean Corpus of Spontaneous Speech

초록

키워드

참고문헌

피인용 문헌

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)