http://dx.doi.org/10.3745/KIPSTB.2008.15-B.6.595

A Study of Fundamental Frequency for Focused Word Spotting in Spoken Korean  

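Kwon, Soon-Il (Intelligent Interaction Research Center, Korea Institute of Science and Technology)
Park, Ji-Hyung (HCI and Robotics Applied Engineering, University of Science and Technology)
Park, Neung-Soo (Department of Computer Engineering, College of Information and Communication, Konkuk University)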
Abstract
The focused word of a sentence helps in recognizing and understanding spoken Korean. To find a method for spotting focused words in a speech signal, we analyzed the average and variance of the fundamental frequency (F0) and the average energy of the focused word and of the other words in each sentence, using speech data from 100 spoken sentences. The results showed that focused words have either a higher relative average F0 or a higher relative F0 variance than the other words. These findings contribute to characterizing the prosody of spoken Korean and to keyword extraction based on natural language processing.
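The per-word measurements named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the treatment of F0 = 0 as an unvoiced frame, the choice to normalize each word by the average over the other words in the sentence, and the final "largest relative value wins" rule are all assumptions made here for illustration.

```python
from statistics import mean, pvariance

def word_features(f0_frames, energy_frames):
    """Average F0, F0 variance, and average energy for one word.

    Assumption: frames with F0 == 0 are unvoiced and are skipped,
    and every word has at least one voiced frame.
    """
    voiced = [v for v in f0_frames if v > 0]
    return {
        "f0_mean": mean(voiced),
        "f0_var": pvariance(voiced),
        "energy_mean": mean(energy_frames),
    }

def relative_features(words):
    """Divide each word's features by the average over the OTHER words.

    `words` is a list of (label, f0_frames, energy_frames) tuples
    (a hypothetical input layout, not taken from the paper).
    """
    feats = [word_features(f0, en) for _, f0, en in words]
    result = []
    for i, (label, _, _) in enumerate(words):
        others = [f for j, f in enumerate(feats) if j != i]
        rel = {}
        for key, value in feats[i].items():
            denom = mean(o[key] for o in others)
            # Guard against a zero denominator (e.g. perfectly flat F0).
            rel[key] = value / denom if denom > 0 else 0.0
        result.append((label, rel))
    return result

def spot_focus(words):
    """Pick the word whose relative F0 average or variance is largest.

    This decision rule is an assumed reading of "either higher relative
    average F0 or higher relative variance of F0".
    """
    scored = relative_features(words)
    return max(scored, key=lambda item: max(item[1]["f0_mean"],
                                            item[1]["f0_var"]))[0]

# Toy sentence with three words; the values are made up for illustration.
sentence = [
    ("w1", [120, 122, 121], [0.5, 0.5, 0.5]),
    ("w2", [180, 200, 160], [0.6, 0.6, 0.6]),
    ("w3", [118, 119, 120], [0.4, 0.4, 0.4]),
]
focus = spot_focus(sentence)
```

In this toy input the second word has both the highest average F0 and the widest F0 spread, so it is the one selected. With real data the F0 and energy frames would come from a pitch tracker and a word-level segmentation, neither of which is shown here.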
Keywords
Focused Word Spotting; Keyword Extraction; Fundamental Frequency; Spoken Korean; Prosody