Browse > Article

Establishment of the Korean Standard Vocal Sound into Character Conversion Rule  

이계영 (동국대학교 컴퓨터·멀티미디어학과)
임재걸 (동국대학교 컴퓨터·멀티미디어학과)
Publication Information
Abstract
The purpose of this paper is to establish the Standard Korean Vocal Sound into Character Conversion Rule (Standard VSCC Rule) by reversely applying the Korean Standard Pronunciation Rule that regulates the way of reading written Hangeul sentences. The Standard VSCC Rule performs a crucially important role in Korean speech recognition. The general method of speech recognition is to find the most similar pattern among the standard voice patterns to the input voice pattern. Each of the standard voice patterns is an average of several sample voice patterns. If the unit of the standard voice pattern is a word, then the number of entries of the standard voice pattern will be greater than a few millions (taking inflection and postpositional particles into account). This many entries require a huge database and an impractically too many comparisons in the process of finding the most similar pattern. Therefore, the unit of the standard voice pattern should be a syllable. In this case, we have to resolve the problem of the difference between the Korean vocal sounds and the writing characters. The process of converting a sequence of Korean vocal sounds into a sequence of characters requires our Standard VSCC Rule. Making use of our Standard VSCC Rule, we have implemented a Korean vocal sounds into Hangeul character conversion system. The Korean Standard Pronunciation Rule consists of 30 items. In order to show soundness and completeness of our Standard VSCC Rule, we have tested the conversion system with various data sets reflecting all the 30 items. The test results will be presented in this paper.
Keywords
Standard Pronunciation Rule; Speech Recognition; Vocal Sound into Character Conversion; Hangeul Phonetic Value; Petri Net;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 K. H. Davis, R. Davis and S. Balashek, 'Automatic Recognition of Spoken Digits', J. Acoust. Soc.Am., 24(6), 1952   DOI
2 F. Jelink, 'Continuous speech recognition by statistical method', Proc. IEEE, vol. 64, pp. 532-556, Apr. 1976   DOI   ScienceOn
3 C. S. Myers and L. R. Rabiner, 'A comparaive study of several Dynamic Time Warping Algorithms for Connected Word Recogniion', Bell system Tech. J, 60(7) : 1,389 1,409, September 1981   DOI
4 S. E. Levinson, 'Continuous speech recogniion by means of acoustic-phonetic classificaion obtained from a hidden Markov model,' in Proc. ICASSP '87 (Dallas TX), Apr. 1987   DOI
5 Mariani, 'Recent advances in speech process ng', Proc. of Int. Conf. on Acoustics, Spee h, and Signal Processing, pp, 429-440, Glasg w, May 1989   DOI
6 D. Mansour and B. J juang, 'A family of di orion measures based upon projection oper tion for robust speech recognition', IEEE Trans. on ASSP, Vol. 37, No. 11, pp. 1,659 1,671, 1989   DOI   ScienceOn
7 양진석, 김재범, 이정현, 운율 및 길이 정보를 이용한 무제한 음성 합성기의 설계 및 구현, 한국정보처리학회논문지, 제3권 제5호, 1996   과학기술학회마을
8 K Kita. Kawabata, and H. Saito. 'HMM: con inuous speech recognition using predictive LR parsing', Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Glasgow, Scotland, vol. 2, pp.703-700, 1989   DOI
9 한국과학기술원, 무제한 한국어 음성합성 시스템, 연구보고서, 1990
10 구명완, 대용량 단어 음성인식 시스템을 위한 화자적응에 관한 연구, 한국과학기술원, 박사학위논문, 1991
11 김회린, 음성 신호의 부분 정보를 이용한 음성인식 성능 향상, 한국과학기술원, 박사학위논문, 1991
12 이기문 외 9인, 국어 어문 규정법, 대한교과서 주식회사, 1996
13 김혜순, 변영태, 이기철, 멀티미디어를 이용한 한국어 발음교육 시스템, 한국정보과학회논문지, 93.12, vol.20, NO1.12
14 S.J. Yun, Y. H. Oh, G. C. Shin, 'Improved Lexicon Modeling for Continuous Speech Recognition', International Conference on Acoustics, Speech, and Signal Processing, pp. 1,827-1,830, Munich, Germany, Apr. 1997   DOI
15 문화교육부, 표준어 규정, 문교부 고시 제 88-2호, 1988
16 서울대학교 사범대학 국어교육연구소, 고등학교 문법, 1996
17 이계영, 임재걸, 김경징, 패트리넷을 이용한 표준 발음법 분석 시스템 디자인', 한국정보과학회 봄학술발표논문집, pp. 369-371, 1999
18 T. Murata, 'Pertri nets: properties, analysis and applications', Proc. of the IEEE, Vol77. no.4, pp.541-580, April 1989   DOI   ScienceOn
19 이계영, 임재걸, 한국어 음성합성을 위한 음가변환 테이블 생성, 대한전자공학회논문지, 38CI-5-5, pp. 284-297, 2001