An English-to-Korean Transliteration Model based on Grapheme and Phoneme  

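Oh Jong-Hoon (Department of Computer Science, Korea Advanced Institute of Science and Technology)
Choi Key-Sun (Department of Computer Science, Korea Advanced Institute of Science and Technology)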
Abstract
Interest in English-to-Korean transliteration has been growing recently. Previous work follows either a direct method, <English graphemes $\rightarrow$ Korean graphemes>, or a pivot method, <English graphemes $\rightarrow$ English phonemes $\rightarrow$ Korean graphemes>. Most previous work focuses on the direct method, but transliteration is a phonetic process rather than an orthographic one. From this point of view, we present an English-to-Korean transliteration model that uses both grapheme and phoneme information. Unlike previous work, our method uses phonetic information such as phonemes and their context, together with the graphemes corresponding to those phonemes. Our method achieves about $60\%$ word accuracy.
Keywords
transliteration; foreign word; phoneme; grapheme; information retrieval;