• Title/Summary/Keyword: statistical transliteration model

Search Result 2, Processing Time 0.014 seconds

Phonics-based Rules for Improving Performance of English-to-Korean Transliteration (영.한 음차 표기 성능 향상을 위한 음철법 기반 규칙 구축)

  • Kim, Min-Jeong;Hong, Gum-Won;Park, So-Young;Rim, Hae-Chang
    • Phonetics and Speech Sciences
    • /
    • v.1 no.4
    • /
    • pp.133-144
    • /
    • 2009
  • This paper presents a method for constructing and using transliteration rules which are based on Phonics, an instructional method for speaking and writing English letters. Conventional approaches to automatic transliteration often focused on statistical methods. However, the construction or the collection of correct transliteration examples is always the bottleneck of the statistical transliteration model. Also, in practical domains where the collection of such data is very difficult, such as education and tourism, it is reasonable to build a system without much qualified data. Furthermore, compared with Korean orthography of borrowed foreign words, the proposed approach is much easier to construct, and can generate more refined rules. The experimentation result shows that the proposed approach can improve the performance of a statistical-based transliteration system.

  • PDF

Automatic Extraction of English-Chinese Transliteration Pairs using Dynamic Window and Tokenizer (동적 윈도우와 토크나이저를 이용한 영-중 음차표기 대역쌍 자동 추출)

  • Jin, Cheng-Guo;Na, Seung-Hoon;Kim, Dong-Il;Lee, Jong-Hyeok
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.13 no.6
    • /
    • pp.417-421
    • /
    • 2007
  • Recently, many studies have focused on extracting transliteration pairs from bilingual texts. Most of these studies are based on the statistical transliteration model. The paper discusses the limitations of previous approaches and proposes novel approaches called dynamic window and tokenizer to overcome these limitations. Experimental results show that the average rates of word and character precision are 99.0% and 99.78%, respectively.