• Title/Summary/Keyword: English-Korean translation

Search Result 307, Processing Time 0.029 seconds

A Research on Test Suites for Machine Translation Systems. (기계번역 시스템 측정 장치 연구)

  • Lee, Min-Haeng;Jee, Kwang-Sin;Chung, So-Woo
    • Language and Information
    • /
    • v.2 no.2
    • /
    • pp.185-220
    • /
    • 1998
  • The purpose of this research is to propose a set of basic guidelines for the construction of English test suites, a set of basic guidelines for the construction of Korean test suites to objectively evaluate the performance of machine translation systems. For this end, we constructed 650 English test sentences, 650 Korean test sentences, and developed the statistical methods and tools for the comparative evaluation of the English-Korean machine translation systems. It also evaluates the existing commercial English-Korean machine translation systems. The importance of this research lies in that it will promote an awareness of the importance and need of testing machine translation systems within the Natural Language Community. This research will also make a big contribution to the development of evaluation methods and techniques for appropriate test suites for Korean information processing systems. The results of this research can be used by the natural language community to test the performance and development of their information processing systems or machine translation systems.

  • PDF

Evaluating English Loanwords and Their Usage for Professional Translation, Focusing on News Texts

  • Bokyung Noh
    • International Journal of Advanced Culture Technology
    • /
    • v.12 no.2
    • /
    • pp.161-166
    • /
    • 2024
  • As globalization has accelerated, the use of English loanwords is increasing in South Korea. In this paper, we have analyzed news stories from four Korean quality newspapers-Chosun Ilbo, Dong-A Ilbo, KyungHyang Sinmun, and Chung-Ang Ilbo to investigate the usage of English loanwords in news texts. Thirty-eight news stories on life, politics, business and IT were collected from the four newspapers and then analyzed based on the five types of loanwords-Direct, Mixed Code Combination, Clipping and Neologism and Double Notation, partly following Lee's and Rudiger's classification. As a result, the followings were revealed: first, the use of the category Direct was overwhelming the others with 90%, indicating that English loanwords were not translated from its source language and introduced into Korean directly with little modification; second, the use of English loanwords was significantly higher in the sections of business and IT than in other sectors, implying that English loanwords function in a similar way as a lingua franca does within those fields. Furthermore, the linguistic trends can provide a basic guide for translators to make an informed decision between the use of English loanwords and its translated Korean version in English-into Korean translation.

Effects of Name Agreement and Word Frequency on the English-Korean Word Translation Task (영어-한국어 단어번역과제에서 이름-일치도와 단어빈도의 효과)

  • Koo, Min-Mo;Nam, Ki-Chun
    • MALSORI
    • /
    • no.61
    • /
    • pp.31-48
    • /
    • 2007
  • This study investigated the roles of name agreement and word frequency in the English-Korean word translation task. Using the low-frequency homonyms with low name agreement as stimuli, Experiment 1 revealed that the name agreement of materials is a determinant which could modulate times to translate English words into Korean equivalents. On the contrary, Experiment 2 showed that the name agreement of materials does not play a decisive role in the translation task, using the low-frequency homonyms having high name agreement as stimuli. In Experiment 3, we identified that the frequency effects observed from previous two experiments are indeed brought about during the lexical access. Our findings suggest that the word frequencies of materials have a strong influence on English-Korean word translation times, and homonyms are represented independently each other in the lexeme level.

  • PDF

An English Translation and Terminology Study of "Dongeuisusebowon.Discourse on Nature and Act" ("동의수세보원(東醫壽世保元).성명론(性命論)"의 용어(用語) 정의(定義) 및 영역(英譯) 연구(硏究))

  • Shin, Sun-Mi;Kang, Goo;Baek, Jin-Ung
    • Journal of Korean Medical classics
    • /
    • v.24 no.4
    • /
    • pp.69-101
    • /
    • 2011
  • Based on the previous translation studies and "WHO-IST", we selected terminology, which are required the definition and explanation among jargon expressed in "Dongeuisusebowon Discourse on Nature and Act", and the procedure of the definition, explanation, and translation in Korean and English has been followed. The outcomes of this study are presented as below: First, based on the existing translation studies, Korean and English translation of "Dongeuisusebowon Discourse on Nature and Act" is provided. Second, few of Terminology in "Dongeuisusebowon(東醫壽世保元)" have been written in WHO-IST, even most of them have been standardized in terms of "Huang Di Nei Jing(黃帝內經)". Therefore, terminology related to Four-constitution medicine in WHO-IST would be required to be corrected, and unattatched terminology should be added in the future. Third, in order to standardize and globalize Four-constitution medicine, further definition, explanation, and translation studies of the rest of Dongeuisusebowon should be continued.

An Analysis on the Vocabulary in the English-Translation Version of Donguibogam Using the Corpus-based Analysis (코퍼스 분석방법을 이용한 『동의보감(東醫寶鑑)』 영역본의 어휘 분석)

  • Jung, Ji-Hun;Kim, Dong-Ryul;Kim, Do-Hoon
    • The Journal of Korean Medical History
    • /
    • v.28 no.2
    • /
    • pp.37-45
    • /
    • 2015
  • Objectives : A quantitative analysis on the vocabulary in the English translation version of Donguibogam. Methods : This study quantitatively analyzed the English-translated texts of Donguibogam with the Corpus-based analysis, and compared the quantitative results analyzing the texts of original Donguibogam. Results : As the results from conducting the corpus analysis on the English-translation version of Donguibogam, it was found that the number of total words (Token) was about 1,207,376, and the all types of used words were about 20.495 and the TTR (Type/Token Rate) was 1.69. The accumulation rate reaching to the high-ranking 1000 words was 83.54%, and the accumulation rate reaching to the high-ranking 2000 words was 90.82%. As the words having the high-ranking frequency, the function words like 'the, and of, is' mainly appeared, and for the content words, the words like 'randix, qi, rhizoma and water' were appeared in multi frequencies. As the results from comparing them with the corpus analysis results of original version of Donguibogam, it was found that the TTR was higher in the English translation version than that of original version. The compositions of function words and contents words having high-ranking frequencies were similar between the English translation version and the original version of Donguibogam. The both versions were also similar in that their statements in the parts of 'Remedies' and 'Acupuncture' showed higher composition rate of contents words than the rate of function words. Conclusions : The vocabulary in the English translation version of Donguibogam showed that this book was a book keeping the complete form of sentence and an Korean medical book at the same time. Meanwhile, the English translation version of Donguibogam had some problems like the unification of vocabulary due to several translators, and the incomplete delivery of word's meanings from the Chinese character-culture area to the English-culture area, and these problems are considered as the matters to be considered in a work translating Korean old medical books in English.

Development of Korean-to-English and English-to-Korean Mobile Translator for Smartphone (스마트폰용 영한, 한영 모바일 번역기 개발)

  • Yuh, Sang-Hwa;Chae, Heung-Seok
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.3
    • /
    • pp.229-236
    • /
    • 2011
  • In this paper we present light weighted English-to-Korean and Korean-to-English mobile translators on smart phones. For natural translation and higher translation quality, translation engines are hybridized with Translation Memory (TM) and Rule-based translation engine. In order to maximize the usability of the system, we combined an Optical Character Recognition (OCR) engine and Text-to-Speech (TTS) engine as a Front-End and Back-end of the mobile translators. With the BLEU and NIST evaluation metrics, the experimental results show our E-K and K-E mobile translation equality reach 72.4% and 77.7% of Google translators, respectively. This shows the quality of our mobile translators almost reaches the that of server-based machine translation to show its commercial usefulness.

A Satisfaction Survey on the Human Translation Outcomes and Machine Translation Post-Editing Outcomes

  • Hong, Junghee;Lee, Il Jae
    • International journal of advanced smart convergence
    • /
    • v.10 no.2
    • /
    • pp.86-96
    • /
    • 2021
  • This cross-sectional survey research carried out with the inquisitive agenda on satisfaction of the translation outcomes as performed by human translation and (machine translation) post-editing. The survey group consisted of 166 Korean translators primarily working with the English, Chinese, and Japanese languages. They were asked to rate the satisfactory level with accuracy, fluency, idiomatic expression, and terminology in the Richter's scale of four. The result reveals that human translation is more satisfactory than post-editing with respect to accuracy, but it is uneasy to assert that accuracy is unsatisfactory in post-editing. On the other hand, the Korean translators are less satisfied with fluency, idiomatic expression, and terminology than accuracy. It can be assumed that although human translation is more satisfactory than post-editing, the accuracy of post-editing seems to be more acknowledged than fluency, idiomatic expression, and terminology, which lead the translators to take the accuracy of raw machine-translation products and to go on to improve the fluency, idiomatic expression, and terminology. Nevertheless, Korean translators believe Korean idiomatic expressions cannot be satisfactorily produced in post-editing, while fluency and terminology can be improved in post-editing.

Probabilistic Part-Of-Speech Determination for Efficient English-Korean Machine Translation (효율적 영한기계번역을 위한 확률적 품사결정)

  • Kim, Sung-Dong;Kim, Il-Min
    • The KIPS Transactions:PartB
    • /
    • v.17B no.6
    • /
    • pp.459-466
    • /
    • 2010
  • Natural language processing has several ambiguity problems, and English-Korean machine translation especially includes those problems to be solved in each translation step. This paper focuses on resolving part-of-speech ambiguity of English words in order to improve the efficiency of English analysis, which is in part of efforts for developing practical English-Korean machine translation system. In order to improve the efficiency of the English analysis, the part-of-speech determination must be fast and accurate for being integrated with machine translation system. This paper proposes the probabilistic models for part-of-speech determination. We use Penn Treebank corpus in building the probabilistic models. In experiment, we present the performance of the part-of-speech determination models and the efficiency improvement of the machine translation system by the proposed part-of-speech determination method.

Construction of English-Korean Automatic Translation System for Patent Documents Based on Domain Customizing Method (도메인 특화 방법에 의한 영한 특허 자동 번역 시스템의 구축)

  • Choi, Sung-Kwon;Kwon, Oh-Woog;Lee, Ki-Young;Roh, Yoon-Hyung;Park, Sang-Kyu
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.2
    • /
    • pp.95-103
    • /
    • 2007
  • This paper describes an English-to-Korean automatic translation system for patent documents which is constructed by a method customizing from a general domain to a specific domain. The customizing method consists of following steps: 1) linguistically studying about characteristics of patent documents, 2) extracting unknown words from large patent documents and terminologically constructing, 3) customizing the target language words of existing terms, 4) extracting and constructing patent translation patterns peculiar to patent documents, 5) customizing existing translation engine modules according to linguistic study about characteristics of patent documents, 6) evaluation of automatic translation results. The English-to-Korean patent machine translation system implemented by these customization steps shows a translation accuracy of 81.03% and is improving.

English-Korean Transfer Dictionary Extension Tool in English-Korean Machine Translation System (영한 기계번역 시스템의 영한 변환사전 확장 도구)

  • Kim, Sung-Dong
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.2 no.1
    • /
    • pp.35-42
    • /
    • 2013
  • Developing English-Korean machine translation system requires the construction of information about the languages, and the amount of information in English-Korean transfer dictionary is especially critical to the translation quality. Newly created words are out-of-vocabulary words and they appear as they are in the translated sentence, which decreases the translation quality. Also, compound nouns make lexical and syntactic analysis complex and it is difficult to accurately translate compound nouns due to the lack of information in the transfer dictionary. In order to improve the translation quality of English-Korean machine translation, we must continuously expand the information of the English-Korean transfer dictionary by collecting the out-of-vocabulary words and the compound nouns frequently used. This paper proposes a method for expanding of the transfer dictionary, which consists of constructing corpus from internet newspapers, extracting the words which are not in the existing dictionary and the frequently used compound nouns, attaching meaning to the extracted words, and integrating with the transfer dictionary. We also develop the tool supporting the expansion of the transfer dictionary. The expansion of the dictionary information is critical to improving the machine translation system but requires much human efforts. The developed tool can be useful for continuously expanding the transfer dictionary, and so it is expected to contribute to enhancing the translation quality.