• 제목/요약/키워드: Accuracy of Machine Translation

검색결과 42건 처리시간 0.021초

A Satisfaction Survey on the Human Translation Outcomes and Machine Translation Post-Editing Outcomes

  • Hong, Junghee;Lee, Il Jae
    • International journal of advanced smart convergence
    • /
    • 제10권2호
    • /
    • pp.86-96
    • /
    • 2021
  • This cross-sectional survey research carried out with the inquisitive agenda on satisfaction of the translation outcomes as performed by human translation and (machine translation) post-editing. The survey group consisted of 166 Korean translators primarily working with the English, Chinese, and Japanese languages. They were asked to rate the satisfactory level with accuracy, fluency, idiomatic expression, and terminology in the Richter's scale of four. The result reveals that human translation is more satisfactory than post-editing with respect to accuracy, but it is uneasy to assert that accuracy is unsatisfactory in post-editing. On the other hand, the Korean translators are less satisfied with fluency, idiomatic expression, and terminology than accuracy. It can be assumed that although human translation is more satisfactory than post-editing, the accuracy of post-editing seems to be more acknowledged than fluency, idiomatic expression, and terminology, which lead the translators to take the accuracy of raw machine-translation products and to go on to improve the fluency, idiomatic expression, and terminology. Nevertheless, Korean translators believe Korean idiomatic expressions cannot be satisfactorily produced in post-editing, while fluency and terminology can be improved in post-editing.

A Quality Comparison of English Translations of Korean Literature between Human Translation and Post-Editing

  • LEE, IL-JAE
    • International Journal of Advanced Culture Technology
    • /
    • 제6권4호
    • /
    • pp.165-171
    • /
    • 2018
  • As the artificial intelligence (AI) plays a crucial role in machine translation (MT) which has loomed large as a new translation paradigm, concerns have also arisen if MT can produce a quality product as human translation (HT) can. In fact, several MT experimental studies report cases in which the MT product called post-editing (PE) as equally as HT or often superior ([1],[2],[6]). As motivated from those studies on translation quality between HT and PE, this study set up an experimental situation in which Korean literature was translated into English, comparatively, by 3 translators and 3 post-editors. Afterwards, a group of 3 other Koreans checked for accuracy of HT and PE; a group of 3 English native speakers scored for fluency of HT and PE. The findings are (1) HT took the translation time, at least, twice longer than PE. (2) Both HT and PE produced similar error types, and Mistranslation and Omission were the major errors for accuracy and Grammar for fluency. (3) HT turned to be inferior to PE for both accuracy and fluency.

Cyber Learners' Use and Perceptions of Online Machine Translation Tools

  • Moon, Dosik
    • International journal of advanced smart convergence
    • /
    • 제10권4호
    • /
    • pp.165-171
    • /
    • 2021
  • The current study investigated cyber learners' use and perceptions of online machine translation (MT) tools. The results show that learners use several MT tools frequently and extensively for various second language learning (L2) purposes according to their needs. The learners' overall perceptions of using MT for English learning were generally positive. The learners reported several advantages of machine translation: ease of use, helpful feedback, effective revision, and facilitation of self-directed learning. At the same time, a considerable number of learners were aware of MT's drawbacks, such as awkward sentences, inaccurate grammar, and inappropriate words, and thus held a negative or skeptical view on the quality and accuracy of MT. These findings have important pedagogical implications for using MT in the context of a cyber university. For successful integration of MT in English classes, teachers need to provide appropriate guidelines and training that will help learners use MT effectively.

Customizing an English-Korean Machine Translation System for Patent Translation

  • Choi, Sung-Kwon;Kim, Young-Gil
    • 한국언어정보학회:학술대회논문집
    • /
    • 한국언어정보학회 2007년도 정기학술대회
    • /
    • pp.105-114
    • /
    • 2007
  • This paper addresses a method for customizing an English-to-Korean machine translation system from general domain to patent domain. The customizing method consists of following steps: 1) linguistically studying about characteristics of patent documents, 2) extracting unknown words from large patent documents and constructing large bilingual terminology, 3) extracting and constructing the patent-specific translation patterns 4) customizing the translation engine modules of the existing general MT system according to linguistic study about characteristics of patent documents, and 5) evaluating the accuracy of translation modules and the translation quality. This research was performed under the auspices of the MIC (Ministry of Information and Communication) of Korean government during 2005-2006. The translation accuracy of the customized English-Korean patent translation system is 82.43% on the average in 5 patent fields (machinery, electronics, chemistry, medicine and computer) according to the evaluation of 7 professional human translators. In 2006, the patent MT system started an on-line patent MT service in IPAC (International Patent Assistance Center) under MOCIE (Ministry of Commerce, Industry and Energy) in Korea. In 2007, KIPO (Korean Intellectual Property Office) tries to launch an English-Korean patent MT service.

  • PDF

Classification-Based Approach for Hybridizing Statistical and Rule-Based Machine Translation

  • Park, Eun-Jin;Kwon, Oh-Woog;Kim, Kangil;Kim, Young-Kil
    • ETRI Journal
    • /
    • 제37권3호
    • /
    • pp.541-550
    • /
    • 2015
  • In this paper, we propose a classification-based approach for hybridizing statistical machine translation and rulebased machine translation. Both the training dataset used in the learning of our proposed classifier and our feature extraction method affect the hybridization quality. To create one such training dataset, a previous approach used auto-evaluation metrics to determine from a set of component machine translation (MT) systems which gave the more accurate translation (by a comparative method). Once this had been determined, the most accurate translation was then labelled in such a way so as to indicate the MT system from which it came. In this previous approach, when the metric evaluation scores were low, there existed a high level of uncertainty as to which of the component MT systems was actually producing the better translation. To relax such uncertainty or error in classification, we propose an alternative approach to such labeling; that is, a cut-off method. In our experiments, using the aforementioned cut-off method in our proposed classifier, we managed to achieve a translation accuracy of 81.5% - a 5.0% improvement over existing methods.

A Survey of Machine Translation and Parts of Speech Tagging for Indian Languages

  • Khedkar, Vijayshri;Shah, Pritesh
    • International Journal of Computer Science & Network Security
    • /
    • 제22권4호
    • /
    • pp.245-253
    • /
    • 2022
  • Commenced in 1954 by IBM, machine translation has expanded immensely, particularly in this period. Machine translation can be broken into seven main steps namely- token generation, analyzing morphology, lexeme, tagging Part of Speech, chunking, parsing, and disambiguation in words. Morphological analysis plays a major role when translating Indian languages to develop accurate parts of speech taggers and word sense. The paper presents various machine translation methods used by different researchers for Indian languages along with their performance and drawbacks. Further, the paper concentrates on parts of speech (POS) tagging in Marathi dialect using various methods such as rule-based tagging, unigram, bigram, and more. After careful study, it is concluded that for machine translation, parts of speech tagging is a major step. Also, for the Marathi language, the Hidden Markov Model gives the best results for parts of speech tagging with an accuracy of 93% which can be further improved according to the dataset.

통계 정보를 이용한 전치사 최적 번역어 결정 모델 (A Statistical Model for Choosing the Best Translation of Prepositions.)

  • 심광섭
    • 한국언어정보학회지:언어와정보
    • /
    • 제8권1호
    • /
    • pp.101-116
    • /
    • 2004
  • This paper proposes a statistical model for the translation of prepositions in English-Korean machine translation. In the proposed model, statistical information acquired from unlabeled Korean corpora is used to choose the best translation from several possible translations. Such information includes functional word-verb co-occurrence information, functional word-verb distance information, and noun-postposition co-occurrence information. The model was evaluated with 443 sentences, each of which has a prepositional phrase, and we attained 71.3% accuracy.

  • PDF

기계번역과 인간번역의 혼합적 접근법 (The Blended Approach of Machine Translation and Human Translation)

  • 김양순
    • 문화기술의 융합
    • /
    • 제8권1호
    • /
    • pp.239-244
    • /
    • 2022
  • 인공신경망기계번역은 점진적으로 인간번역과 기계번역의 경계를 허물어가고 있다. 기계번역의 현재와 미래 그리고 기계번역의 장·단점을 논의하는 가운데 인간번역과 기계번역의 실제 번역사례들을 살펴보고 최근 수년간 놀라운 발전을 이룬 인공신경망기계번역이 왜 인간번역의 손길을 필요로 하는지를 논의하는 것이 본 연구의 목표이다. 번역분야에서 인간이 기계로 대치될 수 있는가? 인공신경망기계번역 시대에 인간번역가는 도태되어야 하는가? 그리고 인공신경망기계번역과 지역의 세계화에 기반을 두고 확장하고 있는 세계영어들이라는 언어다양성 시대에 언어장벽을 없애는 것이 가능한가라는 질문 모두에 대하여 부정적인 결론과 함께 기계번역은 신속성, 정확성, 저비용의 생산성이라는 장점을 갖는 유용한 도구임에도 불구하고, 문화, 차용어, 중의성, (국가)방언, 신조어 등의 분야에서는 인간번역이 요구된다고 제안한다. 기계학습을 기반으로 하는 기계번역과 직관과 습득을 기반으로 하는 인간번역은 협업의 상태로 공생 발전해야할 것이다. 기계번역은 역 번역과 인간의 사후편집과 같은 방법을 활용할 때 도덕적 문제를 야기하지 않는 유용한 번역도구가 될 것이다. 결론적으로 기계번역은 인간번역의 손길 없이는 완성될 수 없다는 혼합적 접근법을 제안한다.

Hindi Correspondence of Bengali Nominal Suffixes

  • Chatterji, Sanjay
    • Journal of Multimedia Information System
    • /
    • 제8권4호
    • /
    • pp.221-232
    • /
    • 2021
  • One bottleneck of Bengali to Hindi transfer based machine translation system is the translation of suffixes of noun. The appropriate translation of a nominal suffix often depends on the semantic role of the corresponding noun chunk in the sentence. With the availability of a high performance Bengali morphological analyzer and a basic Bengali parser it is possible to identify the role of each noun chunk. This information may be used for building rules for translating the ambiguous nominal suffixes. As there are some similarities between the uses of Bengali and Hindi nominal suffixes we find that the rules may be identified by linguistically analyzing corpus data. In this paper, we identify rules for the ambiguous four Bengali nominal suffixes from corpus data and evaluate their performances. This set of rules is able to resolve a majority of the nominal suffix ambiguities in Bengali to Hindi transfer based machine translation system. Using the rules, we are able to translate 98.17% Bengali nouns correctly which is much better than the baseline ILMT system's accuracy of 62.8%.

직교 좌표에서 카메라 시스템의 방향과 위치 결정 (Determination of Camera System Orientation and Translation in Cartesian Coordinate)

  • 이용중
    • 한국공작기계학회:학술대회논문집
    • /
    • 한국공작기계학회 2000년도 춘계학술대회논문집 - 한국공작기계학회
    • /
    • pp.109-114
    • /
    • 2000
  • A new method for the determination of camera system rotation and translation from in 3-D space using recursive least square method is presented in this paper. With this method, the calculation of the equation is found by a linear algorithm. Where the equation are either given or be obtained by solving five or more point correspondences. Good results can be obtained in the presence if more than the eight point. A main advantage of this new method is that it decouple rotation and translation, and then reduces computation. With respect to error in the solution point number in the input image data, adding one more feature correspondence to required minimum number improves the solution accuracy drastically. However, further increase in the number of feature correspondence improve the solution accuracy only slowly. The algorithm proposed by this paper is used to make camera system rotation and translation easy to recognize even when camera system attached at end effecter of six degrees of freedom industrial robot manipulator are applied industrial field.

  • PDF