• Title/Abstract/Keywords: language translation

563 search results, processing time 0.18 s

최신 기계번역 품질 예측 연구 (Research on Recent Quality Estimation)

  • 어수경;박찬준;문현석;서재형;임희석
    • 한국융합학회논문지 / Vol. 12, No. 7 / pp.37-44 / 2021
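  • Quality estimation (QE) for machine translation can assess the quality of machine translation output without a reference sentence, and demand for it is growing because of its broad applicability. The Conference on Machine Translation (WMT) holds a related shared task every year, and recent work has mainly applied pretrained language models (PLMs) trained on large-scale data. This paper surveys the QE task and its research trends, and summarizes the characteristics of the PLMs that are frequently used today. It also uses the multilingual BART model, which had not previously been applied to QE, in comparative experiments and analysis against the models used in prior work: XLM, multilingual BERT, and XLM-RoBERTa. The experiments identify which pretrained multilingual model is most effective when applied to QE and confirm that multilingual BART is applicable to the QE task.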

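중국인 학습자를 위한 문화교육으로서 한·중 소설 비교읽기 -4.19와 문화대혁명을 중심으로- (Comparative Reading of Korean and Chinese Novels as Cultural Education for Chinese Learners -Focusing on 4.19 and the Cultural Revolution-)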

  • 전영의;엄영욱
    • 중국학논총 / No. 62 / pp.85-100 / 2019
  • The purpose of this article is to examine 'reading Chinese novels in translation as integrated Korean language education for Chinese students'. The number of foreign students at domestic universities has increased rapidly owing to Korea's economic growth, the influence of the Korean Wave, and the popularity of Korean popular culture such as K-pop, but problems with their curricula have been found in many places. Literary education through novel texts has an important place in Korean studies, yet it is often excluded from Korean language education as a foreign language. First, Chinese students already have background knowledge of the Korean translations through the original Chinese novels, so they can obtain a learning effect in their Korean language study. Second, they can compare Korean state violence with Chinese state violence through the 'Red Revolution' and come to understand how Korea and China perceive the era and its social and cultural phenomena. Third, they can study the theory of literature itself, which is also an educational purpose pursued by the humanities. Chinese students develop their Korean language skills by studying Brothers in Korean translation, and the similarities and differences in state violence become visible by comparing Korea's '4.19' with China's 'Cultural Revolution'. Comparing the characters, the background, and the dynamics of the spaces in which they are located can raise awareness of the historical and social problems of both countries. It is also possible to study, within the text, the subjects' memories of space, changes in local meaning, and the formation of urban or individual space in the specific places where state violence occurs. In this way, integrated Korean language education through Brothers, a Chinese novel in translation, offers an opportunity to look at state violence in Korean and Chinese space in the 1960s and 1970s. It offers a subjective perspective that moves away from subordination to the nationality of the modern nation-state. This is an educational effect that can be obtained by reading a Chinese novel in translation as integrated Korean language education.

PDMT 번역 방법론에 기반한 대화체 음성 언어 번역 시스템 (Spoken language Translation System Based on PDMT)

  • 윤승;유초롱;최미란;오승신;박준;이영직
    • 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리) / 한국정보과학회언어공학연구회 2003년도 제15회 한글 및 한국어 정보처리 학술대회 / pp.279-283 / 2003
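  • C-STAR, the international consortium on automatic interpretation in which ETRI participates, has jointly built a large-scale multilingual parallel corpus for the travel domain, and each member institution is currently developing a spoken language translation system based on it. ETRI has proposed a translation methodology called phrase-based direct machine translation (PDMT), characterized by keyword processing, automatic determination of phrase units using statistical information, and automatic alignment and reordering of the resulting phrases, and is conducting related research. This paper describes the configuration of ETRI's spoken language translation system and discusses in detail the background to PDMT, its concrete translation method, and its characteristics.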

  • PDF

영한 대화체 자동번역을 위한 특화 방안 (Customization for English-Korean Spoken Language Machine Translation)

  • 이기영;노윤형;권오욱;최승권;김영길
    • 한국정보과학회 언어공학연구회:학술대회논문집(한글 및 한국어 정보처리) / 한국정보과학회언어공학연구회 2009년도 제21회 한글 및 한국어 정보처리 학술대회 / pp.50-55 / 2009
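  • The domain of machine translation is currently shifting from written language to spoken language, driven by the demands of applications. This paper analyzes the characteristics of spoken language from the perspective of each module of a machine translation system and of the knowledge it uses. Based on this analysis, the paper customizes a spoken language machine translation system for the travel domain. A structured translation memory (TM) was introduced as new knowledge for spoken language translation, and each module of the system was customized for spoken language. In addition, existing patterns built for written language were revised, and new patterns for high-frequency spoken expressions were introduced. A manual evaluation was carried out to validate the proposed method and showed an improvement in the translation rate for English-Korean spoken language machine translation.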

  • PDF

『국역본 <>·<>』 고찰 -표기적 특징과 이본적 성격을 중심으로- (A Study on 『Korean Translation of ·』 -Focused on declared characteristics and characteristics in different versions-)

  • 간호윤
    • 고전문학과교육 / No. 15 / pp.355-387 / 2008
  • The purpose of this study was to determine when the "Korean Translation of " was translated into Korean and copied, and to examine closely its characteristics as a variant version. The "Korean Translation" is a collection of romance and love stories translated into Korean and discovered by Professor Kim, Il Geun, and it carries considerable significance for the history of the novel as a 'Korean translation held by the royal court'. The conclusions of the study are summarized as follows. (1) Considering the phonological phenomena, grammar, and vocabulary from the standpoint of Korean linguistics, the texts are presumed to have been translated into Korean and copied between the reigns of King Sukjong and King Yungjo of the Joseon Dynasty. The reason is that their orthography is intermediate between that of the copied 'Myeoknambon "Korean Translation of Taepyeonggwanggi(태평광기)"' and that of 'NakseonJaebon(낙선재본)', which dates them between the middle of the 17th century and the middle of the 18th century, and that the reign of King Jeongjo of the Joseon Dynasty, designated as the period in which the novels are set, should be excluded. Consequently, the Korean Translation confirms that the scope of the novel in Korean literary history between the 17th and 18th centuries extended to 'the royal court' and 'women'. (2) In terms of vocabulary, the "Korean Translation" is also significant as a collection translated in the royal court. It does not contain new vocabulary, but some words, such as '(Traces:痕)', '(Clean eyes:明眸)', ' (Sail:帆)', '(Get up:起)', '글이플(Weak grass:弱草)', '쇼록(Owl:? 梟 or 鴉?)', '이 사라심(This life:此生)', and '노혀오매(Look for:訪)', are good data for the study of the Korean language. (3) The "Korean Translation" is valuable data on the translation and copying of court novels, and intentionally changed parts and partially omitted sentences can be found rather in the than in the . A translated text differs from a copied text, and the second work shows both the intent of the translation and the instability of the copying. The "Korean Translation" therefore has a dual context in which one work is translated and a variant version is derived from it, unlike a simple copy. (4) The "Korean Translation" is closely related to "Hangoldong(閒汨董)", but it is not based on the same source copy. The source copy for the "Korean Translation" is a variant in the same lineage as "Hangoldong" and "Jeochobon(저초본:정명기 소장본)" and is closer to "Hangoldong", but it is still not the same source copy. (5) Given that the "Korean Translation" has no distinct relation to the "Hangoldong", there is no correlation between the "Korean Translation" and and the "Hangoldong" and . In addition, we could not identify a common writer between the two.

Linguistic Processing in Automatic Interpretation System between English-Korean Language Pair

  • Choi, K.S.;Lee, S.M.;Lee, Y.J.
    • 한국음향학회:학술대회논문집 / 한국음향학회 1994년도 FIFTH WESTERN PACIFIC REGIONAL ACOUSTICS CONFERENCE SEOUL KOREA / pp.1076-1081 / 1994
  • This paper presents the linguistic processing in an automatic interpretation system for the English-Korean language pair. We introduce two machine translation systems, one for English-to-Korean and one for Korean-to-English, describe the system configuration and several of its characteristics, and discuss the translation evaluation results.

  • PDF

한-베 기계번역에서 한국어 분석기 (UTagger)의 영향 (Effect of Korean Analysis Tool (UTagger) on Korean-Vietnamese Machine Translations)

  • 원광복;옥철영
    • 한국어정보학회:학술대회논문집 / 한국어정보학회 2017년도 제29회 한글및한국어정보처리학술대회 / pp.184-189 / 2017
  • With the advent of robust deep learning methods, neural machine translation (NMT) has recently become the dominant paradigm and has achieved adequate results for translation between widely used languages such as English, German, and Spanish. However, its results for the under-resourced languages Korean and Vietnamese are still limited. This paper reports an attempt to build a bidirectional Korean-Vietnamese NMT system supported by the Korean analysis tool UTagger, which performs morphological analysis, POS tagging, and WSD. Experimental results demonstrate that UTagger can significantly improve the translation quality of the Korean-Vietnamese NMT system in both translation directions. In particular, it improves the score by approximately 15 BLEU points for Korean-to-Vietnamese translation and by 3.12 BLEU points for the reverse direction.

  • PDF
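The abstract above reports its gains in BLEU. As a rough illustration of what that metric measures, here is a minimal sketch of a simplified, single-reference, sentence-level BLEU: clipped n-gram precision combined with a brevity penalty. It is not the evaluation script used in the paper. The function names, the example sentences, and the choice of `max_n=2` are all assumptions made for illustration, and real evaluations normally use corpus-level BLEU with up to 4-grams.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count the n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(candidate, reference, max_n=4):
    """Simplified single-reference BLEU on a 0-1 scale (no smoothing)."""
    cand, ref = candidate.split(), reference.split()
    if not cand:
        return 0.0
    log_precision_sum = 0.0
    for n in range(1, max_n + 1):
        cand_counts = ngram_counts(cand, n)
        ref_counts = ngram_counts(ref, n)
        # Clip each candidate n-gram count by its count in the reference.
        matched = sum(min(count, ref_counts[gram])
                      for gram, count in cand_counts.items())
        total = sum(cand_counts.values())
        if matched == 0 or total == 0:
            return 0.0  # without smoothing, any zero precision gives BLEU = 0
        log_precision_sum += math.log(matched / total)
    # Brevity penalty: penalize candidates shorter than the reference.
    brevity = 1.0 if len(cand) > len(ref) else math.exp(1 - len(ref) / len(cand))
    return brevity * math.exp(log_precision_sum / max_n)


# Hypothetical English sentences, used only to exercise the function.
reference = "the cat is on the mat"
hypothesis = "the cat sat on the mat"
score = 100 * sentence_bleu(hypothesis, reference, max_n=2)
print(f"BLEU-2: {score:.2f}")  # prints BLEU-2: 70.71
```

BLEU is conventionally reported on a 0-100 scale, so a gain of about 15 points is a difference of roughly 0.15 on the 0-1 scale that `sentence_bleu` returns.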

Sentence Translation and Vocabulary Retention in an EFL Reading Class

  • Kim, Boram
    • 영어어문교육 / Vol. 18, No. 2 / pp.67-84 / 2012
  • The present study investigated the effect of sentence translation as a production task on short-term and long-term retention of foreign vocabulary. Eighty-seven beginning-level EFL university students enrolled in a reading class participated in the study. The study compared the vocabulary recall of three groups: (1) a control group, (2) a translation group, and (3) a copy group. During the treatment sessions, the translation group translated L1 sentences into English, while the copy group simply copied given English sentences containing each target word. Results of the immediate test were collected each week from week 2 to week 5 and analyzed by one-way ANOVA. They revealed that, for short-term vocabulary retention, participants in the rote-copy condition outperformed those in the translation group. Four weeks later, a delayed test was administered to measure long-term vocabulary retention. In contrast to the short-term results, a two-way repeated measures ANOVA showed that long-term vocabulary retention in the translation group was significantly greater than in the copy group. The findings suggest that although sentence translation is rather challenging for low-level learners, it may facilitate long-term retention of new vocabulary given the more elaborate and deeper processing the task entails.

  • PDF
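The abstract above states that immediate-test results for three groups were analyzed by one-way ANOVA. As a rough illustration of what that statistic computes, the sketch below derives the F value from between-group and within-group variability. The scores are hypothetical and are not data from the study, and the function name `one_way_anova_f` is an illustrative choice.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA."""
    all_scores = [x for g in groups for x in g]
    grand_mean = mean(all_scores)
    # Variability of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variability of individual scores around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between = len(groups) - 1
    df_within = len(all_scores) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical vocabulary recall scores for three groups of three learners.
control_group = [4, 5, 6]
translation_group = [6, 7, 8]
copy_group = [8, 9, 10]

f_stat, df_b, df_w = one_way_anova_f([control_group, translation_group, copy_group])
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")  # prints F(2, 6) = 12.00
```

A large F value means the group means differ by more than the spread within each group would suggest. Turning F into a p-value requires the F distribution, which this sketch leaves out.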

대화체 문장 번역을 위한 토큰기반 변환중심 한일 기계번역 (A Token Based Transfer Driven Korean-Japanese Machine Translation for Translating the Spoken Sentences)

  • 양승원
    • 한국산업정보학회논문지 / Vol. 4, No. 4 / pp.40-46 / 1999
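  • This paper introduces a Korean-Japanese machine translation system implemented as one module of a spoken language automatic interpretation system. The system is based on transfer-driven machine translation (TDMT), which in turn builds on example-based machine translation (EBMT). The system defines and uses a new translation unit called the token (TOKEN). Translating at the token level handles the highly irregular nature of Korean sentences and improves translation quality. In the syntactic analysis stage, the system performs only shallow parsing, generating a dependency tree just detailed enough to select target words, which avoids unnecessary effort. The bilingual dictionary was built from a corpus extracted from the speech database collected by the Electronics and Telecommunications Research Institute (ETRI). The implemented system was tested on sentences from 600 utterances collected in the travel planning domain and achieved a success rate of 87% in a restricted environment and 71% in an unrestricted environment.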

  • PDF

GCC2Verilog Compiler Toolset for Complete Translation of C Programming Language into Verilog HDL

  • Huong, Giang Nguyen Thi;Kim, Seon-Wook
    • ETRI Journal / Vol. 33, No. 5 / pp.731-740 / 2011
  • Reconfigurable computing using a field-programmable gate array (FPGA) device has become a promising solution in system design because of its power efficiency and design flexibility. To bring the benefits of FPGAs to many application programmers, there has been intensive research on automatic translation from high-level programming languages (HLLs) such as C and C++ into hardware. However, the large gap in syntax and semantics between hardware and software programming makes the translation challenging. In this paper, we introduce a new approach to the translation that uses the widely used GCC compiler. By simply adding a hardware description language (HDL) backend to the existing state-of-the-art compiler, we could minimize the effort needed to implement the translator while supporting the full features of the HLL in the HLL-to-HDL translation and providing high performance. Our translator, called GCC2Verilog, was implemented as a GCC cross compiler targeting FPGAs instead of microprocessor architectures. Our experiment shows that we could achieve a speedup of up to 34 times, and 17 times on average, with 4-port memory over PICO microprocessor execution in selected EEMBC benchmarks.