• 제목/요약/키워드: text translation

검색결과 148건 처리시간 0.026초

결속구조 비교와 번역 - 중한텍스트 대조분석을 중심으로

  • 박은숙
    • 중국학논총
    • /
    • 제71호
    • /
    • pp.107-129
    • /
    • 2021
  • 近几十年来, 翻译学与语言学, 社会学, 文化学, 哲学等学科相结合, 取得了很大的发展。特别是语言学和翻译学一直有着密切的관关系。自上世纪六十年代起, 语言学家们开始逐步突破以句子为最高语言单位的研究范围, 将视角扩大到语篇, "篇章语言学"自此兴起。"衔接理论"作为语言学或翻译学的一个重要课题, 早已在国内外语言学界得到广泛而深入的研究。但是与语言对比研究中的众多课题一样, 两个语言在篇章衔接手段上的对比还鲜有人问津。因此本论文从篇章语言学的角度出发, 将Halliday和Hason提出的衔接(cohesion)理论运用于中韩翻译中, 进行了对比分析和研究。还讨论中韩语篇对比分析对中韩翻译实践和研究带来的影响。第一章是绪论, 介绍了篇章语言学的兴起和国内外代表学者。第二章, 把衔接机制分为衔接的定义和衔接的分类两小节, 了解中韩语篇的衔接机制。第三章, 把衔接理论运用于新闻中韩语篇中, 对两个语篇的衔接机制进行对比分析, 实质上浅谈衔接理在中韩语篇翻译中的应用与实践。

Detecting and Segmenting Text from Images for a Mobile Translator System

  • Chalidabhongse, Thanarat H.;Jeeraboon, Poonsak
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2004년도 ICCAS
    • /
    • pp.875-878
    • /
    • 2004
  • Researching in text detection and segmentation has been done for a long period in the OCR area. However, there is some other area that the text detection and segmentation from images can be very useful. In this report, we first propose the design of a mobile translator system which helps non-native speakers to understand the foreign language using ubiquitous mobile network and camera mobile phones. The main focus of the paper will be the algorithm in detecting and segmenting texts embedded in the natural scenes from taken images. The image, which is captured by a camera mobile phone, is transmitted to a translator server. It is initially passed through some preprocessing processes to smooth the image as well as suppress noises. A threshold is applied to binarize the image. Afterward, an edge detection algorithm and connected component analysis are performed on the filtered image to find edges and segment the components in the image. Finally, the pre-defined layout relation constraints are utilized in order to decide which components likely to be texts in the image. A preliminary experiment was done and the system yielded a recognition rate of 94.44% on a set of 36 various natural scene images that contain texts.

  • PDF

의단(醫斷)의 번역(飜譯)에 대한 고찰(考察) (A Study on translation of Idan)

  • 김태영;김석영;강구현
    • 대한상한금궤의학회지
    • /
    • 제4권1호
    • /
    • pp.93-98
    • /
    • 2012
  • Objective : to increase understanding of readers of Idan with translating in compliance with and restraining spoken language Method : referred to Chinese ancient language grammar and Korean standard language grammar Results & Conclusions : 1. spaced the original text by adequate syntax 2. corrected typo in typed text under the original text 3. translated in compliance with and restraining spoken language 4. footnoted in reference to fables and phrases.

딥러닝 기반 기계번역 개념을 활용한 Text-to-Ontology 변환 사례 (A case study on Text-to-Ontology transformation on the basis of neural translation)

  • 신유진;이지항
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2021년도 추계학술발표대회
    • /
    • pp.891-894
    • /
    • 2021
  • 온톨로지(Ontology)는 사람과 컴퓨터, 또는 컴퓨터 간의 개념 및 개념 표현을 공유하기 위한 개념화의 명시적 규약을 의미한다. 기존의 온톨로지 생성은 전문가에 의한 수작업에 의존되어 비용과 시간이 많이 드는 한계가 있다. 이에 본 논문에서는 딥러닝(Deep learning)기반의 기계번역 개념을 적용한 사례를 활용하여, 수작업의 의존성이 감소한 방법으로 텍스트로부터 온톨로지를 생성하는 방법을 구현하였다. 특히 기존 연구에서 제안한, 딥러닝을 이용해 텍스트로부터 지식 표현 시퀀스를 추출한 정보를 활용하여, 지식 표현 구조를 온톨로지로 변환하고 지식 베이스로 확장하는 과정을 통해 자동화 된 Text-to-Ontology 변환 방법론을 제안하고자 한다.

동화를 활용한 《중국어강독》 수업 방안 연구 - 대학의 경우를 중심으로

  • 황지유
    • 중국학논총
    • /
    • 제61호
    • /
    • pp.255-277
    • /
    • 2019
  • This paper presented a course plan based on the ideas I gained from conducting a lecture on Chinese language for students in the second semester of the Chinese language department at a four-year university. In the paper, we sought to deviate from the traditional grammar-translation teaching style and find ways for students to enjoy learning without difficulty in all areas by using the 'total language approach' such as writing, speaking, listening and reading through reading skills. Therefore, we discussed the educational significance and expression of the 'Chinese Languages' class, and introduced the class stages and methods of progress. In other words, they suggested introduction of text plots, explanation of vocabulary and grammar, presentation of original text, questions about text, arrangement of words, ordering sentences to fit the plot, and understanding the plot while looking at the picture.

『상한론(傷寒論)』 영역본과 『동의보감(東醫寶鑑)』 영역본 잡병편 '한(寒)'문의 비교 연구 (A Study on the English Translations of Shanghanlun (Treatise on Cold Damage) and the Cold Pathogen Chapter of Donguibogam)

  • 김도훈;김동율;정지훈
    • 한국의사학회지
    • /
    • 제30권1호
    • /
    • pp.33-41
    • /
    • 2017
  • This study utilized Corpus-based Analysis process to compare the Cold Pathogen chapter in the 'English version of "Donguibogam"' to the 'English version of the "Shanghanlun"' translated by 罗希文 (Luo xi wen). Results of the linguistic analysis indicate that TTR, a ratio of number of types to number of tokens in the English version of "Shanghanlun" was 5.92% while TTR in the Cold pathogen chapter of English version of "Donguibogam" was 6.01%. It was also noted that the types of words frequently appearing in the two publications were the scientific name of medicinal herbs; the method of producing the herbal prescription (including terminology representing weights and measures); and Chinese descriptions of concepts considered important in both Korean and Chinese medicinal practices. Finally, it was possible to find points of comparison in naming of symptoms, diagnosis, prescriptions, and respective names of six meridians. Though the language difference is minimal, the vocabulary found in the Cold Pathogen chapter of "Donguibogam" was more diverse than Luo's translation of "Sanghanlun". In general, literal translation in keeping with the sense of original text was better performed in Luo's translation of the "Sanghanlun" whereas the English version of the Cold Pathogen chapter in the "Donguibogam" was more of a "free" translation.

Spoken-to-written text conversion for enhancement of Korean-English readability and machine translation

  • HyunJung Choi;Muyeol Choi;Seonhui Kim;Yohan Lim;Minkyu Lee;Seung Yun;Donghyun Kim;Sang Hun Kim
    • ETRI Journal
    • /
    • 제46권1호
    • /
    • pp.127-136
    • /
    • 2024
  • The Korean language has written (formal) and spoken (phonetic) forms that differ in their application, which can lead to confusion, especially when dealing with numbers and embedded Western words and phrases. This fact makes it difficult to automate Korean speech recognition models due to the need for a complete transcription training dataset. Because such datasets are frequently constructed using broadcast audio and their accompanying transcriptions, they do not follow a discrete rule-based matching pattern. Furthermore, these mismatches are exacerbated over time due to changing tacit policies. To mitigate this problem, we introduce a data-driven Korean spoken-to-written transcription conversion technique that enhances the automatic conversion of numbers and Western phrases to improve automatic translation model performance.

Translated Picture Books in Korea from 1969 to 2012

  • Ko, Seonju
    • Child Studies in Asia-Pacific Contexts
    • /
    • 제4권1호
    • /
    • pp.65-76
    • /
    • 2014
  • This study aims to explore the characteristics of translated picture books in South Korea and their cultural meanings over a five-decade period. This time can broadly be divided into three periods, being the Settlement Period (pre-1990), the Flourishing Years (1991-2000) and Globalization (post-2001). During the Settlement Period, picture books in South Korea were derived mainly from Japan and America and tended to be informational in nature or based on folk tales. These were translated into Korean to meet the public's curiosity for foreign cultures or for scientific information. The Flourishing Years were characterised by the availability of picture books on a wide variety on themes and forms from all over the world. In this period, the translation of books into Korean focused on a literal rendition of the meanings and sounds of names from the original text. There was also a proliferation of audiotapes, videos and TV programs based on famous picture books. In the current period of Globalization, Korean publishers, who have built confidence through studying foreign picture books over time, have increased efforts to produce their own picture books and export them abroad.

Multilingual Automatic Translation Based on UNL: A Case Study for the Vietnamese Language

  • Thuyen, Phan Thi Le;Hung, Vo Trung
    • IEIE Transactions on Smart Processing and Computing
    • /
    • 제5권2호
    • /
    • pp.77-84
    • /
    • 2016
  • In the field of natural language processing, Universal Networking Language (UNL) has been used by various researchers as an inter-lingual approach to automatic machine translation. The UNL system consists of two main components, namely, EnConverter for converting text from a source language to UNL, and DeConverter for converting from UNL to a target language. Currently, many projects are researching how to apply UNL to different languages. In this paper, we introduce the tools that are UNL's applications and discuss how to reuse them to encode a Vietnamese sentence into UNL expressions and decode UNL expressions into a Vietnamese sentence. The testing was done with about 1,000 Vietnamese sentences (a dictionary that includes 4573 entries and 3161 rules). In addition, we compare the proportion of sentences translated based on a direct method (Google Translator) and another one based on UNL.

오픈소스를 이용한 문자/음성 인식 및 번역 앱 개발 (Text/Voice Recognition & Translation Application Development Using Open-Source)

  • 윤태진;서효종;김도헌
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회 2017년도 제56차 하계학술대회논문집 25권2호
    • /
    • pp.425-426
    • /
    • 2017
  • 본 논문에서는 Google에서 지원하는 오픈소스인 Tesseract-OCR을 이용한 문자/음성 인식 및 번역 앱에 대해 제안한다. 최근 한국어를 포함한 외국어 인식과 번역기능을 이용한 다양한 스마트폰 앱이 개발되어 여행에 필수품으로 자리잡고 있다. 스마트폰의 카메라기능을 이용하여 촬영한 영상을 인식률을 높이도록 처리하고, Crop기능을 넣어 부분 인식기능을 지원하며, Tesseract-OCR의 train data를 보완하여 인식률을 높이고, Google 음성인식 API를 이용한 음성인식 기능을 통해 인식된 유사한 문장들을 선택하도록 하고, 이를 번역하고 보여주도록 개발하였다. 번역 기능은 번역대상 언어와 번역할 언어를 선택할 수 있고 기본적으로 영어, 한국어, 일본어, 중국어로 번역이 가능하다. 이 기능을 이용하여 차량번호 인식, 사진에 포함된 글자를 통한 검색 등 다양한 응용분야에 맞게 앱을 개발할 수 있다.

  • PDF