• Title/Summary/Keyword: Rule-Based Translation

Search Result 33, Processing Time 0.025 seconds

Study of Contents Localization Case on the Game 'Paper, Please': Based on the Korean and North Korean Translations (게임 'Paper, Please'의 번역을 통한 콘텐츠 현지화 사례 연구: 한국어와 문화어 번역의 차이를 중심으로)

  • Won, Ho-Hyeuk;Gu, Bon-Hyeok;Kim, Hyoung-Youb
    • Journal of Korea Game Society
    • /
    • v.19 no.2
    • /
    • pp.145-160
    • /
    • 2019
  • In this research, we attempt to suggest the differences between Korean translation and the North Korean translation of the game 'Paper, Please'; moreover, we will consider about the effect of language and image on localization through this. North Korean language and cultural contents in 'Paper, Please' are evaluated well by many people that they show real life of North Korea even though there are some errors like loanword translations and using anachronic symbol, 'Kaksital' as secret organization. Through the research, we could know that people could concentrate on cultural contents by images and motives without critical errors so have fun.

Spoken-to-written text conversion for enhancement of Korean-English readability and machine translation

  • HyunJung Choi;Muyeol Choi;Seonhui Kim;Yohan Lim;Minkyu Lee;Seung Yun;Donghyun Kim;Sang Hun Kim
    • ETRI Journal
    • /
    • v.46 no.1
    • /
    • pp.127-136
    • /
    • 2024
  • The Korean language has written (formal) and spoken (phonetic) forms that differ in their application, which can lead to confusion, especially when dealing with numbers and embedded Western words and phrases. This fact makes it difficult to automate Korean speech recognition models due to the need for a complete transcription training dataset. Because such datasets are frequently constructed using broadcast audio and their accompanying transcriptions, they do not follow a discrete rule-based matching pattern. Furthermore, these mismatches are exacerbated over time due to changing tacit policies. To mitigate this problem, we introduce a data-driven Korean spoken-to-written transcription conversion technique that enhances the automatic conversion of numbers and Western phrases to improve automatic translation model performance.

Syntactic Rule Compiler in Rule-based English-Korean Machine Translation (규칙 기반 영한 기계번역에서의 구문 규칙 컴파일러)

  • Kim, Sung-Dong
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2013.11a
    • /
    • pp.1315-1317
    • /
    • 2013
  • 규칙 기반의 영한 기계번역 시스템의 구문 분석 시스템은 영어의 구문 구조를 기술하는 규칙 부분과 규칙을 적용하여 차트 파싱을 수행하는 실행 부분으로 구성된다. 구문 규칙은 문맥 자유 문법의 형식으로 기술되는데, 기술된 구문 규칙을 적용하여 파싱을 실행하는 실행 부분은 C 언어 함수로 표현되므로, 구문 규칙을 C 언어 함수로 변환해야 한다. 본 논문에서는 문맥 자유 문법 형식으로 기술된 구문 규칙을 C 언어 함수로 변환하는 도구인 구문 규칙 컴파일러를 개발하였다. 구문 규칙 컴파일러는 자동적으로 구문 규칙을 C 언어 함수로 변환함으로써 영한 기계번역 시스템의 성능 개선 과정에서 빈번하게 발생하는 구문 규칙의 생성과 수정을 용이하게 하여 번역 성능을 개선하는 작업을 지원한다.

Automatic translation system for hangul's romanization Based on the World Wide Web (웹 기반하의 국어의 로마자 전사 표기 자동 변환 시스템)

  • 김홍섭
    • Journal of the Korea Society of Computer and Information
    • /
    • v.7 no.4
    • /
    • pp.108-114
    • /
    • 2002
  • After automatic translation system for hangul's romanization based on the World Wide Web converting korean-word, sentence, document to Transliteration letters by applying algorithm based phonological principles. even though a user do not know the basic principles of the usage of Korean-to-Romanization notations, It refers to corresponding character table that has been currently adopted the authority's standard proposition for Korean-to-Romanization notation rule concurrently, add to make possible to convert a machinized code as well. It provides font for toggling Korean-English mode, insert-edit mode by assigning ASCII codes and Unicode are hardly used to them. This program could be made in C++ progamming language and Unified Modeling Language to implement various font. font-expanding and condensing. alternative printing.

  • PDF

Design of automatic translation system for hangul's romanization Based on the World Wide Web (웹 기반하의 국어의 로마자 표기 자동 변환 시스템 설계)

  • 김홍섭
    • Journal of the Korea Society of Computer and Information
    • /
    • v.6 no.4
    • /
    • pp.6-11
    • /
    • 2001
  • After automatic translation system for hangul's romanization based on the World Wide Web converting korean-word. sentence, document to Transliteration letters by applying algorithm based phonological principles. even though a user do not know the basic principles of the usage of Korean-to-Romanization notations. It refers to corresponding character table that has been currently adopted the authority's standard proposition for Korean-to-Romanization notation rule concurrently, add to make possible to convert a machinized code as well. It Provides font for toggling Korean-English mode, insert-edit mode by assigning ASCII codes are hardly used to them. This program could be made in C++ programming language and Unified Modeling Language to implement various font. font-expanding and condensing, alternative printing.

  • PDF

An Analysis System of Prepositional Phrases in English-to-Korean Machine Translation (영한 기계번역에서 전치사구를 해석하는 시스템)

  • Gang, Won-Seok
    • The Transactions of the Korea Information Processing Society
    • /
    • v.3 no.7
    • /
    • pp.1792-1802
    • /
    • 1996
  • The analysis of prepositional phrases in English-to Korean machine translation has problem on the PP-attachment resolution, semantic analysis, and acquisition of information. This paper presents an analysis system for prepositional phrases, which solves the problem. The analysis system consists of the PP-attachment resolution hybrid system, semantic analysis system, and semantic feature generator that automatically generates input information. It provides objectiveness in analyzing prepositional phrases with the automatic generation of semantic features. The semantic analysis system enables to generate natural Korean expressions through selection semantic roles of prepositional phrases. The PP-attachment resolution hybrid system has the merit of the rule-based and neural network-based method.

  • PDF

An Evolution of Cellular Automata Neural Systems using DNA Coding Method (DNA 코딩방법을 이용한 셀룰라 오토마타 신경망의 진화)

  • Lee, Dong-Wook;Sim, Kwee-Bo
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.36S no.12
    • /
    • pp.10-19
    • /
    • 1999
  • Cellular Automata Neural Systems(CANS) are neural networks based on biological development and evolution. Each neuron of CANS has local connection and acts as a form of pulse according to the dynamics of the chaotic neuron. CANS are generated from initial cells according to the CA rule. In the previous study, to obtain the useful ability of CANS, we make the pattern of initial cells evolve. However, it is impossible to represent all solution space, so we propose an evolving method of CA rule to overcome this defect in this paper. DNA coding has the redundancy and overlapping of gene and is apt for the representation of the rule. In this paper, we show the general expression of CA rule and propose translation method from DNA code to CA rule. The effectiveness of the proposed scheme was verified by applying it to the navigation problem of autonomous mobile robot.

  • PDF

Using Machine Translation Agent Based on Ontology Study of Real Translation (온톨로지 기반의 지능형 번역 에이전트를 이용한 실시간 번역 연구)

  • Kim Su-Gyeong;Kim Gyeong-A;An Gi-Hong
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2006.06a
    • /
    • pp.229-233
    • /
    • 2006
  • 기계번역(Machine Translaton, MT), 다국어 정보 검색, 의미 정보 검색 등에 대한 연구는 시소러스, 지식베이스, 사전 검색, 의미망, 코퍼스등과 같은 다양한 방법으로 이루어지고 있다. 시맨틱 웹이 등장과 시맨틱 웹 기반 기술의 발전에 따라 위 연구들을 시맨틱 웹에 적용시킬 필요성도 제안되었다. 특히 한국어 시소러스, 워드넷(WordNet), 전자 세종 사전, 가도까와(Kadokawa) 시소러스와 같은 지식베이스가 개발되었으나 활용 분야에 따라 그 구축 방법론이 다르게 적용되어, 위 연구에 효과적으로 통용될 수 있는 지식베이스는 실질적으로 구축되지 못한 실정이다. 따라서 본 연구에서는 세종 사전과 가도까와 시소러스, 한/일 기계 번역 사전 그리고 전문 용어 사전을 기반으로 한국어와 일본어 지식베이스를 위한 사전 온톨로지 서버를 정의하여 의미 정보를 구성하고, Semantic Web Rule Markup Language (이하 SWRL)을 이용해 구문 정보 규칙을 정의한다. 그리고 SWRL 기반 정방향 추론 엔진을 이용하여 번역에 필요한 추론 엔진을 구성하고 문장 구문형성 규칙 추론 엔진을 통해 사용자에게 한국어와 일본어의 문장 구성 변환을 제공한다. 본 연구는 현재 기계 번역이 갖고 있는 다의성, 술부 어순의 차이, 경어체 등 아직 해결해야 할 많은 부분들에 대한 해결 방안으로서 시맨틱 웹 기반 기술과의 활용방안을 제시하고자 한다.

  • PDF

Conservation of cis-Regulatory Element Controlling Timely Translation in the 3'-UTR of Selected Mammalian Maternal Transcripts

  • Lee, Hyun-Joo;Lim, Yoon-Ki;Chang, Sang-Ho;Min, Kwan-Sik;Han, Ching-Tack;Hwang, Sue-Yun
    • Genomics & Informatics
    • /
    • v.5 no.4
    • /
    • pp.174-178
    • /
    • 2007
  • The earliest stages of mammalian embryogenesis are governed by the activity of maternally inherited transcripts and proteins. Cytoplasmic polyadenylation of selected maternal mRNA has been reported to be a major control mechanism of delayed translation during preimplantation embryogenesis in mice. The presence of cis-elements required for cytoplasmic polyadenylation (e.g., CPE) can serve as a useful tag in the screening of maternal genes partaking in key functions in the transcriptionally dormant egg and early embryo. However, due to its relative simplicity, UA-rich sequences satisfying the canonical rule of known CPE consensus sequences are often found in the 3'-UTR of maternal transcripts that do not actually undergo cytoplasmic polyadenylation. In this study, we developed a method to confirm the validity of candidate CPE sequences in a given gene by a multiplex comparison of 3'-UTR sequences between mammalian homologs. We found that genes undergoing cytoplasmic polyadenylation tend to create a conserved block around the CPE, while CPE-like sequences in the 3'-UTR of genes lacking cytoplasmic polyadenylation do not exhibit such conservation between species. Through this cross-species comparison, we also identified an alternative CPE in the 3'-UTR of tissue-type plasminogen activator (tPA), which is more likely to serve as a functional element. We suggest that verification of CPEs based on sequence conservation can provide a convenient tool for mass screening of factors governing the earliest processes of mammalian embryogenesis.

Sentence Type Identification in Korean Applications to Korean-Sign Language Translation and Korean Speech Synthesis (한국어 문장 유형의 자동 분류 한국어-수화 변환 및 한국어 음성 합성에의 응용)

  • Chung, Jin-Woo;Lee, Ho-Joon;Park, Jong-C.
    • Journal of the HCI Society of Korea
    • /
    • v.5 no.1
    • /
    • pp.25-35
    • /
    • 2010
  • This paper proposes a method of automatically identifying sentence types in Korean and improving naturalness in sign language generation and speech synthesis using the identified sentence type information. In Korean, sentences are usually categorized into five types: declarative, imperative, propositive, interrogative, and exclamatory. However, it is also known that these types are quite ambiguous to identify in dialogues. In this paper, we present additional morphological and syntactic clues for the sentence type and propose a rule-based procedure for identifying the sentence type using these clues. The experimental results show that our method gives a reasonable performance. We also describe how the sentence type is used to generate non-manual signals in Korean-Korean sign language translation and appropriate intonation in Korean speech synthesis. Since the method of using sentence type information in speech synthesis and sign language generation is not much studied previously, it is anticipated that our method will contribute to research on generating more natural speech and sign language expressions.

  • PDF