Browse > Article
http://dx.doi.org/10.5916/jkosme.2012.36.5.683

A Bidirectional Korean-Japanese Statistical Machine Translation System by Using MOSES  

Lee, Kong-Joo (충남대학교 정보통신공학과)
Lee, Song-Wook (한국교통대학교 컴퓨터정보공학과)
Kim, Jee-Eun (한국외국어대학교 영어학과)
Abstract
Recently, statistical machine translation (SMT) has received many attention with ease of its implementation and maintenance. The goal of our works is to build bidirectional Korean-Japanese SMT system by using MOSES [1] system. We use Korean-Japanese bilingual corpus which is aligned per sentence to train the translation model and use a large raw corpus in each language to train each language model. The proposed system shows results comparable to those of a rule-based machine translation system. Most of errors are caused by noises occurred in each processing stage.
Keywords
machine translation; statistical machine translation; MOSES;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Xiaoyi Ma, "Champollion: A robust parallel text sentence aligner", Proceedings of the Fifth International Conference on Language Resources and Evaluation, Genova, Italy, 2006.
2 C. Bannard and C.B. Callison, "Paraphrasing with Bilingual Parallel Corpora", Proceedings of The Association for Computational Linguistics 2005, pp. 597-604, 2005.
3 Y.S. Hwang, Y.K. Kim, and S.K. Park, "Paraphrasing depending on bilingual context toward generalization of translation knowledge", Proceedings of the Third Int'l Joint Conf. on Natural Language Processing, pp. 327-334, 2008.
4 Daniel Marcu and William Wong, "A phrase-Based, joint probability model for statistical machine translation", Proceedings of Empirical Methods on Natural Language Processing 2002, pp. 133-139.
5 Nicola Ueffing and Hermann Ney, "Using POS information for statistical machine translation into morphologically rich languages", Proceedings of The European Chapter of the Association for Computational Linguistics 2003.
6 Philipp Koehn, Franz Josef Och and Daniel Marcu, "Statistical phrase-based translation", Proceedings of Human Language Technologiesthe North American Chapter of the Association for Computational Linguistics 2003, pp. 347-354.
7 Eleftherios Avramidis and Philipp Koehn, "Enriching morphologically poor languages for statistical machine translation", proceedings of Association for Computational Linguistics 2008, pp. 763-770.
8 http://www.statmt.org/moses/
9 P. Brown, S. Della Pietra, V. Della Pietra, and R. Mercer, "The mathematics of statistical machine translation: Parameter Estimation", Computational Linguistics, vol. 19, no. 2, pp. 263-311, 1991.
10 Yun Kim and et al, "The trends of machine translation techonology and case study", Electronics and Telecommunications Trends, vol. 23, no. 1, 2008.
11 U. Germann, "Greedy decoding for statistical machine translation in almost linear time", Proceedings of HLT-NAACL pp. 1-8, 2003.
12 F. J. Och and H. Ney, "A systematic comparison of various statistical alignment models", Computational Linguistics, vol. 29, no. 1, pp. 19-51, 2003.   DOI   ScienceOn
13 K. Yamada, and K. Knight, "A syntax-based statistical translation model", Proceedings of The Association for Computational Linguistics 2001, pp. 523-530, 2001.
14 The 21st Century Sejong Project, http://sejong.or.kr/sejong_kr/index.html, 2006.
15 R. Rosenfeld, "Two decades of statistical language modeling: where do we go from here?", Proceedings of IEEE, vol. 88, no. 8, pp. 1270-1278, 2000.   DOI   ScienceOn