DOI QR코드

DOI QR Code

Ranking Translation Word Selection Using a Bilingual Dictionary and WordNet

  • Kim, Kweon-Yang (School of Computer Engineering, Kyungik University) ;
  • Park, Se-Young (Dept. of Computer Engineering, Kyungpook National University)
  • Published : 2006.02.01

Abstract

This parer presents a method of ranking translation word selection for Korean verbs based on lexical knowledge contained in a bilingual Korean-English dictionary and WordNet that are easily obtainable knowledge resources. We focus on deciding which translation of the target word is the most appropriate using the measure of semantic relatedness through the 45 extended relations between possible translations of target word and some indicative clue words that play a role of predicate-arguments in source language text. In order to reduce the weight of application of possibly unwanted senses, we rank the possible word senses for each translation word by measuring semantic similarity between the translation word and its near synonyms. We report an average accuracy of $51\%$ with ten Korean ambiguous verbs. The evaluation suggests that our approach outperforms the default baseline performance and previous works.

Keywords

References

  1. Banerjee, S. and Pedersen, T., 'Extended. Gloss Overlaps as a Measure of Semantic Relatedness,' Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, pp. 805-810, 2003
  2. Brown, P., Cocke, J, Pietra, V., Peitra, S., Jelinek, F., Lafferty, J., Mercer, R. and Roosin, P., 'A Statistical Approach to Machine Translation, Computational Linguistics,' Vol. 16, No.2, pp. 79-85, 1990
  3. Dagan, I. And Itai, A, ''Word Sense Disambiguation Using a Second Language Monolingual Corpus,' Computational Linguistics, Vol. 20, No. 4, pp. 563-596, 1994
  4. Fellbaum, c., WordNet: An Electronic Lexical Database, MIT Press, 1998
  5. Inkpen, D. and Hirst, G., 'Automatic' Sense Disambiguation of the Near-Synonyms in a Dictionary Entry,' Proceedings, Fourth Conference on Intelligent Text Processing and Computational Linguistics, pp. 258-267, 2003 https://doi.org/10.1007/3-540-36456-0_25
  6. Jiang, J. and Conrath, D., 'Semantic Similarity based on Corpus Statistics and Lexicon Taxqnomy,' Proceedings on International Conference on Research in Computational Linguistics, pp. 19-33, 1997
  7. Koehn, P. and Knight K., 'Knowledge Sources for Word-Level Translation Models,' Empirical Methods in Natural Language Processing Conference, pp. 27-35, 2001
  8. Lesk, M., 'Automatic Sense Disambiguation Using Machine Readable Dictionaries: how to tell a pine code from an ice cream cone,' Proceedings. of the Fifth Annual International Conference on Systems Documentations, pp. 24-26, 1986
  9. Li, H. and Li, C., 'Word Translation Disambiguation Using Bilingual Boostrapping,' Computational Linguistics, Vol. 30, No.1, pp. 1-22, 2004 https://doi.org/10.1162/089120104773633367
  10. Patwardhan, S. and Perdersen, T., The cpan wordnet::similarity package, http://search.cpan.org /~sid/WordNet-Similarity-0.06/, 2005