Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2012.19B.3.183

Retrieval Model Based on Word Translation Probabilities and the Degree of Association of Query Concept  

Kim, Jun-Gil (전북대학교 컴퓨터공학과)
Lee, Kyung-Soon (전북대학교 컴퓨터공학부 영상정보신기술연구센터)
Abstract
One of the major challenge for retrieval performance is the word mismatch between user's queries and documents in information retrieval. To solve the word mismatch problem, we propose a retrieval model based on the degree of association of query concept and word translation probabilities in translation-based model. The word translation probabilities are calculated based on the set of a sentence and its succeeding sentence pair. To validate the proposed method, we experimented on TREC AP test collection. The experimental results show that the proposed model achieved significant improvement over the language model and outperformed translation-based language model.
Keywords
Information Retrieval; Word Relationships; Query Concept; Word Translation Probabilities; Translation-based Language Model;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 V. Murdock and W. B. Croft, "A Translation Model for sentence retrieval," Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, pp.684-691, 2005.
2 J. Jeon, W. B. Croft and J. H. Lee, "Finding Similar Questions in Large Question and Answer Archives," Proceedings of the 14th ACM CIKM Conference, pp.84-90, 2005.
3 J. M. Ponte and W. B. Croft, "A language modeling approach to information retrieval," Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp.275-281, 1998.
4 R. Jin, A. G. Hauptmann, and C. Zhai, "Title language model for information retrieval," Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp.42-48, 2002.
5 X. Xue, J. Jeon and W. B. Croft, "Retrieval Models for Question and Answer Archives," Proceedings of the 31st annual international ACM SIGIR conference, pp.475-482, 2008.
6 김설영, 이경순, "질문대답 아카이브에서 어휘 연관성을 이용한 질문 분류," 정보처리학회논문지B, 제17권 제4호, pp.327-332, 2010.   과학기술학회마을   DOI
7 GIZA tool. http://code.google.com/p/giza-pp/
8 F. J and Och, H. Ney. "A Systematic Comparison of Various Statistical Alignment Models," Proceedings of the Computational Linguistics, Vol.29, No.1, pp.19-51, 2003.   DOI   ScienceOn
9 P. F. Brown, V. J. D. Pietra, S. A. D. Pietra, and R. L. Mercer, "The mathematics of statistical machine translation: parameter estimation," Computational Linguistics 19(2), pp.263-311, 1993.
10 A. Berger and J. Lafferty, "Information retrieval as statistical translation," Proceedings of the 22nd annual international ACM SIGIR conference, pp.222-229, Aug., 1999.