Browse > Article
http://dx.doi.org/10.3745/KIPSTB.2008.15-B.2.171

The Refinement Effect of Foreign Word Transliteration Query on Meta Search  

Lee, Jae-Sung (충북대학교 컴퓨터교육과)
Abstract
Foreign word transliterations are not consistently used in documents, which hinders retrieving some important relevant documents in exact term matching information retrieval systems. In this paper, a meta search method is proposed, which expands and refines relevant variant queries from an original input foreign word transliteration query to retrieve the more relevant documents. The method firstly expands a transliteration query to the variants using a statistical method. Secondly the method selects the valid variants: it queries each variant to the retrieval systems beforehand and checks the validity of each variant by counting the number of appearance of the variant in the retrieved document and calculating the similarity of the context of the variant. Experiment result showed that querying with the variants produced at the first step, which is a base method of the test, performed 38% in average F measure, and querying with the refined variants at the second step, which is a proposed method, significantly improved the performance to 81% in average F measure.
Keywords
Query Expansion; Foreign Word Transliteration Query; Query Refinement; Meta Search; Information Retrieval;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Salton, G., “Automatic Text Processing - The transformation, analysis, and retrieval of information by computer,” pp.318-319, Addison Wesley Publishing Company, 1989
2 Google, http://www.google.co.kr/
3 Naver, http://www.naver.com/
4 Manning, C., H. Schutze, “Foundations of Statistical Natural Language Processing,” pp.268-269, The MIT Press, 1999
5 Jeong, K., S. H. Myaeng, J. S. Lee and K.-S. Choi, “Automatic Identification and Back-Transliteration of Foreign Words for Information Retrieval,” Information Processing and Management, Vol.35, No.4, pp.523-540, 1999   DOI   ScienceOn
6 이현복, “외래어 표기법 개정 시안의 문제점,” 어학연구 15.1, pp.39-59, 1979
7 Lee, J. S., K. Choi, “English to Korean Statistical Transliteration for Information Retrieval,” Computer Processing of Oriental Languages, Vol.12, No.1, pp. 17-37, 1998
8 이재성, “다국어 정보검색을 위한 영한 음차 표기 및 복원 모델,” 박사학위논문, 한국과학기술원, 1999
9 이희승, 안병주, “한글 맞춤법 강의-고친판,” 신구문화사, 1994
10 강병주, 이재성, 최기선, “외국어 음차 표기의 음성적 유사 도 비교 알고리즘,” 정보과학회 논문지(B), 제26권, 제10호, pp.1237-1246, 1999
11 강병주, “한국어 정보검색에서 외래어와 영어로 인한 단어 불일치문제의 해결,” 박사학위논문, 한국과학기술원, 2001
12 Cheon, S. M. “Construction of English Loanwords Contents for the Development of Educational Tools: a Step Towards the Prosperity of CALL Courseware,” Ph. D dissertation. Hankuk University of Foreign Studies, 2005
13 Mettler, M. “TRW Japanese Fast Data Finder,” TIPSTER Text Program Phase I Proc., Sep., pp.113-116, 1993
14 김병혜, “영어단어의 알파벳표기로부터 한글표기로의 자동변환,” 석사학위논문, 서강대학교 공공정책대학원, 1991
15 이재성, “효과적인 외래어 이형태 생성을 위한 확률 문맥 의존 치환 방법,” 한국 콘텐츠학회논문지, 제7권, 제2호, pp. 73-83, 2007   과학기술학회마을
16 Aslam, J. A., M. Montague, “Models for metasearch,” Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 276-284, New Orleans, USA, 2001   DOI
17 SERI/KIST, “지능형 정보처리기의 개발에 관한 연구,” 제1차년도 최종 보고서, 과학기술처, 1995